213 lines
8.5 KiB
Text
213 lines
8.5 KiB
Text
|
Linux* Base Driver for the Intel(R) PRO/10GbE Family of Adapters
|
||
|
================================================================
|
||
|
|
||
|
November 17, 2004
|
||
|
|
||
|
|
||
|
Contents
|
||
|
========
|
||
|
|
||
|
- In This Release
|
||
|
- Identifying Your Adapter
|
||
|
- Command Line Parameters
|
||
|
- Improving Performance
|
||
|
- Support
|
||
|
|
||
|
|
||
|
In This Release
|
||
|
===============
|
||
|
|
||
|
This file describes the Linux* Base Driver for the Intel(R) PRO/10GbE Family
|
||
|
of Adapters, version 1.0.x.
|
||
|
|
||
|
For questions related to hardware requirements, refer to the documentation
|
||
|
supplied with your Intel PRO/10GbE adapter. All hardware requirements listed
|
||
|
apply to use with Linux.
|
||
|
|
||
|
Identifying Your Adapter
|
||
|
========================
|
||
|
|
||
|
To verify your Intel adapter is supported, find the board ID number on the
|
||
|
adapter. Look for a label that has a barcode and a number in the format
|
||
|
A12345-001.
|
||
|
|
||
|
Use the above information and the Adapter & Driver ID Guide at:
|
||
|
|
||
|
http://support.intel.com/support/network/adapter/pro100/21397.htm
|
||
|
|
||
|
For the latest Intel network drivers for Linux, go to:
|
||
|
|
||
|
http://downloadfinder.intel.com/scripts-df/support_intel.asp
|
||
|
|
||
|
Command Line Parameters
|
||
|
=======================
|
||
|
|
||
|
If the driver is built as a module, the following optional parameters are
|
||
|
used by entering them on the command line with the modprobe or insmod command
|
||
|
using this syntax:
|
||
|
|
||
|
modprobe ixgb [<option>=<VAL1>,<VAL2>,...]
|
||
|
|
||
|
insmod ixgb [<option>=<VAL1>,<VAL2>,...]
|
||
|
|
||
|
For example, with two PRO/10GbE PCI adapters, entering:
|
||
|
|
||
|
insmod ixgb TxDescriptors=80,128
|
||
|
|
||
|
loads the ixgb driver with 80 TX resources for the first adapter and 128 TX
|
||
|
resources for the second adapter.
|
||
|
|
||
|
The default value for each parameter is generally the recommended setting,
|
||
|
unless otherwise noted. Also, if the driver is statically built into the
|
||
|
kernel, the driver is loaded with the default values for all the parameters.
|
||
|
Ethtool can be used to change some of the parameters at runtime.
|
||
|
|
||
|
FlowControl
|
||
|
Valid Range: 0-3 (0=none, 1=Rx only, 2=Tx only, 3=Rx&Tx)
|
||
|
Default: Read from the EEPROM
|
||
|
If EEPROM is not detected, default is 3
|
||
|
This parameter controls the automatic generation(Tx) and response(Rx) to
|
||
|
Ethernet PAUSE frames.
|
||
|
|
||
|
RxDescriptors
|
||
|
Valid Range: 64-512
|
||
|
Default Value: 512
|
||
|
This value is the number of receive descriptors allocated by the driver.
|
||
|
Increasing this value allows the driver to buffer more incoming packets.
|
||
|
Each descriptor is 16 bytes. A receive buffer is also allocated for
|
||
|
each descriptor and can be either 2048, 4056, 8192, or 16384 bytes,
|
||
|
depending on the MTU setting. When the MTU size is 1500 or less, the
|
||
|
receive buffer size is 2048 bytes. When the MTU is greater than 1500 the
|
||
|
receive buffer size will be either 4056, 8192, or 16384 bytes. The
|
||
|
maximum MTU size is 16114.
|
||
|
|
||
|
RxIntDelay
|
||
|
Valid Range: 0-65535 (0=off)
|
||
|
Default Value: 6
|
||
|
This value delays the generation of receive interrupts in units of
|
||
|
0.8192 microseconds. Receive interrupt reduction can improve CPU
|
||
|
efficiency if properly tuned for specific network traffic. Increasing
|
||
|
this value adds extra latency to frame reception and can end up
|
||
|
decreasing the throughput of TCP traffic. If the system is reporting
|
||
|
dropped receives, this value may be set too high, causing the driver to
|
||
|
run out of available receive descriptors.
|
||
|
|
||
|
TxDescriptors
|
||
|
Valid Range: 64-4096
|
||
|
Default Value: 256
|
||
|
This value is the number of transmit descriptors allocated by the driver.
|
||
|
Increasing this value allows the driver to queue more transmits. Each
|
||
|
descriptor is 16 bytes.
|
||
|
|
||
|
XsumRX
|
||
|
Valid Range: 0-1
|
||
|
Default Value: 1
|
||
|
A value of '1' indicates that the driver should enable IP checksum
|
||
|
offload for received packets (both UDP and TCP) to the adapter hardware.
|
||
|
|
||
|
XsumTX
|
||
|
Valid Range: 0-1
|
||
|
Default Value: 1
|
||
|
A value of '1' indicates that the driver should enable IP checksum
|
||
|
offload for transmitted packets (both UDP and TCP) to the adapter
|
||
|
hardware.
|
||
|
|
||
|
Improving Performance
|
||
|
=====================
|
||
|
|
||
|
With the Intel PRO/10 GbE adapter, the default Linux configuration will very
|
||
|
likely limit the total available throughput artificially. There is a set of
|
||
|
things that when applied together increase the ability of Linux to transmit
|
||
|
and receive data. The following enhancements were originally acquired from
|
||
|
settings published at http://www.spec.org/web99 for various submitted results
|
||
|
using Linux.
|
||
|
|
||
|
NOTE: These changes are only suggestions, and serve as a starting point for
|
||
|
tuning your network performance.
|
||
|
|
||
|
The changes are made in three major ways, listed in order of greatest effect:
|
||
|
- Use ifconfig to modify the mtu (maximum transmission unit) and the txqueuelen
|
||
|
parameter.
|
||
|
- Use sysctl to modify /proc parameters (essentially kernel tuning)
|
||
|
- Use setpci to modify the MMRBC field in PCI-X configuration space to increase
|
||
|
transmit burst lengths on the bus.
|
||
|
|
||
|
NOTE: setpci modifies the adapter's configuration registers to allow it to read
|
||
|
up to 4k bytes at a time (for transmits). However, for some systems the
|
||
|
behavior after modifying this register may be undefined (possibly errors of some
|
||
|
kind). A power-cycle, hard reset or explicitly setting the e6 register back to
|
||
|
22 (setpci -d 8086:1048 e6.b=22) may be required to get back to a stable
|
||
|
configuration.
|
||
|
|
||
|
- COPY these lines and paste them into ixgb_perf.sh:
|
||
|
#!/bin/bash
|
||
|
echo "configuring network performance , edit this file to change the interface"
|
||
|
# set mmrbc to 4k reads, modify only Intel 10GbE device IDs
|
||
|
setpci -d 8086:1048 e6.b=2e
|
||
|
# set the MTU (max transmission unit) - it requires your switch and clients to change too!
|
||
|
# set the txqueuelen
|
||
|
# your ixgb adapter should be loaded as eth1 for this to work, change if needed
|
||
|
ifconfig eth1 mtu 9000 txqueuelen 1000 up
|
||
|
# call the sysctl utility to modify /proc/sys entries
|
||
|
sysctl -p ./sysctl_ixgb.conf
|
||
|
- END ixgb_perf.sh
|
||
|
|
||
|
- COPY these lines and paste them into sysctl_ixgb.conf:
|
||
|
# some of the defaults may be different for your kernel
|
||
|
# call this file with sysctl -p <this file>
|
||
|
# these are just suggested values that worked well to increase throughput in
|
||
|
# several network benchmark tests, your mileage may vary
|
||
|
|
||
|
### IPV4 specific settings
|
||
|
net.ipv4.tcp_timestamps = 0 # turns TCP timestamp support off, default 1, reduces CPU use
|
||
|
net.ipv4.tcp_sack = 0 # turn SACK support off, default on
|
||
|
# on systems with a VERY fast bus -> memory interface this is the big gainer
|
||
|
net.ipv4.tcp_rmem = 10000000 10000000 10000000 # sets min/default/max TCP read buffer, default 4096 87380 174760
|
||
|
net.ipv4.tcp_wmem = 10000000 10000000 10000000 # sets min/pressure/max TCP write buffer, default 4096 16384 131072
|
||
|
net.ipv4.tcp_mem = 10000000 10000000 10000000 # sets min/pressure/max TCP buffer space, default 31744 32256 32768
|
||
|
|
||
|
### CORE settings (mostly for socket and UDP effect)
|
||
|
net.core.rmem_max = 524287 # maximum receive socket buffer size, default 131071
|
||
|
net.core.wmem_max = 524287 # maximum send socket buffer size, default 131071
|
||
|
net.core.rmem_default = 524287 # default receive socket buffer size, default 65535
|
||
|
net.core.wmem_default = 524287 # default send socket buffer size, default 65535
|
||
|
net.core.optmem_max = 524287 # maximum amount of option memory buffers, default 10240
|
||
|
net.core.netdev_max_backlog = 300000 # number of unprocessed input packets before kernel starts dropping them, default 300
|
||
|
- END sysctl_ixgb.conf
|
||
|
|
||
|
Edit the ixgb_perf.sh script if necessary to change eth1 to whatever interface
|
||
|
your ixgb driver is using.
|
||
|
|
||
|
NOTE: Unless these scripts are added to the boot process, these changes will
|
||
|
only last only until the next system reboot.
|
||
|
|
||
|
|
||
|
Resolving Slow UDP Traffic
|
||
|
--------------------------
|
||
|
|
||
|
If your server does not seem to be able to receive UDP traffic as fast as it
|
||
|
can receive TCP traffic, it could be because Linux, by default, does not set
|
||
|
the network stack buffers as large as they need to be to support high UDP
|
||
|
transfer rates. One way to alleviate this problem is to allow more memory to
|
||
|
be used by the IP stack to store incoming data.
|
||
|
|
||
|
For instance, use the commands:
|
||
|
sysctl -w net.core.rmem_max=262143
|
||
|
and
|
||
|
sysctl -w net.core.rmem_default=262143
|
||
|
to increase the read buffer memory max and default to 262143 (256k - 1) from
|
||
|
defaults of max=131071 (128k - 1) and default=65535 (64k - 1). These variables
|
||
|
will increase the amount of memory used by the network stack for receives, and
|
||
|
can be increased significantly more if necessary for your application.
|
||
|
|
||
|
Support
|
||
|
=======
|
||
|
|
||
|
For general information and support, go to the Intel support website at:
|
||
|
|
||
|
http://support.intel.com
|
||
|
|
||
|
If an issue is identified with the released source code on the supported
|
||
|
kernel with a supported adapter, email the specific information related to
|
||
|
the issue to linux.nics@intel.com.
|