Back to my home page
dummynet
1. Description
dummynet is a flexible tool originally designed
for testing networking protocols, and since then (mis)used for bandwidth
management.
It simulates/enforces queue and bandwidth limitations, delays, packet losses,
and multipath effects. It also implements a variant of Weighted Fair Queueing
called WF2Q+. It can be used on user's workstations, or on FreeBSD machines
acting as routers or bridges.
Just to get the idea of what you can do with dummynet, e.g. by using dummynet
on your workstation, or putting a PC with two ethernet cards between your
network and your router and booting from the floppy-image
below, here are a few examples :
These rules limit the total ICMP traffic (inbound+outbound) to 50Kbit/s
ipfw add pipe 1 icmp from any to any
ipfw pipe 1 config bw 50Kbit/s queue 10
These rules limit inbound traffic to 300Kbit/s for each host on your network
10.1.2.0/24.
ipfw add pipe 2 ip from any to 10.1.2.0/24
ipfw pipe 2 config bw 300Kbit/s queue 20 mask dst-ip 0x000000ff
If you want all machines to share evenly a single link, you should use
instead: ipfw add queue 1 ip from any to 10.1.2.0/24
ipfw queue 1 config weight 5 pipe 2 mask dst-ip 0x000000ff
ipfw pipe 2 config bw 300Kbit/s
And these rules simulate an ADSL link to the moon:
ipfw add pipe 3 ip from any to any out
ipfw add pipe 4 ip from any to any in
ipfw pipe 3 config bw 128Kbit/s queue 10 delay 1000ms
ipfw pipe 4 config bw 640Kbit/s queue 30 delay 1000ms
dummynet works by intercepting packets (selected by ipfw rules -
ipfw is one of the FreeBSD firewalls) in their way through the protocol stack,
and passing them through one or more objects called queues and
pipes, which simulate the effects of bandwidth limitations, propagation
delays, bounded-size queues, packet losses, multipath. Pipes are fixed-bandwidth
channels. Queues represent instead queues of packets, associated with a weight,
which share the bandwidth of the pipe they are connected to proportionally to
their weight.
Each pipe and queue can be configured separately, so you can apply different
limitations/delays to different traffic according to the ipfw rules (e.g.
selecting on protocols, addresses and ports ranges, interfaces, etc.). Pipes and
queues can be created dynamically, so using a single set of rules you can apply
independent limitations to all hosts in a subnet, or to all types of traffic,
etc. You can also configure the system to build cascades of pipes, so you can
simulate networks with multiple links and paths between source(s) and
destination(s).
2. Performance, status and availability
Unlike other traffic shaping
packages which run in userland, dummynet has a very little overhead, as all
processing is done within the kernel. There is no data copying involved to move
packets through pipes, just a bit of pointer shuffling, and the implementation
is able to handle thousands of pipes with O(log N) cost, where N is the number
of active pipes.
The WFQ variant we implement, called WF2Q+, has a complexity which is O(log
N) in the number of active flows, so again it is able to handle efficiently
thousands of flows. dummynet is part of FreeBSD since Sept.1998. It has been recently
(Jan.2000 and June 2000) rewritten, so the most recent, feature-rich and robust
versions are in FreeBSD 3.4-STABLE and newer releases.
You don't need to install FreeBSD on your hard disk to use it, as below you will
find a bootable single-floppy version of FreeBSD which includes dummynet,
bridging, and a lot of other goodies.
Dummynet is being heavily used by lots of people, and the code seems to be
extremely stable and robust, especially in the 3.4-STABLE version and above. Bug
fixes are generally applied to the FreeBSD source tree and are available from
the CVS tree or in
newer snapshot/releases of FreeBSD. From time to time i update the floppy image
on this site as well.
3. Support
If you have found some bug, please report it to me by
email, but don't forget to include information on which version of FreeBSD and
dummynet you are using, your rules (ipfw show; ipfw pipe show), your
configuration (bridge or router) etc.
If you have a simple question, again
just email me and i generally try to reply as soon as possible. Again, please
supply details!
For more complex things (like "i have no time to learn how to
use it, i just want this work done"), or customizations and additions of new
features to dummynet/ipfw, I am available (through my department) for doing
support on a contract basis.
Email [email protected] for discussing details.
This said, FreeBSD users should be able to use dummynet without the need for
support.
The relevant manpages (ipfw(8), dummynet(4), bridge(4)) are
a great source of information, so please read updated version of them before
asking questions.
You can also try posting on the various FreeBSD mailing
lists or newsgroups, they are usually a very good source of information.
4. Using dummynet
Dummynet is entirely controlled by the ipfw
commands and a set of sysctl variables.
4.1 Basic ipfw commands
The basic structure of ipfw commands is ipfw add [N] [prob X] action PROTO from SRC to DST [options]
where N is the rule number ;
X is a number between 0 and 1 that, when
present, indicates the probability of getting a match on this rule if all other
fields are correct. The default is deterministic match;
action is
one of the actions executed on a match, which can be any of allow, deny,
skipto N, pipe N and others. To send a packet to a dummynet pipe, we have
to use pipe N; PROTO is the protocol type we want to match (IP, TCP,
UDP, ...);
SRC and DST are address specifier (we can use addresses with
netmasks and optionally followed by ports or port ranges);
options
can be used to restrict the attention to packets coming from/to specific
interfaces, or carrying some TCP flags or ICMP options, or bridged, etc.
4.2 Sysctl variables
The following are the main sysctl variables to
control the behaviour of ipfw, bridging and dummynet:
Controlling ipfw
The firewall is mostly controlled by ipfw, and
the sysctl variables only serve to give global configuration and default
parameters. net.inet.ip.fw.enable: 1
enables firewall in the IP stack
net.inet.ip.fw.one_pass: 1
Forces a single pass through the firewall. If set to 0,
packets coming out of a pipe will be reinjected into the
firewall starting with the rule after the matching one.
NOTE: there is always one pass for bridged packets.
net.inet.ip.fw.dyn_buckets: 256 (readonly)
Current hash table size used for dynamic rules.
net.inet.ip.fw.curr_dyn_buckets: 256
Desired hash table size used for dynamic rules.
net.inet.ip.fw.dyn_count: 3
Current number of dynamic rules. (readonly)
net.inet.ip.fw.dyn_max: 1000
Max number of dynamic rules. If you exceed this limit, you will
have to wait for a rule to expire before being able to create
a new one.
net.inet.ip.fw.dyn_ack_lifetime: 300
net.inet.ip.fw.dyn_syn_lifetime: 20
net.inet.ip.fw.dyn_fin_lifetime: 20
net.inet.ip.fw.dyn_rst_lifetime: 5
net.inet.ip.fw.dyn_short_lifetime: 5
Lifetime (in seconds) for various types of dynamic rules.
Controlling dummynet
Also dummynet is mostly controlled by
ipfw, with the sysctl variables serving mostly for default parameters. net.inet.ip.dummynet.hash_size: 64
Size of hash table for dynamic pipes.
net.inet.ip.dummynet.expire: 1
Delete dynamic pipes when they become empty.
net.inet.ip.dummynet.max_chain_len: 16
Max ratio between number of dynamic queues and hash buckets.
When you exceed (max_chain_len*buckets) queues on a pipe,
packets not matching any of these will be all put into the
same default queue.
Controlling bridging
Bridging is almost exclusively controlled by sysctl
variables. net.link.ether.bridge_cfg: ed2:1,rl0:1,
set of interfaces for which bridging is enabled, and cluster
they belong to.
net.link.ether.bridge: 0
enable bridging.
net.link.ether.bridge_ipfw: 0
enable ipfw for bridging.
4.3 Pipe and queue configuration
The following ipfw commands
control dummynet pipes
- ipfw pipe NN config ...
This command is used to create or
reconfigure a pipe. NN is the numeric identifier (between 1 and 65535) of the
pipe. Issuing multiple time the configuration command results in the pipe
being reconfigured.
- ipfw [-s field] pipe [NN] show
This command shows the
parameters of a pipe. If the pipe is a dynamic one (see mask
parameter), then all dynamic pipes created from this one are listed. The list
can be very very long. The -s option allows you to sort the listing on one of
the four counters associated to the pipe.
- ipfw pipe NN delete Destroys a single pipe. Remember that packets
sent to a non-existing pipe are silently dropped.
- ipfw pipe flush Destroys all pipes.
The following
parameters can be configured for a pipe, adding the command in the pipe
config... line:
- Bandwidth: bw NNunit
NN is the bandwidth assigned to
the pipe, unit (which must follow the number with no intervening spaces) can
be any of bit/s Kbit/s Mbit/s Byte/s KByte/s MByte/s or non-ambiguous
abbreviations.
A bandwidth of 0 (or no bandwidth) results in no bandwidth
limitations (hence, no queues will ever build up).
- Queue size: queue NN [unit]
Sets the queue size, in
slots if only NN is specified, otherwise in Bytes or KBytes. When there is no
room in the queue, packets are dropped. The default queue size is 50
slots.
The combination of bandwidth and queue size influence the queueing
delay. Be careful when using low bandwidths not to use too large queues, or
you might end up with several seconds of queueing delay.
Also be careful
when you specify the queue size in packets: if you run tests over the loopback
interface, a packet can be very large, e.g. 16KB, again resulting in huge
delays.
- Delay: delay NN ms
Sets the propagation delay of the
pipe, in milliseconds. Note that the queueing delay component is independent
of the propagation delay. Also note that all delays are approximated with a
granularity of 1/HZ seconds (HZ is typically 100, but we suggest using HZ=1000
and maybe even larger values).
- Random Packet Loss: plr X
X is a floating point number
between 0 and 1 which causes packets to be dropped at random. This is done
generally to simulate lossy links. The default is 0, or no loss.
- Dynamic queue creation: mask ...
It is possible to
associate a mask to a pipe so that bandwidth and queue limitations are
enforced separately for packets belonging to different flows.
The mask command lets you specify which parts of the following
fields contribute to identify a flow:
[proto N] [src-ip N] [dst-ip N] [src-port N] [dst-port N]
where N is a bitmask where significant bits are set to 1. You can
specify one or more masks, or the all keyword to mean that all fields
are fully significant.
The default (when no mask are specified) is to
ignore all fields, so that all packets are considered to belong to the same
flow.
Whenever a new flow is encountered, a new queue (with the specified
bandwidth and queue size) is created.
WARNING!!! the number of dynamic queues that can be created in this
way can become very large. They are accessed through a hash table, whose size
you can define using the buckets NN specifier after the mask command.
To use WF2Q+, packets must be passed to queues which in turn must
be connected to a pipe.
The following ipfw commands control dummynet pipes
- ipfw queue NN config ...
This command is used to create or
reconfigure a queue. NN is the numeric identifier (between 1 and 65535) of the
queue. Issuing multiple time the configuration command results in the queue
being reconfigured.
- ipfw queue NN delete Destroys a single queue. Remember that
packets sent to a non-existing queue are silently dropped.
- ipfw queue flush Destroys all queues.
The following
parameters can be configured for a queue, adding the command in the queue
config... line:
- Pipe: pipe NN
NN is the identifier of the pipe used for
regulating traffic.
- Weight: weight NN
NN is the weight (1..100, default 1)
associated to the queue.
- Per-Flow queueing: mask ...
The syntax is the same as
for pipes. However, all queues created dynamically will share the parent
pipe's bandwidth according to the weight.
- Queue size, Random Packet Loss:
Same as for pipes.
5. Using dummynet for testing protocols
Dummynet was originally created
to test network protocols and applications, possibly even on a standalone
system. As a consequence, some of its features such as delay emulation, random
loss etc. are explicitly designed for that purpose.
There are a few things you should take in mind when doing such tests, to
avoid getting incorrect results. They are all obvious things, still it is better
to have them in mind.
- Choosing a reasonable buffer size.
As said earlier, packet can
be subject to a delay which is proportional to the total queue size (in
bytes), and inversely proportional to the bandwidth. At low bandwidths, this
queueing delays can be extremely high, especially if the queue size is defined
in terms of packets and packets are large. The default queue size is almost
certainly too large for most purposes, and it is often preferable to define
the queue size in terms of bytes rather than packets.
- Half-duplex vs. Full-duplex channels.
With the exception of
shared-medium networks such as the ethernet, most links that you want to
simulate for your experiments are full duplex. As such, the proper
configuration is the following: ipfw add pipe 1 ip from A to B
ipfw add pipe 2 ip from B to A
ipfw pipe 1 config ...
ipfw pipe 2 config ...
Should you really need to mode a half duplex network, then you can use
the following sequence. But think twice before you do so, as it is often a
non-realistic mode. ipfw add pipe 3 ip from A to B
ipfw add pipe 3 ip from B to A
ipfw pipe 3 config ...
- Interactions between bridging and multicast
You can use ipfw
(and dummynet) in a bridge by setting some sysctl variables: sysctl -w net.link.ether.bridge=1
sysctl -w net.link.ether.bridge_ipfw=1
and then specify your firewall configuration.
Be careful when you run
experiment involving multicast traffic through a dummynet-enabled bridge.
Unless you set the rules right, multicast traffic in a bridge goes through the
firewall code twice: once during forwarding at level 2, once when the packet
is passed to the local IP stack of the bridge.
Starting from Feb.2000, there are to avoid this problem. One involves a
sysctl variable:
sysctl -w net.inet.ip.fw.enable=0
which avoids that the firewall is invoked at the ip level. Otherwise,
you can use the bridged specifier in your ruleset to match only
bridged packets: ipfw add pipe 1 ip from any to any bridged
- Running over the loopback interface.
Dummynet was originally
designed for running experiments on a standalone machine. The loopback
interface lets you run senders and receivers on the same machine, but you
should remember a few things:
- TCP defaults.
Be very careful when using TCP, especially between
processes running on the same machine, or on the same subnet.
Apart from
the MTU issue mentioned earlier, at least on FreeBSD, TCP starts with a full
window when the remote endpoint is on the same subnet as one of the local
addresses. You need a simple fix in the source (tcp_input.c i believe) to fix
this behaviour in FreeBSD 3.x, whereas FreeBSD 4.x has sysctl variable(s) to
set the initial window.
Secondly, when you do experiments on configuration
with a large delay-bandwidth product, remember that many applications use the
default window size which is small, something like 16KB. You might end up not
using the full bandwidth just because your data transfer is window-limited.
5.1 Simulating multipath
One nice feature of the new version of dummynet
is the ability to simulate multiple paths between sender and receiver. This is
done using probabilistic match, e.g.: ipfw add prob 0.33 pipe 1 ip from A to B
ipfw add prob 0.5 pipe 2 ip from A to B
ipfw add pipe 3 ip from A to B
ipfw pipe 1 config ...
ipfw pipe 2 config ...
ipfw pipe 3 config ...
Given the right packet, the first rule will match with probability 1/3; in
the remaining 2/3 of occurrence we move to the second rule, which will match
with prob 1/2 (so overall 1/2*1/3 = 1/3), and the remaining 1/3 of occurrence
will move to the third rule, which has a deterministic match. We can then
configure the three pipes as desired to emulate phenomena such as packet
reordering etc.
6 Related links
Here i collect some info on how to do various
ipfw-related things. Most of this is just URLs collected from the mailing list
so the reliability of the info might be different (for good or bad) from what is
in this page.
A PicoBSD floppy
To conclude... if you want to try dummynet, here is
a bootable floppy image of a system with FreeBSD, bridging, ipfw, dummynet,
natd, ppp, drivers for a few interfaces, and accessible via telnet.
To setup this system, download the 1.44MB image, pico.000608.bin
and copy it to a floppy using dd under FreeBSD, or rawrite under
DOS/Windows.
Then put the floppy into a machine with hopefully at least one interface, and
wait for it to boot. When the system comes up, login as root, password "setup",
and you can play with bridging, ipfw and dummynet using the above commands.
Luigi Rizzo
Dipartimento di Ingegneria dell'Informazione -- Univ. di
Pisa
via Diotisalvi 2 -- 56126 PISA
tel. +39-050-568533 Fax
+39-050-568522
email: [email protected]