Mailing List Archive

Xen & I/O in clusters - problems!
Hi, we are benchmarking Xen in a cluster and got some bad results. We
might be doing something wrong, and wonder if anyone has had similar
problems.

When we benchmark throughput from native Linux to native Linux (two
physical nodes in the cluster) we get 786.034 MByte/s.
When we benchmark from a virtual domain (running on Xen on a physical
node) to another virtual domain (on another physical node) we get
56.480 MByte/s (1:16).

The difference is huge, and we wonder if the bottleneck could be the
fact that we are using software routing (We use this in order to route
from the physical node to the virtual OSs), or if this is just a
downside of Xen?

I would guess it IS the SW routing, so are there any good alternatives
for making virtual domains communicate on a cluster without SW routing?

cheers,
Rune J.A
Re: Xen & I/O in clusters - problems! [ In reply to ]
> Hi, we are benchmarking Xen in a cluster and got some bad results. We
> might be doing something wrong, and wonder if anyone has had similar
> problems.
>
> When we benchmark throughput from native Linux to native Linux (two
> physical nodes in the cluster) we get 786.034 MByte/s.
> When we benchmark from a virtual domain (running on Xen on a physical
> node) to another virtual domain (on another physical node) we get
> 56.480 MByte/s (1:16).

(Presumably you mean MBits rather than Mbytes)

The numbers you're getting are terrible compared to what we see.
Running between virtual domains on a cluster we measure
throughput as high as 897Mb/s (same as Linux native).

Our results were recorded with dual 2.4GHz Xeons with tg3 NICs
and a 128KB socket buffer, measured using ttcp. With the virtual
domain running on the other physical CPU from domain 0 we get
897Mb/s. We get similar results running the virtual domain on the
other hyperthread of the same physical CPU. We observe a
performance reduction if we run the virtual domain on the same
(logical) CPU as domain 0, down to 660Mb/s [843Mb/s on a dual
3GHz machine, so we appear to be CPU limited in this case].
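
In case it helps you reproduce the numbers, a ttcp run with a 128KB
socket buffer looks something like this (option letters vary between
ttcp variants, and -b for the socket buffer size in particular may
differ or be absent in yours, so adjust to taste):

    # receiver:
    ttcp -r -s -b 131072
    # sender, 2 GB in 8 KB writes:
    ttcp -t -s -b 131072 -l 8192 -n 262144 <receiver-host>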

> The difference is huge, and we wonder if the bottleneck could be the
> fact that we are using software routing (We use this in order to route
> from the physical node to the virtual OSs), or if this is just a
> downside of Xen?

Our results were recorded using the dom0 linux bridge code rather
than using routing.

One thing to check is that you don't have
CONFIG_IP_NF_CONNTRACK set to 'y' -- this slays performance.
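
A quick way to check is to grep the .config your xen0 kernel was
built from, e.g.:

    grep CONFIG_IP_NF_CONNTRACK .config

(run from the xen0 kernel build directory; you want it to come back
as "is not set" or '=m' rather than '=y').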

Also, if you're running multiple domains on the same CPU you may
be running into CPU scheduling issues. Some tweaks to scheduler
parameters may fix this.

> I would guess it IS the SW routing, so is there any good alternatives
> to make virtual domains communicate on a cluster without sw routing?

The Xen 2.0 architecture is not as slick as the
monolithic-hypervisor approach of Xen 1.2, but we get better
hardware support and a lot more flexibility. However, we do burn
more CPU to achieve the same IO rate. We just have to wait for
Moore's law to catch up ;-)

Ian


Re: Xen & I/O in clusters - problems! [ In reply to ]
> When we benchmark from a virtual domain (running on Xen on a physical
> node) to another virtual domain (on another physical node) we get
> 56.480 MByte/s (1:16)

Ouch. How are you benchmarking this? (what tool, what parameters, etc.).
It'll help me reproduce this on our test systems. Then we'll know if it's
your config or if there's something to track down.

We did see some weird performance for small packets at one stage and I'm not
sure if that was ever resolved. If it's the same problem, I can do a binary
chop search of changesets in order to locate it.

Cheers,
Mark


Re: Xen & I/O in clusters - problems! [ In reply to ]
Here are some details on our horrible benchmark results:


System setup:

Ethernet controller:
Intel Corp. 82547GI Gigabit Ethernet Controller

Benchmark:
PARKBENCH low-level pingpong benchmark comms1_mpi

MPI library:
Scali MPI (http://www.scali.com/)

Xen distribution:
2.0 stable, downloaded with bitkeeper

Xen configuration:
default, i.e. we just leave config-2.6.8.1-xen0 the way it is

CPU:
Single Intel(R) Pentium(R) 4 CPU 3.40GHz

Memory:
1 GB


Bandwidth measurements:

Between two nodes running plain redhat EL3 with kernel 2.4.21-15.EL:

786.034 MByte/s

Between two nodes each running only xen domain 0:

56.480 MByte/s

A graph of the measurements (x = message length, y = time):

http://www.idi.ntnu.no/~havarbj/tmp/plot.png


Cheers,
Havard


Re: Xen & I/O in clusters - problems! [ In reply to ]
Hi, we benchmark with Scali MPI. We are in the first stage, with a simple
ping program which first sends 1 byte to measure the latency; then we
tested with packet sizes of 10^5, 10^7 and 10^8. We also get the same
results when sending from domain0 to domain0 in the cluster. We are now
testing whether the routing table is the bottleneck; I will let you know
the results. Thank you :)

Cheers,
Rune J.A

On Oct 15, 2004, at 1:24 AM, Mark A. Williamson wrote:

>> When we benchmark from a virtual domain (running on Xen on a physical
>> node) to an another virtual domain (on another physical node) we get
>> 56.480 MByte/s (1:16)
>
> Ouch. How are you benchmarking this? (what tool, what parameters,
> etc.).
> It'll help me reproduce this on our test systems. Then we'll know if
> it's
> your config or if there's something to track down.
>
> We did see some weird performance for small packets at one stage and
> I'm not
> sure if that was ever resolved. If it's the same problem, I can do a
> binary
> chop search of changesets in order to locate it.
>
> Cheers,
> Mark



Re: Xen & I/O in clusters - problems! [ In reply to ]
> Between two nodes running plain redhat EL3 with kernel 2.4.21-15.EL:
>
> 786.034 MByte/s
>
> Between two nodes each running only xen domain 0:
>
> 56.480 MByte/s

That's surprising - I'd have expected any performance problems to involve
unpriv domains somehow. We've never had any performance problems when just
running domain 0, even when the code was still under development...

It'd be interesting to see your config file (I know it's just the default but
it'd be interesting for comparison as there's no obvious reason for your
problems).

Cheers,
Mark


Re: Re: Xen & I/O in clusters - problems! [ In reply to ]
On Fri, Oct 15, 2004 at 02:02:31PM +0000, Mark A. Williamson wrote:
> > Between two nodes running plain redhat EL3 with kernel 2.4.21-15.EL:
> >
> > 786.034 MByte/s
> >
> > Between two nodes each running only xen domain 0:
> >
> > 56.480 MByte/s
>
> That's surprising - I'd have expected any performance problems to involve
> unpriv domains somehow. We've never had any performance problems when just
> running domain 0, even when the code was still under development...
>

What's possibly even more funny is that when I do the same benchmark localhost <-> localhost, ie. through the loopback interface, on domain 0, the bandwidth is halved. CPU use is ~100% during both these benchmarks on domain 0 (50% per process in the last benchmark). This indicates to me that the bandwidth depends on CPU resources. Some heavy processing is happening somewhere.

I suspect this might have something to do with the MPI library scaMPI, which is supposed to be more closely linked with the lower layers of the OSI protocol stack or something. I will investigate it further.

Another thing that might be worth noting is that we're using the 2.6.8.1 kernel on domain 0 as opposed to a 2.4 kernel. I don't think that would make any difference, though (except for the fact that modules aren't loaded, but I don't think we need any of them anyway).

> It'd be interesting to see your config file (I know it's just the default but
> it'd be interesting for comparison as there's no obvious reason for your
> problems).
>

Attached

Cheers,
Havard
Re: Xen & I/O in clusters - problems! [ In reply to ]
Håvard Bjerke <Havard.Bjerke <at> idi.ntnu.no> writes:

>
> On Fri, Oct 15, 2004 at 02:02:31PM +0000, Mark A. Williamson wrote:
> > > Between two nodes running plain redhat EL3 with kernel 2.4.21-15.EL:
> > >
> > > 786.034 MByte/s
> > >
> > > Between two nodes each running only xen domain 0:
> > >
> > > 56.480 MByte/s
> >
> > That's surprising - I'd have expected any performance problems to involve
> > unpriv domains somehow. We've never had any performance problems when just
> > running domain 0, even when the code was still under development...
> >
>
> What's possibly even more funny is that when I do the same benchmark
> localhost <-> localhost, ie. through the loopback interface, on domain 0,
> the bandwidth is halved. CPU use is ~100% during both these benchmarks on
> domain 0 (50% per process in the last benchmark). This indicates to me
> that the bandwidth depends on CPU resources. Some heavy processing is
> happening somewhere.
>
> I suspect this might have something to do with the MPI library scaMPI,
> which is supposed to be more closely linked with the lower layers of the
> OSI protocol stack or something. I will investigate it further.

Around 2000 the driver used a mix of polling and interrupts to get the
latency down. If I remember correctly, it polled for about half the time an
interrupt would take.

To let the adapters manipulate memory directly, the driver also has to
allocate memory in physically contiguous blocks. Exactly how reading and
writing of these areas is done through the driver I do not know, but this
should not cause any performance hit unless Xen has issues with MMU
manipulation.

You could send Scali an email.


--
John Enok



Re: Xen & I/O in clusters - problems! New Information [ In reply to ]
Hi, we have some additional information about our problem benchmarking
Xen in clusters:

Between two Xen dom0 domains (between two physical computers in the
cluster) we got these strange results:
(We use ttcp with socket buffer sizes from 10^4 to 10^6)

Kernel 2.4:
Xen Dom0 -> Xen Dom0: ca. 65 000 KB/s

Kernel 2.6:
Xen Dom0 -> Xen Dom0: ca. 80 000 KB/s (?)

Native Linux:
Native Linux -> Native Linux: ca. 114 000 KB/s

What is new and strange is that Xen Dom0 uses about 60% of the CPU when
transferring or receiving, while native Linux only uses 6-7%(!) It seems
like we have a problem with DMA here(?). We use Xen 2.0 and Gigabit
Ethernet.

I tried 'mv /lib/tls /lib/tls.disabled' on each node without success

Below is the dmesg for the Xen node:

Linux version 2.6.8.1-xen0 (root@comp-pvfs-0-17.local) (gcc version
3.2.3 20030502 (Red Hat Linux 3.2.3-34)) #1 Tue Oct 12 14:10:47 GMT
2004
BIOS-provided physical RAM map:
Xen: 0000000000000000 - 0000000008000000 (usable)
128MB LOWMEM available.
On node 0 totalpages: 32768
DMA zone: 4096 pages, LIFO batch:1
Normal zone: 28672 pages, LIFO batch:7
HighMem zone: 0 pages, LIFO batch:1
DMI not present.
Built 1 zonelists
Kernel command line: root=/dev/sda1 ro console=tty0
Initializing CPU#0
PID hash table entries: 1024 (order 10: 8192 bytes)
Xen reported: 3400.171 MHz processor.
Using tsc for high-res timesource
Console: colour VGA+ 80x25
Dentry cache hash table entries: 32768 (order: 5, 131072 bytes)
Inode-cache hash table entries: 16384 (order: 4, 65536 bytes)
Memory: 125260k/131072k available (2636k kernel code, 5624k reserved,
834k data, 396k init, 0k highmem)
Checking if this processor honours the WP bit even in supervisor
mode... Ok.
Calibrating delay loop... 6789.52 BogoMIPS
Mount-cache hash table entries: 512 (order: 0, 4096 bytes)
CPU: After generic identify, caps: bfebfbff 00000000 00000000 00000000
CPU: After vendor identify, caps: bfebfbff 00000000 00000000 00000000
CPU: Trace cache: 12K uops, L1 D cache: 8K
CPU: L2 cache: 512K
CPU: After all inits, caps: beebcbe1 00000000 00000000 00000080
CPU: Intel(R) Pentium(R) 4 CPU 3.40GHz stepping 09
Enabling unmasked SIMD FPU exception support... done.
Checking 'hlt' instruction... disabled
NET: Registered protocol family 16
PCI: Using configuration type Xen
SCSI subsystem initialized
PCI: Probing PCI hardware
PCI: Probing PCI hardware (bus 00)
PCI: Probing PCI hardware (bus 01)
PCI: Probing PCI hardware (bus 02)
PCI: Probing PCI hardware (bus 03)
PCI: Probing PCI hardware
Initializing Cryptographic API
RAMDISK driver initialized: 16 RAM disks of 4096K size 1024 blocksize
loop: loaded (max 8 devices)
Using anticipatory io scheduler
nbd: registered device at major 43
Intel(R) PRO/1000 Network Driver - version 5.2.52-k4
Copyright (c) 1999-2004 Intel Corporation.
PCI: Obtained IRQ 18 for device 0000:01:01.0
PCI: Setting latency timer of device 0000:01:01.0 to 64
e1000: eth0: e1000_probe: Intel(R) PRO/1000 Network Connection
PCI: Obtained IRQ 21 for device 0000:03:02.0
e1000: eth1: e1000_probe: Intel(R) PRO/1000 Network Connection
pcnet32.c:v1.30i 06.28.2004 tsbogend@alpha.franken.de
e100: Intel(R) PRO/100 Network Driver, 3.0.18
e100: Copyright(c) 1999-2004 Intel Corporation
Xen virtual console successfully installed as ttyS
Event-channel device installed.
Initialising Xen netif backend
Uniform Multi-Platform E-IDE driver Revision: 7.00alpha2
ide: Assuming 33MHz system bus speed for PIO modes; override with
idebus=xx
hda: SAMSUNG CD-ROM SN-124, ATAPI CD/DVD-ROM drive
ide1: I/O resource 0x170-0x177 not free.
ide1: ports already in use, skipping probe
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
hda: ATAPI 24X CD-ROM drive, 128kB Cache
Uniform CD-ROM driver Revision: 3.20
PCI: Obtained IRQ 24 for device 0000:02:01.0
PCI: Obtained IRQ 25 for device 0000:02:01.1
scsi0 : Adaptec AIC7XXX EISA/VLB/PCI SCSI HBA DRIVER, Rev 6.2.36
<Adaptec 3960D Ultra160 SCSI adapter>
aic7899: Ultra160 Wide Channel A, SCSI Id=7, 32/253 SCBs

(scsi0:A:0): 160.000MB/s transfers (80.000MHz DT, offset 63, 16bit)
Vendor: SEAGATE Model: ST336607LW Rev: DS09
Type: Direct-Access ANSI SCSI revision: 03
scsi0:A:0:0: Tagged Queuing enabled. Depth 32
scsi1 : Adaptec AIC7XXX EISA/VLB/PCI SCSI HBA DRIVER, Rev 6.2.36
<Adaptec 3960D Ultra160 SCSI adapter>
aic7899: Ultra160 Wide Channel B, SCSI Id=7, 32/253 SCBs

Red Hat/Adaptec aacraid driver (1.1.2-lk2 Oct 12 2004)
3ware Storage Controller device driver for Linux v1.26.00.039.
3w-xxxx: No cards found.
libata version 1.02 loaded.
ata_piix version 1.02
ata_piix: combined mode detected
PCI: Obtained IRQ 17 for device 0000:00:1f.2
ata: 0x1f0 IDE port busy
PCI: Setting latency timer of device 0000:00:1f.2 to 64
ata1: SATA max UDMA/133 cmd 0x170 ctl 0x376 bmdma 0xFEA8 irq 15
ata1: SATA port has no device.
scsi2 : ata_piix
SCSI device sda: 71132959 512-byte hdwr sectors (36420 MB)
SCSI device sda: drive cache: write through
sda: sda1 sda2 sda3 sda4 < sda5 >
Attached scsi disk sda at scsi0, channel 0, id 0, lun 0
mice: PS/2 mouse device common for all mice
serio: i8042 AUX port at 0x60,0x64 irq 12
serio: i8042 KBD port at 0x60,0x64 irq 1
md: raid0 personality registered as nr 2
md: raid1 personality registered as nr 3
md: raid5 personality registered as nr 4
raid5: automatically using best checksumming function: pIII_sse
pIII_sse : 440.400 MB/sec
raid5: using function: pIII_sse (440.400 MB/sec)
md: md driver 0.90.0 MAX_MD_DEVS=256, MD_SB_DISKS=27
device-mapper: 4.1.0-ioctl (2003-12-10) initialised: dm@uk.sistina.com
NET: Registered protocol family 2
IP: routing cache hash table of 1024 buckets, 8Kbytes
TCP: Hash tables configured (established 8192 bind 16384)
NET: Registered protocol family 1
NET: Registered protocol family 17
Bridge firewalling registered
md: Autodetecting RAID arrays.
md: autorun ...
md: ... autorun DONE.
EXT3-fs: INFO: recovery required on readonly filesystem.
EXT3-fs: write access will be enabled during recovery.
kjournald starting. Commit interval 5 seconds
EXT3-fs: sda1: orphan cleanup on readonly fs
ext3_orphan_cleanup: deleting unreferenced inode 4718
EXT3-fs: sda1: 1 orphan inode deleted
EXT3-fs: recovery complete.
EXT3-fs: mounted filesystem with ordered data mode.
VFS: Mounted root (ext3 filesystem) readonly.
Freeing unused kernel memory: 396k freed

***************************************************************
***************************************************************
** WARNING: Currently emulating unsupported memory accesses **
** in /lib/tls libraries. The emulation is very **
** slow, and may not work correctly with all **
** programs (e.g., some may 'Segmentation fault'). **
** TO ENSURE FULL PERFORMANCE AND CORRECT FUNCTION, **
** YOU MUST EXECUTE THE FOLLOWING AS ROOT: **
** mv /lib/tls /lib/tls.disabled **
***************************************************************
***************************************************************

Pausing... 5Pausing... 4Pausing...
3Pausing... 2Pausing...
1Continuing...

EXT3 FS on sda1, internal journal
Adding 1020116k swap on /dev/sda3. Priority:-1 extents:1
kjournald starting. Commit interval 5 seconds
EXT3 FS on sda2, internal journal
EXT3-fs: mounted filesystem with ordered data mode.
kjournald starting. Commit interval 5 seconds
EXT3 FS on sda5, internal journal
EXT3-fs: mounted filesystem with ordered data mode.
e1000: eth0: e1000_watchdog: NIC Link is Up 1000 Mbps Full Duplex
process `syslogd' is using obsolete setsockopt SO_BSDCOMPAT
process `snmpd' is using obsolete setsockopt SO_BSDCOMPAT

Cheers,
Rune




On Oct 15, 2004, at 1:24 AM, Mark A. Williamson wrote:

>> When we benchmark from a virtual domain (running on Xen on a physical
>> node) to an another virtual domain (on another physical node) we get
>> 56.480 MByte/s (1:16)
>
> Ouch. How are you benchmarking this? (what tool, what parameters,
> etc.).
> It'll help me reproduce this on our test systems. Then we'll know if
> it's
> your config or if there's something to track down.
>
> We did see some weird performance for small packets at one stage and
> I'm not
> sure if that was ever resolved. If it's the same problem, I can do a
> binary
> chop search of changesets in order to locate it.
>
> Cheers,
> Mark



Re: Xen & I/O in clusters - problems! New Information [ In reply to ]
> Between two Xen dom0 domains (between two physical computers in the
> cluster) we got these strange results:
> (We use ttcp with socket buffer sizes from 10^4 to 10^6)
>
> Kernel 2.6:
> Xen Dom0 -> Xen Dom0: ca. 80 000 KB/s (?)
>
> Native Linux:
> Native Linux -> Native Linux: ca. 114 000 KB/s
>
> What is new and strange is that Xen Dom0 uses about 60% of the CPU when
> transferring or receiving, while native Linux only uses 6-7%(!) It seems
> like we have a problem with DMA here(?). We use Xen 2.0 and Gigabit
> Ethernet.

dom0 to dom0 performance really shouldn't be any different from
native. It certainly isn't on any of our machines.

The only thing I can think of is that something stupid might be
happening with interrupts on your machines. Can you compare the
rate that the relevant interrupts are going up in
/proc/interrupts between xenLinux and native.

There's no interrupt sharing or anything daft like that going on?
Are you using NAPI on the native e1000 linux driver?


Ian


Re: Xen & I/O in clusters - problems! New Information [ In reply to ]
We tried to compile xen0 with CONFIG_E1000_NAPI = y and got the same
results between two xen dom0 nodes. I am not sure if these interrupt
counts tell us anything:

Native Linux:


CPU0
0: 87290435 XT-PIC timer
1: 2 XT-PIC keyboard
2: 0 XT-PIC cascade
3: 7668994 XT-PIC eth0
7: 0 XT-PIC ehci-hcd
8: 1 XT-PIC rtc
10: 3 XT-PIC usb-uhci
11: 332088 XT-PIC aic7xxx, aic7xxx, usb-uhci
14: 1 XT-PIC ide0
15: 0 XT-PIC libata
NMI: 0
ERR: 0


Xen Dom0:

CPU0
1: 2 Phys-irq keyboard
14: 3 Phys-irq ide0
18: 954304 Phys-irq eth0
24: 7313 Phys-irq aic7xxx
25: 30 Phys-irq aic7xxx
128: 1 Dynamic-irq misdirect
129: 0 Dynamic-irq ctrl-if
130: 241914 Dynamic-irq timer
131: 0 Dynamic-irq timer_dbg, net-be-dbg
132: 0 Dynamic-irq console
NMI: 0
ERR: 0

If you can see anything which is not normal behavior for Xen, please
tell us :)

Cheers,
Rune


On Oct 21, 2004, at 10:24 PM, Ian Pratt wrote:

>
>> Between two Xen dom0 domains (between two physical computers in the
>> cluster) we got these strange results:
>> (We use ttcp socketbuffsize, 10^4 -> 10^6)
>>
>> Kernel 2.6:
>> Xen Dom0 -> Xen Dom0: ca. 80 000 KB/s (?)
>>
>> Native Linux:
>> Native Linux -> Native Linux: ca. 114 000 KB/s
>>
>> What is new and strage is that Xen Dom0 use about 60% of the CPU when
>> transfering or receiving, while
>> Native Linux ony use 6-7%(!) It seems like we have a problem with the
>> DMA here(?). We use Xen 2.0,
>> Gigabit ethernet.
>
> dom0 to dom0 performance really shouldn't be any difference from
> native. It certainly isn't on any of our machines.
>
> The only thing I can think of is that something stupid might be
> happening with interrupts on your machines. Can you compare the
> rate that the relevant interrupts are going up in
> /proc/interrupts between xenLinux and native.
>
> There's no interrupt sharing or anything daft like that going on?
> Are you using NAPI on the native e1000 linux driver?
>
>
> Ian



Re: Xen & I/O in clusters - problems! New Information [ In reply to ]
>
> We tried to compile xen0 with CONFIG_E1000_NAPI = y and got the same
> results between two xen dom0 nodes. I am not sure if these interrupt
> counts tell us anything:

It's the different rate at which the eth0 interrupt counts go up
during your bandwidth tests that is interesting. e.g. poll it
once a second during the test.
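
Something as simple as the following is enough (assuming your eth0
line in /proc/interrupts looks like the ones you posted, with the
count in the second column):

    while sleep 1; do
        awk '/eth0/ { print $2 }' /proc/interrupts
    done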

It looks like your native Linux is using legacy PIC mode
rather than the ioapic. Have you tried an SMP native
kernel to see if it gets the same interrupt layout as Xen?

Ian

> Native Linux:
>
>
> CPU0
> 0: 87290435 XT-PIC timer
> 1: 2 XT-PIC keyboard
> 2: 0 XT-PIC cascade
> 3: 7668994 XT-PIC eth0
> 7: 0 XT-PIC ehci-hcd
> 8: 1 XT-PIC rtc
> 10: 3 XT-PIC usb-uhci
> 11: 332088 XT-PIC aic7xxx, aic7xxx, usb-uhci
> 14: 1 XT-PIC ide0
> 15: 0 XT-PIC libata
> NMI: 0
> ERR: 0
>
>
> Xen Dom0:
>
> CPU0
> 1: 2 Phys-irq keyboard
> 14: 3 Phys-irq ide0
> 18: 954304 Phys-irq eth0
> 24: 7313 Phys-irq aic7xxx
> 25: 30 Phys-irq aic7xxx
> 128: 1 Dynamic-irq misdirect
> 129: 0 Dynamic-irq ctrl-if
> 130: 241914 Dynamic-irq timer
> 131: 0 Dynamic-irq timer_dbg, net-be-dbg
> 132: 0 Dynamic-irq console
> NMI: 0
> ERR: 0
>
> If you can see anything which is not normal behavior of xen please
> tell us :)
>
> Cheers,
> Rune


Re: Xen & I/O in clusters - problems! New Information [ In reply to ]
On Fri, Oct 22, 2004 at 06:47:35PM +0100, Ian Pratt wrote:
> >
> > We tried to compile xen0 with CONFIG_E1000_NAPI = y and got the samre
> > results between
> > two xen dom0 nodes. I am not sure if these interrupts tells anything:
>
> It's the different rate at which the eth0 interrupt counts go up
> during your bandwidth tests that is interesting. e.g. poll it
> once a second during the test.
>

We tried sending 1 MB and measured:
non-SMP native Linux:
~ 130k interrupts
114 kB/s
native Linux with compiled-in SMP support, single CPU:
~ 140k interrupts
114 kB/s
Xen0:
~ 180k interrupts
80 kB/s

> It looks like your native Linux is not using legacy PIC mode
> rather than using the ioapic. Have you tried an SMP native
> kernel to see if it gets the same interrupt layout as Xen?
>

Layout in native linux w/o SMP:

CPU0
0: 130710161 XT-PIC timer
1: 2 XT-PIC keyboard
2: 0 XT-PIC cascade
3: 12872557 XT-PIC eth0
7: 0 XT-PIC ehci-hcd
8: 1 XT-PIC rtc
10: 3 XT-PIC usb-uhci
11: 445263 XT-PIC aic7xxx, aic7xxx, usb-uhci
14: 1 XT-PIC ide0
15: 0 XT-PIC libata
NMI: 0
ERR: 0


Layout in native linux compiled with SMP:

CPU0
0: 121362 IO-APIC-edge timer
1: 2 IO-APIC-edge keyboard
2: 0 XT-PIC cascade
14: 5 IO-APIC-edge ide0
16: 0 IO-APIC-level usb-uhci
18: 177583 IO-APIC-level eth0
19: 0 IO-APIC-level usb-uhci
24: 7587 IO-APIC-level aic7xxx
25: 30 IO-APIC-level aic7xxx
NMI: 0
LOC: 121307
ERR: 0
MIS: 0


Layout in Xen0:

CPU0
1: 2 Phys-irq keyboard
14: 3 Phys-irq ide0
18: 267274 Phys-irq eth0
24: 15910 Phys-irq aic7xxx
25: 30 Phys-irq aic7xxx
128: 1 Dynamic-irq misdirect
129: 0 Dynamic-irq ctrl-if
130: 771917 Dynamic-irq timer
131: 0 Dynamic-irq timer_dbg, net-be-dbg
132: 0 Dynamic-irq console
NMI: 0
ERR: 0


Questions:
Do you think interrupt sharing is the problem?
Or the use of IO-APIC?
Is this an inherent problem with Xen?
Is it possible to change the interrupt scheme in Xen in order to achieve the same performance as in native linux?


Cheers,
Håvard


Re: Xen & I/O in clusters - problems! New Information [ In reply to ]
> On Fri, Oct 22, 2004 at 06:47:35PM +0100, Ian Pratt wrote:
> > >
> > > We tried to compile xen0 with CONFIG_E1000_NAPI = y and got the samre
> > > results between
> > > two xen dom0 nodes. I am not sure if these interrupts tells anything:
> >
> > It's the different rate at which the eth0 interrupt counts go up
> > during your bandwidth tests that is interesting. e.g. poll it
> > once a second during the test.
> >
>
> We tried sending 1 MB and measured:

1MB isn't really very much with a 128KB socket buffer. Do you get
the same results with larger transfers?

> non-SMP native Linux:
> ~ 130k interrupts
> 114 kB/s
> native Linux with compiled-in SMP support, single CPU:
> ~ 140k interrupts
> 114 kB/s
> Xen0:
> ~ 180k interrupts
> 80 kB/s

It's pretty odd that Xen's taking more interrupts. Are you using
the same native kernel version as you are for Xen?

Also, what happens if you boot xen with 'nosmp' on the Xen
command line.

We've got Xen tcp performance results from a bunch of machines,
and dom0 to dom0 performance has always been almost identical to
native for 1500 byte MTU packets.


Ian



Re: Xen & I/O in clusters - problems! New Information [ In reply to ]
Just a thought: you are using the BVT scheduler, right? We haven't tested
performance with the other schedulers recently but we know something goes
wrong for IO intensive domains on Atropos.

Mark

On Wednesday 27 Oct 2004 19:26, Ian Pratt wrote:
> > On Fri, Oct 22, 2004 at 06:47:35PM +0100, Ian Pratt wrote:
> > > > We tried to compile xen0 with CONFIG_E1000_NAPI = y and got the samre
> > > > results between
> > > > two xen dom0 nodes. I am not sure if these interrupts tells anything:
> > >
> > > It's the different rate at which the eth0 interrupt counts go up
> > > during your bandwidth tests that is interesting. e.g. poll it
> > > once a second during the test.
> >
> > We tried sending 1 MB and measured:
>
> 1MB isn't really very much with a 128KB socket buffer. Do you get
> the same results with larger transfers?
>
> > non-SMP native Linux:
> > ~ 130k interrupts
> > 114 kB/s
> > native Linux with compiled-in SMP support, single CPU:
> > ~ 140k interrupts
> > 114 kB/s
> > Xen0:
> > ~ 180k interrupts
> > 80 kB/s
>
> It's pretty odd that Xen's taking more interrupts. Are you using
> the same native kernel version as you are for Xen?
>
> Also, what happens if you boot xen with 'nosmp' on the Xen
> command line.
>
> We've got Xen tcp performance results from a bunch of machines,
> and dom0 to dom0 performance has always been almost identical to
> native for 1500 byte MTU packets.
>
>
> Ian
>


Re: Xen & I/O in clusters - problems! New Information [ In reply to ]
On Wed, Oct 27, 2004 at 07:26:10PM +0100, Ian Pratt wrote:
> > On Fri, Oct 22, 2004 at 06:47:35PM +0100, Ian Pratt wrote:
> > > >
> > > > We tried to compile xen0 with CONFIG_E1000_NAPI = y and got the samre
> > > > results between
> > > > two xen dom0 nodes. I am not sure if these interrupts tells anything:
> > >
> > > It's the different rate at which the eth0 interrupt counts go up
> > > during your bandwidth tests that is interesting. e.g. poll it
> > > once a second during the test.
> > >
> >
> > We tried sending 1 MB and measured:
>
> 1MB isn't really very much with a 128KB socket buffer. Do you get
> the same results with larger transfers?
>

With 10 MB transfers the results are roughly the same:

native Linux with compiled-in SMP support, single CPU:
~ 1365k interrupts
114 kB/s
Xen0 with "nosmp":
~ 1676 k interrupts
76 kB/s

> > non-SMP native Linux:
> > ~ 130k interrupts
> > 114 kB/s
> > native Linux with compiled-in SMP support, single CPU:
> > ~ 140k interrupts
> > 114 kB/s
> > Xen0:
> > ~ 180k interrupts
> > 80 kB/s
>
> It's pretty odd that Xen's taking more interrupts. Are you using
> the same native kernel version as you are for Xen?
>

Yes, both are 2.4.27

> Also, what happens if you boot xen with 'nosmp' on the Xen
> command line.
>

There seems to be no change in behaviour. The interrupt layout and count remains roughly the same.

Do you have any more tips? :)


Håvard


Re: Xen & I/O in clusters - problems! New Information [ In reply to ]
We're getting these results in domain 0, not in a VM. As I understand it, BVT or Atropos scheduling does not apply in domain 0?

Håvard

On Wed, Oct 27, 2004 at 07:40:27PM +0100, Mark A. Williamson wrote:
> Just a thought: you are using the BVT scheduler, right? We haven't tested
> performance with the other schedulers recently but we know something goes
> wrong for IO intensive domains on Atropos.
>
> Mark
>
> On Wednesday 27 Oct 2004 19:26, Ian Pratt wrote:
> > > On Fri, Oct 22, 2004 at 06:47:35PM +0100, Ian Pratt wrote:
> > > > > We tried to compile xen0 with CONFIG_E1000_NAPI = y and got the samre
> > > > > results between
> > > > > two xen dom0 nodes. I am not sure if these interrupts tells anything:
> > > >
> > > > It's the different rate at which the eth0 interrupt counts go up
> > > > during your bandwidth tests that is interesting. e.g. poll it
> > > > once a second during the test.
> > >
> > > We tried sending 1 MB and measured:
> >
> > 1MB isn't really very much with a 128KB socket buffer. Do you get
> > the same results with larger transfers?
> >
> > > non-SMP native Linux:
> > > ~ 130k interrupts
> > > 114 kB/s
> > > native Linux with compiled-in SMP support, single CPU:
> > > ~ 140k interrupts
> > > 114 kB/s
> > > Xen0:
> > > ~ 180k interrupts
> > > 80 kB/s
> >
> > It's pretty odd that Xen's taking more interrupts. Are you using
> > the same native kernel version as you are for Xen?
> >
> > Also, what happens if you boot xen with 'nosmp' on the Xen
> > command line.
> >
> > We've got Xen tcp performance results from a bunch of machines,
> > and dom0 to dom0 performance has always been almost identical to
> > native for 1500 byte MTU packets.
> >
> >
> > Ian
> >


Re: Xen & I/O in clusters - problems! New Information [ In reply to ]
> Yes, both are 2.4.27
>
> > Also, what happens if you boot xen with 'nosmp' on the Xen
> > command line.
> >
>
> There seems to be no change in behaviour. The interrupt layout and count remains roughly the same.
>
> Do you have any more tips? :)

Please can you try Xen/linux 2.6.9. Also, please can you remind
me of the spec of your machines.

Ian
Re: Xen & I/O in clusters - problems! New Information [ In reply to ]
Domain 0 is (from the POV of Xen) basically Just Another VM and is scheduled
pre-emptively. Some capability flags give the Dom0 VM the privileges it
needs in order to control devices, screen, Xen management functions etc. If
you use the buggy Atropos then you'll lose performance even with just one
domain.
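
If you want to be sure which scheduler you're on, it can be selected on
the Xen command line at boot -- I believe the option is 'sched=', e.g.:

    kernel /boot/xen.gz sched=bvt

(path and the rest of the boot entry being whatever your setup uses).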

HTH,
Mark

On Thursday 28 October 2004 11:51, Håvard Bjerke wrote:
> We're getting these results in domain 0, not in a VM. As I understand, BVT
> or apropos scheduling do not apply in domain 0?
>
> Håvard
>
> On Wed, Oct 27, 2004 at 07:40:27PM +0100, Mark A. Williamson wrote:
> > Just a thought: you are using the BVT scheduler, right? We haven't
> > tested performance with the other schedulers recently but we know
> > something goes wrong for IO intensive domains on Atropos.
> >
> > Mark
> >
> > On Wednesday 27 Oct 2004 19:26, Ian Pratt wrote:
> > > > On Fri, Oct 22, 2004 at 06:47:35PM +0100, Ian Pratt wrote:
> > > > > > We tried to compile xen0 with CONFIG_E1000_NAPI = y and got the
> > > > > > samre results between
> > > > > > two xen dom0 nodes. I am not sure if these interrupts tells
> > > > > > anything:
> > > > >
> > > > > It's the different rate at which the eth0 interrupt counts go up
> > > > > during your bandwidth tests that is interesting. e.g. poll it
> > > > > once a second during the test.
> > > >
> > > > We tried sending 1 MB and measured:
> > >
> > > 1MB isn't really very much with a 128KB socket buffer. Do you get
> > > the same results with larger transfers?
> > >
> > > > non-SMP native Linux:
> > > > ~ 130k interrupts
> > > > 114 kB/s
> > > > native Linux with compiled-in SMP support, single CPU:
> > > > ~ 140k interrupts
> > > > 114 kB/s
> > > > Xen0:
> > > > ~ 180k interrupts
> > > > 80 kB/s
> > >
> > > It's pretty odd that Xen's taking more interrupts. Are you using
> > > the same native kernel version as you are for Xen?
> > >
> > > Also, what happens if you boot xen with 'nosmp' on the Xen
> > > command line.
> > >
> > > We've got Xen tcp performance results from a bunch of machines,
> > > and dom0 to dom0 performance has always been almost identical to
> > > native for 1500 byte MTU packets.
> > >
> > >
> > > Ian
> > >


Re: Xen & I/O in clusters - problems! New Information [ In reply to ]
On Thu, Oct 28, 2004 at 01:44:44PM +0100, Ian Pratt wrote:
>
> Please can you try Xen/linux 2.6.9. Also, please can you remind
> me of the spec of your machines.
>

Specs:
Ethernet controller: Intel Corp. 82547GI Gigabit Ethernet Controller
CPU: Intel(R) Pentium(R) 4 CPU 3.40GHz single cpu
RAM: 1 GB


The previous results from 2.4.27 are (I wrote kB/s earlier but that was wrong):
Native:
130k interrupts
114 MB/s
Xen0:
180k interrupts
80 MB/s

New results with 2.6.9:
Native:
bandwidth: 2048000000 bytes in 17.42 real seconds = 114794.42 KB/sec
CPU: 0.0user 1.4sys 0:17real 8%
interrupts: 135k
Xen0:
bandwidth: 2048000000 bytes in 21.77 real seconds = 91885.76 KB/sec
CPU: 0.0user 16.2sys 0:21real 74%
interrupts: 107k

I also tried sending localhost-localhost in 2.4.27, and interestingly native performed roughly 7:1 better than xen0:
Native:
bandwidth: 2048000000 bytes in 2.70 real seconds = 741704.96 KB/sec
CPU: 0.0user 1.6sys 0:02real 62%
Xen0:
bandwidth: 2048000000 bytes in 17.12 real seconds = 116838.03 KB/sec
CPU: 5.9user 0.0sys 0:17real 34%

We've also recently benchmarked in an SMP cluster and achieved satisfactory results, i.e. around 114 MB/s with both native and Xen0. But we're still wondering why we're not achieving full speed in a single-CPU cluster.

Håvard


Re: Xen & I/O in clusters - Single Vs. Dual CPU issue [ In reply to ]
Hi, we have validated the single-CPU results in another single-CPU
cluster, and we still get a performance loss of about 30%
(ca. 85 000 KB/s between two dom0 nodes, and NOT 114 000 KB/s).
Specs:
Ethernet controller: Intel Corp. 82547GI Gigabit Ethernet Controller
CPU: Intel(R) Pentium(R) 4 CPU 3.40GHz single CPU
RAM: 1 GB

But in a dual-CPU cluster (Intel Xeon CPU 2.40 GHz, Ethernet
controller: Intel Corp. 82547GI Gigabit Ethernet Controller) we get
only a 1% performance loss
(from 114 000 KB/s -> 110 000 KB/s).

Both the single CPU clusters and the dual cluster are Xen 2.0 beta
(2.4.26 kernel) with Red Hat Ent. 3

It seems to me that there is an issue with Xen and Gigabit Ethernet
controllers, where Xen is optimized for dual CPUs(?). As mentioned
before, we see more interrupts on eth0 with Xen dom0 than with native
Linux (we use BVT, not Atropos).

Are there any Xen developers/testers who have tested Xen 2.0 on a
single-CPU cluster and can confirm (or not) these results?

Cheers,
Rune



Re: Xen & I/O in clusters - Single Vs. Dual CPU issue [ In reply to ]
> But in a dual-CPU cluster (Intel Xeon CPU 2.40 GHz, Ethernet
> controller: Intel Corp. 82547GI Gigabit Ethernet Controller) we get
> only a 1% performance loss
> (from 114 000 KB/s -> 110 000 KB/s).

Good, that's the result we expect.

> Hi, we have validated the single-CPU results in another single-CPU
> cluster, and we still get a performance loss of about 30%
> (ca. 85 000 KB/s between two dom0 nodes, and NOT 114 000 KB/s).
> Specs:
> Ethernet controller: Intel Corp. 82547GI Gigabit Ethernet Controller
> CPU: Intel(R) Pentium(R) 4 CPU 3.40GHz single CPU
> RAM: 1 GB

Hmm, have your systems got an IOAPIC, or is Xen using the legacy
PIC code? The latter probably hasn't been thoroughly performance
tested...
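
An easy way to check is to look for APIC messages in the boot output
and at how the interrupts are labelled, e.g.:

    dmesg | grep -i apic
    grep -i 'IO-APIC' /proc/interrupts

(on native Linux, IO-APIC-routed interrupts show up as IO-APIC-edge /
IO-APIC-level lines; if everything is XT-PIC you're on the legacy PIC).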

Ian



Re: Xen & I/O in clusters - Single Vs. Dual CPU issue [ In reply to ]
On Fri, Oct 29, 2004 at 06:24:35PM +0100, Ian Pratt wrote:
>
> > But, in a dual CPU cluster, Intel Xenon CPU 2.40 GHz Ethernet
> > controller: Intel Corp. 82547GI Gigabit Ethernet Controller, we get
> > only 1%
> > (from 114 000 KB/s -> 110 000 KB/s) performance loss.)
>
> Good, that's the result we expect.
>
> > Hi, we have validated the single CPU results in another single CPU
> > cluster, and we still get an performance loss about 30%
> > (ca. 85 000 KB/s between two dom0 nodes (and NOT 114 000 KB/s),
> > Specs:
> > Ethernet controller: Intel Corp. 82547GI Gigabit Ethernet Controller
> > CPU: Intel(R) Pentium(R) 4 CPU 3.40GHz single CPU
> > RAM: 1 GB)
>
> Hmm, have your systems got an IOAPIC, or is Xen using the legacy
> PIC code? The latter probably hasn't been thoroughly performance
> tested...
>

The native systems can have either XT-PIC or IOAPIC, and it seems that both have equal performance. In Xen0, however, by looking through dmesg, there doesn't seem to be any IOAPIC. Is it possible to enable (or disable) IOAPIC in Xen0?

Håvard


Re: Xen & I/O in clusters - Single Vs. Dual CPU issue [ In reply to ]
> The native systems can have either XT-PIC or IOAPIC, and it seems that both have equal performance. In Xen0, however, by looking through dmesg, there doesn't seem to be any IOAPIC. Is it possible to enable (or disable) IOAPIC in Xen0?

If you boot with 'ignorebiostables' on the Xen command line Xen
will ignore the IOAPIC and use the PIC.

I could believe there might be a lurking performance problem with
PIC support as it probably hasn't had the same level of
performance testing as IOAPIC.
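
For reference, the option just goes on the Xen (xen.gz) line of your
boot entry; with GRUB that would look something like this (paths and
kernel version being whatever you normally boot):

    title Xen 2.0 (PIC mode)
        kernel /boot/xen.gz ignorebiostables
        module /boot/vmlinuz-2.6.9-xen0 root=/dev/sda1 ro console=tty0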

Ian
Re: Xen & I/O in clusters - Single Vs. Dual CPU issue [ In reply to ]
On Fri, Oct 29, 2004 at 08:46:46PM +0100, Ian Pratt wrote:
> > The native systems can have either XT-PIC or IOAPIC, and it seems that both have equal performance. In Xen0, however, by looking through dmesg, there doesn't seem to be any IOAPIC. Is it possible to enable (or disable) IOAPIC in Xen0?
>
> If you boot with 'ignorebiostables' on the Xen command line Xen
> will ignore the IOAPIC and use the PIC.
>
> I could believe there might be a lurking performance problem with
> PIC support as it probably hasn't had the same level of
> performance testing as IOAPIC.
>

'ignorebiostables' did the trick. Thanks for the help :)

Now I'm curious about live migration. Is it possible to live migrate an MPI application running on a set of nodes to another set of nodes? Has anyone tried that?

Håvard


