Mailing List Archive

Lossy cogent p2p experiences?
Hi all, curious if anyone who has used Cogent as a point-to-point provider has gone through packet loss issues with them and was able to successfully resolve them? I've got a non-rate-limited 10gig circuit between two geographic locations that have about 52ms of latency. Mine is set up to support both jumbo frames and vlan tagging. I do know Cogent packetizes these circuits, so they're not like waves, and that the expected single-session TCP performance may be limited to a few gbit/sec, but I should otherwise be able to fully utilize the circuit given enough flows.
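To put rough numbers on the single-session limit: TCP throughput is bounded by window size over RTT, so at 52ms a flow needs an enormous window to fill 10gig. A quick sketch in Python (typical, assumed values):

rtt = 0.052                        # seconds, round-trip time on the circuit
bdp = 10e9 * rtt / 8               # bytes in flight needed to fill 10 Gb/s
print(bdp / 2**20)                 # ~62 MiB of window required

# With a common autotuned maximum window of ~16 MiB:
print(16 * 2**20 * 8 / rtt / 1e9)  # ~2.6 Gb/s -- "a few gbit/sec"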

Circuit went live earlier this year, and we had zero issues with it. Testing with common tools like iperf would allow several gbit/sec of TCP traffic using single flows, even without an optimized TCP stack. Using parallel flows or UDP we could easily get close to wire speed. Starting about ten weeks ago we had a significant slowdown, or even complete failure, of bursty data replication tasks between equipment using this circuit. Rounds of testing demonstrate that new flows often experience significant initial packet loss of several thousand packets, and then ongoing lesser packet loss every five to ten seconds. At times we can't do better than 50 Mbit/sec, and we rarely achieve a gigabit unless we run a bunch of streams with a lot of tuning. With UDP we also see the loss, but can still push many gigabits through with one sender, or wire speed with several nodes.

For equipment which doesn't use a tunable TCP stack, such as storage arrays or VMware, the retransmits completely ruin performance or may result in ongoing failure we can't overcome.

Cogent support has been about as bad as you can get: everything is great, clean your fiber, iperf isn't a good test, install a physical loop (oh wait, we don't want that, go pull it back off), new updates at three-to-seven-day intervals, etc. If the performance had never been good to begin with, I'd have just attributed this to their circuits, but since it worked until late June, I know something has changed. I'm hoping someone else has run into this and maybe knows of some hints I could give them to investigate. To me it sounds like there's a rate limiter / policer defined somewhere in the circuit, or an overloaded interface/device we're forced to traverse, but they assure me this is not the case and claim to have destroyed and rebuilt the logical circuit.

Thanks!
Re: Lossy cogent p2p experiences? [ In reply to ]
Cogent has asked many people NOT to purchase their ethernet private circuit
point to point service unless the customer can guarantee that no single
flow will exceed 2 Gbps. This works fine as long as the service is used
mostly for mixed IP traffic, like a bunch of randomly mixed customers
together.

What you are trying to do is probably against the guidelines their
engineering group has given them for what they can sell now.

This is a known weird limitation with Cogent's private circuit service.

The best working theory that several people I know in the neteng community
have come up with is that Cogent does not want to adversely impact all
other customers on a router in sites where the upstreams and links to
neighboring POPs are implemented as something like 4 x 10 Gbps, i.e.,
places where they have not upgraded that specific router to a full 100
Gbps upstream. Because per-flow hashing pins each flow to a single member
link, moving large flows of >2 Gbps could flat-top the traffic chart on
just one of those 10 Gbps circuits.
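A toy model of that failure mode, with made-up flow sizes and a random
stand-in for the real 5-tuple hash:

import random
random.seed(1)

# Per-flow hashing across a 4 x 10 Gb/s bundle: each flow is pinned to
# exactly one member link, no matter how large the flow is.
members = [0.0] * 4                                       # offered Gb/s per member
flows = [random.uniform(0.01, 0.04) for _ in range(800)]  # ~20 Gb/s of small flows
flows.append(5.0)                                         # one 5 Gb/s elephant

for size in flows:
    members[random.randrange(4)] += size                  # stand-in for the hash

print([round(m, 1) for m in members])
# The small flows spread to roughly 5 Gb/s per member; whichever member
# the elephant lands on sits near 10 Gb/s -- flat-topped -- while the
# rest have headroom.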



Re: Lossy cogent p2p experiences? [ In reply to ]
That's not what I'm trying to do; that's just what I'm using during testing to demonstrate the loss to them. It's intended to bridge a number of networks with hundreds of flows, including inbound internet sources, but any new TCP flow is subject to numerous dropped packets at establishment and then ongoing loss every five to ten seconds. The initial loss and the ongoing bursts of loss cause the TCP window to shrink so much that any single flow, between systems that can't be optimized, ends up varying from 50 Mbit/sec to something far short of a gigabit. It was also fine for six months before this miserable behavior began in late June.
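The behavior is consistent with the classic loss-limited TCP throughput estimate (Mathis et al.), rate ~ MSS / (RTT * sqrt(loss)). Plugging in this circuit's 52ms RTT and standard 1500-byte frames as a sanity check:

from math import sqrt

mss, rtt = 1460, 0.052             # bytes, seconds
for loss in (1e-3, 1e-4, 1e-5):    # fraction of packets dropped
    rate = mss * 8 / (rtt * sqrt(loss))
    print(f"loss {loss:.0e}: ~{rate / 1e6:.0f} Mb/s")

# loss 1e-03: ~7 Mb/s
# loss 1e-04: ~22 Mb/s
# loss 1e-05: ~71 Mb/s
# Sustaining 1 Gb/s at 52ms needs loss below roughly 5e-8, so recurring
# bursts of drops every few seconds pin a single flow to tens of Mb/s.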


Re: Lossy cogent p2p experiences? [ In reply to ]
On Thu, 31 Aug 2023 at 23:56, Eric Kuhnke <eric.kuhnke@gmail.com> wrote:

> The best working theory that several people I know in the neteng community have come up with is that Cogent does not want to adversely impact all other customers on a router in sites where the upstreams and links to neighboring POPs are implemented as something like 4 x 10 Gbps, i.e., places where they have not upgraded that specific router to a full 100 Gbps upstream. Because per-flow hashing pins each flow to a single member link, moving large flows of >2 Gbps could flat-top the traffic chart on just one of those 10 Gbps circuits.

It is a very plausible theory, and everyone has this problem to a
lesser or greater degree. There was a time when edge interfaces were
much lower capacity than backbone interfaces, but I don't think that
time will ever come back. So this problem is systemic.
Luckily there is quite a reasonable solution to the problem, called
'adaptive load balancing', where software monitors balancing, and
biases the hash_result => egress_interface tables to improve balancing
when dealing with elephant flows.
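Schematically, the idea looks something like this (a sketch only, not
any vendor's actual implementation):

# Static hashing maps each hash bucket to a fixed member link. Adaptive
# balancing keeps measuring per-bucket rates and re-points the hottest
# bucket at the least-loaded link when that improves the spread.
NUM_BUCKETS, NUM_LINKS = 256, 4
table = [b % NUM_LINKS for b in range(NUM_BUCKETS)]  # hash_result => egress_interface
bucket_bps = [0.0] * NUM_BUCKETS                     # measured rate per bucket

def rebalance() -> None:
    load = [0.0] * NUM_LINKS
    for b in range(NUM_BUCKETS):
        load[table[b]] += bucket_bps[b]
    hot = max(range(NUM_BUCKETS), key=lambda b: bucket_bps[b])  # elephant's bucket
    cool = min(range(NUM_LINKS), key=lambda l: load[l])         # idlest link
    if load[cool] + bucket_bps[hot] < load[table[hot]]:         # only if it helps
        table[hot] = cool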

--
++ytti
Re: Lossy cogent p2p experiences? [ In reply to ]
On 9/1/23 10:50, Saku Ytti wrote:

> It is a very plausible theory, and everyone has this problem to a
> lesser or greater degree. There was a time when edge interfaces were
> much lower capacity than backbone interfaces, but I don't think that
> time will ever come back. So this problem is systemic.
> Luckily there is quite a reasonable solution to the problem, called
> 'adaptive load balancing', where software monitors balancing, and
> biases the hash_result => egress_interface tables to improve balancing
> when dealing with elephant flows.

We didn't have much success with FAT (flow-aware transport pseudowires,
RFC 6391) when the PE was an MX480 and the P a CRS-X (FP40 + FP140 line
cards). This was regardless of whether the core links were native
IP/MPLS or 802.1AX.

When we switched our P devices to PTX1000 and PTX10001, we've had
surprisingly good performance of all manner of traffic across native
IP/MPLS and 802.1AX links, even without explicitly configuring FAT for
EoMPLS traffic.

Of course, our policy is to never transport EoMPLS services in excess of
40Gbps. Once a customer requires 41Gbps of EoMPLS service or more, we
move them to EoDWDM. Cheaper and more scalable that way. It does help
that we operate both a Transport and IP/MPLS network, but I understand
this may not be the case for most networks.

Mark.
Re: Lossy cogent p2p experiences? [ In reply to ]
On Fri, 1 Sept 2023 at 14:54, Mark Tinka <mark@tinka.africa> wrote:

> When we switched our P devices to PTX1000 and PTX10001, we've had
> surprisingly good performance of all manner of traffic across native
> IP/MPLS and 802.1AX links, even without explicitly configuring FAT for
> EoMPLS traffic.

PTX and MX as LSRs look inside the pseudowire to see if it's IP (a
dangerous guess for an LSR to make); CRS/ASR9k does not. So a PTX or MX
LSR will balance your pseudowire even without FAT. I've had no problem
having an ASR9k LSR balance FAT PWs.
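For reference, the guess in question is the classic first-nibble check,
roughly (simplified sketch; actual per-platform behaviour varies):

def lsr_payload_guess(payload: bytes) -> str:
    # Peek at the first nibble after the label stack: 4 looks like IPv4,
    # 6 looks like IPv6, anything else is treated as opaque.
    nibble = payload[0] >> 4
    if nibble == 4:
        return "ipv4"    # hash on the (assumed) IP header
    if nibble == 6:
        return "ipv6"
    return "opaque"      # hash on the label stack only

# The danger: an Ethernet pseudowire without a control word starts with
# a destination MAC, and a MAC beginning 0x4... or 0x6... gets misread
# as IP, feeding random bytes into the hash. A control word starts with
# a 0 nibble, which is why it disambiguates.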

However this is a bit of a sidebar, because the original problem is
about elephant flows, which FAT does not help with. But adaptive
balancing does.


--
++ytti
Re: Lossy cogent p2p experiences? [ In reply to ]
and I would say the OP wasn't even about elephant flows, just about a network that can't deliver anything acceptable.




-----
Mike Hammett
Intelligent Computing Solutions
http://www.ics-il.com

Midwest-IX
http://www.midwest-ix.com

Re: Lossy cogent p2p experiences? [ In reply to ]
On 9/1/23 15:29, Saku Ytti wrote:

> PTX and MX as LSRs look inside the pseudowire to see if it's IP (a
> dangerous guess for an LSR to make); CRS/ASR9k does not. So a PTX or
> MX LSR will balance your pseudowire even without FAT.

Yes, this was our conclusion as well after moving our core to PTX1000/10001.

Mark.
Re: Lossy cogent p2p experiences? [ In reply to ]
On Fri, 1 Sept 2023 at 16:46, Mark Tinka <mark@tinka.africa> wrote:

> Yes, this was our conclusion as well after moving our core to PTX1000/10001.

Personally I would recommend turning off LSR payload heuristics,
because there is no accurate way for an LSR to tell what a label is
carrying, and a wrong guess, while rare, will be extremely hard to
root-cause: you will never hear about it, because the person suffering
from it is too many hops away for the problem to appear on your horizon.
I strongly believe the edge imposing entropy or FAT labels is the right
way to give LSRs hashing hints.
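The edge, which can legitimately parse the customer frame, hashes the
flow once and exposes the result to the core as an extra label, so
transit LSRs balance on the label stack alone. A sketch of the idea
(label values hypothetical; see RFC 6790 for entropy labels and RFC
6391 for FAT):

import hashlib

def impose_entropy(lsp_label: int, pw_label: int, flow_key: bytes) -> list[int]:
    # Ingress PE derives an entropy label from the flow and pushes it
    # with the ELI so transit LSRs need no payload guessing at all.
    ELI = 7                                  # entropy label indicator
    el = 16 + int.from_bytes(hashlib.sha1(flow_key).digest()[:3],
                             "big") % (2**20 - 16)
    return [lsp_label, ELI, el, pw_label]

# e.g. impose_entropy(100042, 299776, b"dst-mac|src-mac|vlan")
# Different flows get different entropy labels and spread across core
# links, while every packet of one flow keeps the same label, so there
# is no reordering within a flow.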


--
++ytti
Re: Lossy cogent p2p experiences? [ In reply to ]
I wouldn't call 50 megabit/s an elephant flow




Re: Lossy cogent p2p experiences? [ In reply to ]
On 9/1/23 15:44, Mike Hammett wrote:
> and I would say the OP wasn't even about elephant flows, just about a
> network that can't deliver anything acceptable.

Unless Cogent are not trying to accept (and by extension, may not be
able to guarantee) large Ethernet flows because they can't balance them
across their various core links, end-to-end...

Pure conjecture...

Mark.
Re: Lossy cogent p2p experiences? [ In reply to ]
The initial and recurring packet loss occurs on any flow of more than ~140 Mbit/sec. The fact that it's loss-free under that rate is what furthers my opinion that it's config-based somewhere, even though they say it isn't.

Re: Lossy cogent p2p experiences? [ In reply to ]
On Fri, 1 Sept 2023 at 15:55, Saku Ytti <saku@ytti.fi> wrote:
>
> On Fri, 1 Sept 2023 at 16:46, Mark Tinka <mark@tinka.africa> wrote:
>
> > Yes, this was our conclusion as well after moving our core to PTX1000/10001.
>
> Personally I would recommend turning off LSR payload heuristics,
> because there is no accurate way for an LSR to tell what a label is
> carrying, and a wrong guess, while rare, will be extremely hard to
> root-cause: you will never hear about it, because the person suffering
> from it is too many hops away for the problem to appear on your horizon.
> I strongly believe the edge imposing entropy or FAT labels is the right
> way to give LSRs hashing hints.

If you need to load-balance labelled IP traffic though, all your edge
devices would have to impose entropy/FAT labels.

On the other hand, a workaround at the edge, at least for EoMPLS, would
be to enable the control word.


Lukas
Re: Lossy cogent p2p experiences? [ In reply to ]
On Fri, 1 Sept 2023 at 18:37, Lukas Tribus <lukas@ltri.eu> wrote:

> On the other hand, a workaround at the edge, at least for EoMPLS,
> would be to enable the control word.

A Juniper LSR can actually do heuristics on pseudowires even with a CW.

--
++ytti
RE: Lossy cogent p2p experiences? [ In reply to ]
Yes, adaptive load balancing very much helps, but the weakness is that it is normally only fully supported on vendor silicon, not merchant silicon. Much of the transport edge is merchant silicon, due to the per-packet cost being far lower and the general requirement to just pass, not manipulate, packets. Using Nokia kit as an example, the 7750 does a great job of "adaptive-load-balancing", but the 7250 is lacklustre at best.

Re: Lossy cogent p2p experiences? [ In reply to ]
On 9/1/23 15:59, Mike Hammett wrote:

> I wouldn't call 50 megabit/s an elephant flow

Fair point.

Mark.
Re: Lossy cogent p2p experiences? [ In reply to ]
It doesn't help the OP at all, but this is why (thus far, anyway), I overwhelmingly prefer wavelength transport to anything switched. Can't have over-subscription or congestion issues on a wavelength.




Re: Lossy cogent p2p experiences? [ In reply to ]
On 9/1/23 15:55, Saku Ytti wrote:

> Personally I would recommend turning off LSR payload heuristics,
> because there is no accurate way for an LSR to tell what a label is
> carrying, and a wrong guess, while rare, will be extremely hard to
> root-cause: you will never hear about it, because the person suffering
> from it is too many hops away for the problem to appear on your horizon.
> I strongly believe the edge imposing entropy or FAT labels is the right
> way to give LSRs hashing hints.

PTX1000/10001 (Express) offers no real configurable options for load
balancing the same way MX (Trio) does. This is what took us by surprise.

This is all we have on our PTX:

tinka@router# show forwarding-options
family inet6 {
    route-accounting;
}
load-balance-label-capability;

[edit]
tinka@router#

Mark.
Re: Lossy cogent p2p experiences? [ In reply to ]
On 9/1/23 21:52, Mike Hammett wrote:

> It doesn't help the OP at all, but this is why (thus far, anyway), I
> overwhelmingly prefer wavelength transport to anything switched. Can't
> have over-subscription or congestion issues on a wavelength.

Large IP/MPLS operators insist on optical transport for their own
backbone, but are more than willing to sell packet for transport. I find
this amusing :-).

I submit that customers who can't afford large links (1Gbps or below)
are forced into EoMPLS transport due to cost.

Other customers are also forced into EoMPLS transport because there is
no other option for long haul transport in their city other than a
provider who can only offer EoMPLS.

There is a struggling trend from some medium-sized operators looking to
turn an optical network into a packet network, i.e., they will ask for a
100Gbps EoDWDM port, but only seek to pay for a 25Gbps service. The
large port is to allow them to scale in the future without too much
hassle, but they want to pay only for the bandwidth they use, which is
hard to limit anyway if it's a proper EoDWDM channel. I am swatting such
requests away because, on the majority of hardware that does pure
EoDWDM, you tie up a full 100Gbps channel on the line side, which
contradicts the reason a packet network makes sense for sub-rate
services.

Mark.
Re: Lossy cogent p2p experiences? [ In reply to ]
On Fri, 1 Sept 2023 at 22:56, Mark Tinka <mark@tinka.africa> wrote:

> PTX1000/10001 (Express) offers no real configurable options for load
> balancing the same way MX (Trio) does. This is what took us by surprise.

What in particular are you missing?

As I explained, PTX/MX both allow, for example, speculating on transit
pseudowires having a CW on them, which is non-default and requires
'zero-control-word'. You should be looking at 'hash-key' on PTX and
'enhanced-hash-key' on MX. You don't appear to have a single stanza
configured, but I do wonder what you wanted to configure when you
noticed the missing ability to do so.

--
++ytti
Re: Lossy cogent p2p experiences? [ In reply to ]
On 9/2/23 08:43, Saku Ytti wrote:

> What in particular are you missing?
> As I explained, PTX/MX both allow, for example, speculating on transit
> pseudowires having a CW on them, which is non-default and requires
> 'zero-control-word'. You should be looking at 'hash-key' on PTX and
> 'enhanced-hash-key' on MX. You don't appear to have a single stanza
> configured, but I do wonder what you wanted to configure when you
> noticed the missing ability to do so.

Sorry for the confusion - let me provide some background context since
we deployed the PTX ages ago (and core nodes are typically boring).

The issue we ran into was to do with our deployment tooling, which was
based on the 'enhanced-hash-key' that is required for MPCs on the MX.

The tooling used to deploy the PTX was largely built on what we use to
deploy the MX, with tweaks for the critically different items. At the
time, we did not know that the PTX required 'hash-key' as opposed to
'enhanced-hash-key'. So nothing got deployed on the PTX specifically
for load balancing (we may have assumed it to be a non-existent or
incomplete feature at the time).

So the "surprise" I speak of is how well it all worked with load
balancing across LAGs and EoMPLS traffic compared to the CRS-X, despite
not having any load-balancing features explicitly configured, which is
still the case today.

It works, so we aren't keen to break it.

Mark.
Re: Lossy cogent p2p experiences? [ In reply to ]
Mark Tinka wrote:

> On 9/1/23 15:59, Mike Hammett wrote:
>
>> I wouldn't call 50 megabit/s an elephant flow
>
> Fair point.

Both of you are totally wrong, because the proper thing to do here, if
any policing is done at all, is to police based on total traffic
without detecting any flow.

A hundred 50Mbps flows are as harmful as one 5Gbps flow.
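In other words, a single token bucket on the aggregate, with no
per-flow state at all (a schematic sketch):

import time

class AggregatePolicer:
    # One token bucket for the whole port; no flow detection anywhere.
    def __init__(self, rate_bps: float, burst_bytes: float):
        self.rate = rate_bps / 8       # refill rate, bytes per second
        self.burst = self.tokens = burst_bytes
        self.last = time.monotonic()

    def permit(self, pkt_len: int) -> bool:
        now = time.monotonic()
        self.tokens = min(self.burst,
                          self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= pkt_len:
            self.tokens -= pkt_len
            return True
        return False                   # drop: the aggregate is over contract

# The bucket cannot tell one 5Gbps flow from a hundred 50Mbps flows;
# only the total arrival rate matters.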

Moreover, as David Hubbard wrote:

> I’ve got a non-rate-limited 10gig circuit

there is no point of policing.

Detection of elephant flows was wrongly considered useful in
flow-driven architectures, to automatically bypass L3 processing for
those flows, back when L3 processing capability was wrongly considered
limited.

Then the topology-driven architecture of MPLS appeared, even though
topology-driven is really flow-driven (you can't put the inner MPLS
labels on without knowing detailed routing information at the
destinations, which is hidden at the source through route aggregation,
so it must be obtained on demand after detecting flows).

Masataka Ohta
Re: Lossy cogent p2p experiences? [ In reply to ]
On 9/2/23 17:04, Masataka Ohta wrote:

> Both of you are totally wrong, because the proper thing to do here,
> if any policing is done at all, is to police based on total traffic
> without detecting any flow.

I don't think it's as much an issue of flow detection as it is the
core's ability to balance the Layer 2 payload across multiple links
effectively.

At our shop, we understand the limitations of trying to carry large
EoMPLS flows across an IP/MPLS network that is, primarily, built to
carry IP traffic.

While some vendors have implemented adaptive load balancing algorithms
on decent (if not custom) silicon that can balance EoMPLS flows as well
as they can IP flows, it is hit & miss depending on the code, hardware,
vendor, etc.

In our case, our ability to load balance EoMPLS flows as well as we do
IP flows has improved since we moved to the PTX1000/10001 for our core
routers. But even then, we will not sell anything above 40Gbps as an
EoMPLS service. Once it gets there, it's time for EoDWDM. At least, until
800Gbps or 1Tbps Ethernet ports become both technically viable and
commercially feasible.

For as long as core links are based on 100Gbps and 400Gbps ports,
optical carriage for 40Gbps and above is more sensible than EoMPLS.

Mark.
Re: Lossy cogent p2p experiences? [ In reply to ]
Mark Tinka wrote:

> it is the
> core's ability to balance the Layer 2 payload across multiple links
> effectively.

Wrong. It can be performed only at the edges by policing total
incoming traffic without detecting flows.

> While some vendors have implemented adaptive load balancing algorithms

There are no such algorithms because, as I wrote:

: A hundred 50Mbps flows are as harmful as one 5Gbps flow.

Masataka Ohta
Re: Lossy cogent p2p experiences? [ In reply to ]
On 9/2/23 17:38, Masataka Ohta wrote:

> Wrong. It can be performed only at the edges by policing total
> incoming traffic without detecting flows.

I am not talking about policing in the core, I am talking about
detection in the core.

Policing at the edge is pretty standard. You can police a 50Gbps EoMPLS
flow coming in from a customer port at the edge. But if you've got N x
10Gbps links in the core, and the core is unable to inspect that flow
deeply enough to hash it across all those 10Gbps links, you can end up
putting all, or a good chunk, of that 50Gbps of EoMPLS traffic onto a
single 10Gbps link in the core, despite all the other 10Gbps links
having ample capacity available.


> There are no such algorithms because, as I wrote:
>
> : A hundred 50Mbps flows are as harmful as one 5Gbps flow.

Do you operate a large scale IP/MPLS network? Because I do, and I know
what I see with the equipment we deploy.

You are welcome to deny it all you want, however. Not much I can do
about that.

Mark.
