Mailing List Archive

Level3 Chicago
I'm seeing some minor packet loss traversing Level3 in Chicago.

We connect to their network in Detroit, and we're clean until the
first hop into Chicago.

Anyone else seeing anything like that?



---
Clayton Zekelman
Managed Network Systems Inc. (MNSi)
344-300 Tecumseh Rd. E.
Windsor, Ontario
N8X 5E8

tel. 519-985-8410
fax. 519-985-8409
Level3 Chicago [ In reply to ]
On Tue, 2009-08-18 at 13:05 -0400, Clayton Zekelman wrote:
> I'm seeing some minor packet loss traversing Level3 in Chicago.
>
> We connect to their network in Detroit, and we're clean until the
> first hop into Chicago.
>
> Anyone else seeing anything like that?
Out of downtown Chicago I see the following:

sudo traceroute-nanog www.mnsi.net
traceroute to www.mnsi.net (216.8.137.229), 64 hops max, 40 byte packets
1 10.10.10.1 (10.10.10.1) 1 ms 7 ms 4 ms
2 10.20.0.1 (10.20.0.1) 13 ms 9 ms 7 ms
3 vl2.aggr1.chgo.il.rcn.net (207.229.191.130) 8 ms 11 ms 7 ms
4 tge3-1.border2.eqnx.il.rcn.net (207.172.19.159) 12 ms 17 ms 15 ms
5 te-8-3.car3.Chicago1.Level3.net (4.71.101.73) 9 ms 7 ms 13 ms
6 ae-32-52.ebr2.Chicago1.Level3.net (4.68.101.62) 8 ms 13 ms 18 ms
7 ae-8-8.car1.Detroit1.Level3.net (4.69.133.241) 14 ms 14 ms 16 ms
8 ae-11-11.car2.Detroit1.Level3.net (4.69.133.246) 16 ms 14 ms 18
ms
9 MANAGED-NET.car2.Detroit1.Level3.net (64.152.144.78) 16 ms 15 ms
17 ms
L3 looks clean from me to you,or as close as I can get, might the
problem be in Detroit? Anyone?
Richard Golodner
Level3 Chicago [ In reply to ]
Further to the issue below - I'm typically seeing packet loss at hop
5. This particular trace doesn't show it though, although there are
timeouts on hop 6.

1 br0.win.mnsi.net (216.8.137.193) 0 msec 0 msec 0 msec
2 ge-6-12-222.car2.Detroit1.Level3.net (64.152.144.77) 0 msec 4 msec 0 msec
3 ae-11-11.car1.Detroit1.Level3.net (4.69.133.245) 4 msec 0 msec 4 msec
4 ae-8-8.ebr2.Chicago1.Level3.net (4.69.133.242) 8 msec 4 msec 16 msec
5 ae-5.ebr2.Chicago2.Level3.net (4.69.140.194) 8 msec 8 msec 8 msec
6 * *
ae-2-2.ebr2.Washington1.Level3.net (4.69.132.70) 32 msec
7 ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 36 msec
ae-82-82.csw3.Washington1.Level3.net (4.69.134.154) 36 msec
ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 32 msec
8 ae-2-79.edge2.Washington4.Level3.net (4.68.17.83) 28 msec
ae-3-89.edge2.Washington4.Level3.net (4.68.17.147) 40 msec
ae-1-69.edge2.Washington4.Level3.net (4.68.17.19) 24 msec
9 savvis-level3-te.Washington1.Level3.net (4.68.110.102) 32 msec
28 msec 40 msec
10 * * *
11 cr1-bundle-pos1.NewYork.savvis.net (204.70.196.137) 28 msec * 28 msec
12 206.24.199.2 48 msec 32 msec 32 msec
13 toroonnldr01.bb.telus.com (154.11.6.71) 32 msec 192 msec 36 msec
14 host82.145.115.209.in-addr.arpa (209.115.145.82) 40 msec 40 msec 40 msec
15 209.202.116.58 40 msec 48 msec 36 msec
16 www.govital.net (209.202.88.24) 36 msec 40 msec 36
msec


At 01:05 PM 8/18/2009, Clayton Zekelman wrote:

>I'm seeing some minor packet loss traversing Level3 in Chicago.
>
>We connect to their network in Detroit, and we're clean until the
>first hop into Chicago.
>
>Anyone else seeing anything like that?
>
>
>
>---
>Clayton Zekelman
>Managed Network Systems Inc. (MNSi)
>344-300 Tecumseh Rd. E.
>Windsor, Ontario
>N8X 5E8
>
>tel. 519-985-8410
>fax. 519-985-8409
>
>_______________________________________________
>outages mailing list
>outages at outages.org
>https://puck.nether.net/mailman/listinfo/outages
>
>
>No virus found in this incoming message.
>Checked by AVG - www.avg.com
>Version: 8.5.409 / Virus Database: 270.13.60/2311 - Release Date:
>08/18/09 06:03:00

---
Clayton Zekelman
Managed Network Systems Inc. (MNSi)
344-300 Tecumseh Rd. E.
Windsor, Ontario
N8X 5E8

tel. 519-985-8410
fax. 519-985-8409
Level3 Chicago [ In reply to ]
It would be helpful if you could use something like mtr instead of
traceroute in this case. The below trace could be indicating ICMP
de-prioritisation on L3 routers, which is known to be enabled, but could
also be an indicator of packet loss starting at hop #5 and "trickling
down" through succeeding hops (possibly #10 or #11).

mtr --order=LSRNABW can help in diagnosing this.

--
| Jeremy Chadwick jdc at parodius.com |
| Parodius Networking http://www.parodius.com/ |
| UNIX Systems Administrator Mountain View, CA, USA |
| Making life hard for others since 1977. PGP: 4BD6C0CB |

On Tue, Aug 18, 2009 at 01:36:58PM -0400, Clayton Zekelman wrote:
>
> Further to the issue below - I'm typically seeing packet loss at hop
> 5. This particular trace doesn't show it though, although there are
> timeouts on hop 6.
>
> 1 br0.win.mnsi.net (216.8.137.193) 0 msec 0 msec 0 msec
> 2 ge-6-12-222.car2.Detroit1.Level3.net (64.152.144.77) 0 msec 4 msec 0 msec
> 3 ae-11-11.car1.Detroit1.Level3.net (4.69.133.245) 4 msec 0 msec 4 msec
> 4 ae-8-8.ebr2.Chicago1.Level3.net (4.69.133.242) 8 msec 4 msec 16 msec
> 5 ae-5.ebr2.Chicago2.Level3.net (4.69.140.194) 8 msec 8 msec 8 msec
> 6 * *
> ae-2-2.ebr2.Washington1.Level3.net (4.69.132.70) 32 msec
> 7 ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 36 msec
> ae-82-82.csw3.Washington1.Level3.net (4.69.134.154) 36 msec
> ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 32 msec
> 8 ae-2-79.edge2.Washington4.Level3.net (4.68.17.83) 28 msec
> ae-3-89.edge2.Washington4.Level3.net (4.68.17.147) 40 msec
> ae-1-69.edge2.Washington4.Level3.net (4.68.17.19) 24 msec
> 9 savvis-level3-te.Washington1.Level3.net (4.68.110.102) 32 msec
> 28 msec 40 msec
> 10 * * *
> 11 cr1-bundle-pos1.NewYork.savvis.net (204.70.196.137) 28 msec * 28 msec
> 12 206.24.199.2 48 msec 32 msec 32 msec
> 13 toroonnldr01.bb.telus.com (154.11.6.71) 32 msec 192 msec 36 msec
> 14 host82.145.115.209.in-addr.arpa (209.115.145.82) 40 msec 40 msec 40 msec
> 15 209.202.116.58 40 msec 48 msec 36 msec
> 16 www.govital.net (209.202.88.24) 36 msec 40 msec 36 msec
>
>
> At 01:05 PM 8/18/2009, Clayton Zekelman wrote:
>
> >I'm seeing some minor packet loss traversing Level3 in Chicago.
> >
> >We connect to their network in Detroit, and we're clean until the
> >first hop into Chicago.
> >
> >Anyone else seeing anything like that?
> >
> >
> >
> >---
> >Clayton Zekelman
> >Managed Network Systems Inc. (MNSi)
> >344-300 Tecumseh Rd. E.
> >Windsor, Ontario
> >N8X 5E8
> >
> >tel. 519-985-8410
> >fax. 519-985-8409
> >
> >_______________________________________________
> >outages mailing list
> >outages at outages.org
> >https://puck.nether.net/mailman/listinfo/outages
> >
> >
> >No virus found in this incoming message.
> >Checked by AVG - www.avg.com
> >Version: 8.5.409 / Virus Database: 270.13.60/2311 - Release Date:
> >08/18/09 06:03:00
>
> ---
> Clayton Zekelman
> Managed Network Systems Inc. (MNSi)
> 344-300 Tecumseh Rd. E.
> Windsor, Ontario
> N8X 5E8
>
> tel. 519-985-8410
> fax. 519-985-8409
>
> _______________________________________________
> outages mailing list
> outages at outages.org
> https://puck.nether.net/mailman/listinfo/outages
Level3 Chicago [ In reply to ]
I'm here in Kansas City, KS region and a trace to that IP in Detroit for
Level3 shows this with the flags you have added for mtr. Level3 is just
having a bad week me thinks.

fed11.home.lan (0.0.0.0) Tue Aug 18
13:28:42 2009
Keys: Help Display mode Restart statistics Order of fields quit
Packets Pings
Host Loss% Snt Rcv Last Avg
Best Wrst
1. router.home.lan 0.0% 274 274 0.2 0.2
0.1 0.3
2. ks-76-7-1-1.sta.embarqhsd.net 0.0% 274 274 7.7 8.3
6.2 114.8
3. ks-76-7-255-241.sta.embarqhsd.net 0.0% 274 274 8.4 8.6
6.1 114.4
4. ge-7-19.car1.StLouis1.Level3.net 41.5% 273 159 34.1 26.6
12.6 213.9
5. ae-11-11.car2.StLouis1.Level3.net 47.3% 273 144 12.6 25.9
12.6 205.6
6. ae-4-4.ebr2.Chicago1.Level3.net 1.1% 273 270 19.2 26.4
18.5 140.8
7. ae-8-8.car1.Detroit1.Level3.net 0.0% 273 273 27.8 38.8
24.1 229.2
8. ge-6-12-222.car2.Detroit1.Level3. 0.0% 273 273 26.5 39.1
24.1 239.0

--Brett


Jeremy Chadwick wrote:
> It would be helpful if you could use something like mtr instead of
> traceroute in this case. The below trace could be indicating ICMP
> de-prioritisation on L3 routers, which is known to be enabled, but could
> also be an indicator of packet loss starting at hop #5 and "trickling
> down" through succeeding hops (possibly #10 or #11).
>
> mtr --order=LSRNABW can help in diagnosing this.
>
Level3 Chicago [ In reply to ]
The packet loss shown at hops #4, #5, and possibly #6 is either ICMP
deprioritisation or high CPU utilisation on said routers. My vote is
the lesser, given that it's common these days, and most backbone
providers are known to use it (L3, Abovenet, Sprint, Verizon/MCI, and
AT&T just to name a few).

The 18ms jump between hop #3 and #4 is likely normal due to geographic
distance, though I don't know where hop #3 is located; #4 is obviously
Missouri. The same goes for the 12ms increase between #6 and #7

If this evidence was presented to Level 3, I can assure you they'd tell
you the same thing. Here's a similar example showing the exact
behaviour I describe but within the Comcast network. Note hop #6:

HOST: icarus.home.lan Loss% Snt Rcv Last Avg Best Wrst
1. -------------- 0.0% 45 45 0.3 0.4 0.3 2.4
2. ??? 100.0 45 0 0.0 0.0 0.0 0.0
3. 68.85.191.253 0.0% 45 45 8.4 8.1 6.7 11.8
4. 68.85.154.149 0.0% 45 45 16.3 13.1 8.9 16.3
5. 68.86.90.137 0.0% 45 45 14.1 13.1 10.9 17.1
6. 68.86.85.181 97.8% 45 1 22.4 22.4 22.4 22.4
7. 4.71.118.9 0.0% 45 45 13.6 14.9 11.4 49.1
8. 4.68.18.195 0.0% 45 45 14.0 26.1 11.5 172.7
9. 4.79.219.106 0.0% 45 45 14.0 14.5 12.4 17.3
10. 209.128.95.111 0.0% 45 45 13.5 14.3 12.0 27.8
11. 72.20.109.194 0.0% 45 45 14.0 15.2 12.0 40.0
12. 72.20.106.125 0.0% 45 45 12.7 14.2 12.2 19.4

Here's another. Note hops #7, #8, and #9 (all Cogent), and also note in
this example the increased latency at hop #7 and #8 which doesn't
continue downwards through later hops:

HOST: icarus.home.lan Loss% Snt Rcv Last Avg Best Wrst
1. -------------- 0.0% 45 45 0.4 0.4 0.3 0.4
2. ??? 100.0 45 0 0.0 0.0 0.0 0.0
3. 68.85.191.253 0.0% 45 45 6.9 8.1 6.5 12.6
4. 68.85.154.149 0.0% 45 45 9.0 10.0 8.6 12.6
5. 68.86.91.225 0.0% 45 45 11.1 11.9 10.7 13.7
6. 68.86.85.78 0.0% 45 45 12.0 14.3 11.6 42.5
7. 154.54.11.105 68.9% 45 14 191.4 26.3 11.9 191.4
8. 154.54.28.81 51.1% 45 22 206.6 37.7 12.9 206.6
9. 66.28.4.150 46.7% 45 24 14.9 16.8 14.5 29.7
10. 38.112.39.114 0.0% 45 45 16.9 17.4 15.4 20.7
11. 38.104.134.30 0.0% 45 45 14.7 18.5 13.9 26.6
12. ??? 100.0 45 0 0.0 0.0 0.0 0.0

The easiest way to determine if there's a problem is whether or not the
loss witnessed at a hop continues down through succeeding hops. Here's
a real life example (IPs removed for security reasons):

HOST: ---------------------- Loss% Snt Last Avg Best Wrst StDev
1. --------------- 0.0% 60 6.8 6.0 0.4 85.9 16.1
2. --------------- 0.0% 60 1.0 48.1 0.6 320.0 76.4
3. 204.70.193.101 86.7% 60 29.3 122.4 23.5 290.8 106.1
4. 206.24.227.105 86.7% 60 50.7 9.4 1.3 50.7 17.5
5. 204.70.192.53 88.3% 60 15.0 11.3 1.6 15.6 6.6
6. 204.70.194.178 88.3% 60 15.3 25.4 15.1 34.7 9.8
7. 204.70.192.70 86.7% 60 34.7 40.9 34.6 84.2 17.5
8. 204.70.194.18 88.3% 60 35.0 34.9 34.7 35.1 0.1
9. 204.70.194.10 88.3% 60 34.9 35.0 34.7 35.6 0.3
10. 208.175.175.18 88.3% 60 35.7 35.6 35.2 36.2 0.4
11. 216.52.191.11 95.0% 60 35.8 35.7 35.6 35.8 0.1
12. --------------- 100.0 60 0.0 0.0 0.0 0.0 0.0

What you see here was an issue with SAVVIS. The root cause was a router
of theirs which rebooted unexpectedly.

The destination (hop #12) drops ICMP, so 100% loss there was completely
normal -- the rest wasn't. This was determined by comparing the loss to
historic data when the issue with SAVVIS wasn't occurring.

And one more example -- which just occurred about 10 minutes ago on my
home Comcast connection:

HOST: ---------------------- Loss% Snt Rcv Last Avg Best Wrst
1. --------------- 28.9% 45 32 0.5 0.6 0.3 5.6
2. ??? 100.0 45 0 0.0 0.0 0.0 0.0
3. 68.85.191.253 28.9% 45 32 9.8 12.4 6.8 134.1
4. 68.85.154.149 28.9% 45 32 10.6 10.7 8.7 16.1
5. 68.86.90.141 28.9% 45 32 12.8 13.3 10.7 36.0
6. 68.86.85.181 28.9% 45 32 13.1 14.4 11.7 29.4
7. 68.86.86.50 28.9% 45 32 16.6 17.1 15.3 21.8
8. 75.149.228.254 28.9% 45 32 27.2 20.6 14.7 40.3
9. 209.58.116.50 28.9% 45 32 28.2 17.3 14.5 34.4
10. 216.6.33.6 28.9% 45 32 17.8 17.2 15.1 23.1
11. 144.223.243.21 28.9% 45 32 16.5 17.5 15.9 21.3
12. 144.232.8.111 28.9% 45 32 20.0 20.8 19.1 25.5
13. 144.232.24.35 28.9% 45 32 23.2 24.3 22.8 28.9
14. 144.232.20.60 31.1% 45 31 76.5 78.9 75.5 110.0
15. 144.232.20.3 28.9% 45 32 76.3 76.7 74.9 81.4
16. 144.223.34.242 28.9% 45 32 76.6 76.7 74.9 81.7
17. 64.127.129.10 28.9% 45 32 107.7 89.0 85.8 107.7
18. 96.34.2.9 28.9% 45 32 87.8 88.3 86.0 100.6

What happened here? Based on my router logs, it appears DHCP expired
and numerous daemons on the router were automatically restarted (this is
normal in such a circumstance). It's possible that Comcast's DHCP
servers took too long to respond to a renewal and the router chose to
down/up the WAN interface, or possibly restart relevant daemons. Hard
to say -- the logging isn't as verbose as I'd like.

My cable modem log doesn't indicate LOS nor being rebooted, so this was
purely an IP-related issue, or issue with my own equipment.

--
| Jeremy Chadwick jdc at parodius.com |
| Parodius Networking http://www.parodius.com/ |
| UNIX Systems Administrator Mountain View, CA, USA |
| Making life hard for others since 1977. PGP: 4BD6C0CB |

On Tue, Aug 18, 2009 at 01:31:28PM -0500, Brett Cooper wrote:
> I'm here in Kansas City, KS region and a trace to that IP in Detroit
> for Level3 shows this with the flags you have added for mtr. Level3
> is just having a bad week me thinks.
>
> fed11.home.lan (0.0.0.0) Tue Aug 18
> 13:28:42 2009
> Keys: Help Display mode Restart statistics Order of fields quit
> Packets Pings
> Host Loss% Snt Rcv Last Avg Best Wrst
> 1. router.home.lan 0.0% 274 274 0.2 0.2 0.1 0.3
> 2. ks-76-7-1-1.sta.embarqhsd.net 0.0% 274 274 7.7 8.3 6.2 114.8
> 3. ks-76-7-255-241.sta.embarqhsd.net 0.0% 274 274 8.4 8.6 6.1 114.4
> 4. ge-7-19.car1.StLouis1.Level3.net 41.5% 273 159 34.1 26.6 12.6 213.9
> 5. ae-11-11.car2.StLouis1.Level3.net 47.3% 273 144 12.6 25.9 12.6 205.6
> 6. ae-4-4.ebr2.Chicago1.Level3.net 1.1% 273 270 19.2 26.4 18.5 140.8
> 7. ae-8-8.car1.Detroit1.Level3.net 0.0% 273 273 27.8 38.8 24.1 229.2
> 8. ge-6-12-222.car2.Detroit1.Level3. 0.0% 273 273 26.5 39.1 24.1 239.0
>
> --Brett
>
>
> Jeremy Chadwick wrote:
> >It would be helpful if you could use something like mtr instead of
> >traceroute in this case. The below trace could be indicating ICMP
> >de-prioritisation on L3 routers, which is known to be enabled, but could
> >also be an indicator of packet loss starting at hop #5 and "trickling
> >down" through succeeding hops (possibly #10 or #11).
> >
> >mtr --order=LSRNABW can help in diagnosing this.
> >
Level3 Chicago [ In reply to ]
---------------------------------------------
The packet loss shown at hops #4, #5, and possibly #6 is either ICMP
deprioritisation or high CPU utilisation on said routers. My vote is
---------------------------------------------


You could know this if you used tcptraceroute instead: michael.toren.net/code/tcptraceroute

There is something similar (but I can't remember the tool name) for micro$loth.

scott
Level3 Chicago [ In reply to ]
----- Original Message -----
From: "Scott Weeks"
Sent: Tuesday, August 18, 2009 10:55 AM
Subject: Re: [outages] Level3 Chicago


>
>
> ---------------------------------------------
> The packet loss shown at hops #4, #5, and possibly #6 is either ICMP
> deprioritisation or high CPU utilisation on said routers. My vote is
> ---------------------------------------------
>
>
> You could know this if you used tcptraceroute instead: michael.toren.net/code/tcptraceroute
>
> There is something similar (but I can't remember the tool name) for micro$loth.
>
> scott


Windows version is tracetcp.exe

http://tracetcp.sourceforge.net/

--Michael
Level3 Chicago [ In reply to ]
There is also lft or layer 4 traceroute

http://pwhois.org/lft/index.who

Richard


On Aug 18, 2009, at 2:55 PM, Michael Painter wrote:

> ----- Original Message ----- From: "Scott Weeks" Sent: Tuesday,
> August 18, 2009 10:55 AM
> Subject: Re: [outages] Level3 Chicago
>
>
>> ---------------------------------------------
>> The packet loss shown at hops #4, #5, and possibly #6 is either ICMP
>> deprioritisation or high CPU utilisation on said routers. My vote is
>> ---------------------------------------------
>> You could know this if you used tcptraceroute instead:
>> michael.toren.net/code/tcptraceroute
>> There is something similar (but I can't remember the tool name) for
>> micro$loth.
>> scott
>
>
> Windows version is tracetcp.exe
>
> http://tracetcp.sourceforge.net/
>
> --Michael
> _______________________________________________
> outages mailing list
> outages at outages.org
> https://puck.nether.net/mailman/listinfo/outages
Level3 Chicago [ In reply to ]
Is there version of mtr that include support for TCP or UDP types? A google
search didn't reveal anything to me, but would be the best of mtr with the
benefits of tcptraceroute.

Frank

-----Original Message-----
From: outages-bounces@outages.org [mailto:outages-bounces@outages.org] On
Behalf Of Jeremy Chadwick
Sent: Tuesday, August 18, 2009 12:50 PM
To: outages at outages.org
Subject: Re: [outages] Level3 Chicago

It would be helpful if you could use something like mtr instead of
traceroute in this case. The below trace could be indicating ICMP
de-prioritisation on L3 routers, which is known to be enabled, but could
also be an indicator of packet loss starting at hop #5 and "trickling
down" through succeeding hops (possibly #10 or #11).

mtr --order=LSRNABW can help in diagnosing this.

--
| Jeremy Chadwick jdc at parodius.com |
| Parodius Networking http://www.parodius.com/ |
| UNIX Systems Administrator Mountain View, CA, USA |
| Making life hard for others since 1977. PGP: 4BD6C0CB |

On Tue, Aug 18, 2009 at 01:36:58PM -0400, Clayton Zekelman wrote:
>
> Further to the issue below - I'm typically seeing packet loss at hop
> 5. This particular trace doesn't show it though, although there are
> timeouts on hop 6.
>
> 1 br0.win.mnsi.net (216.8.137.193) 0 msec 0 msec 0 msec
> 2 ge-6-12-222.car2.Detroit1.Level3.net (64.152.144.77) 0 msec 4 msec 0
msec
> 3 ae-11-11.car1.Detroit1.Level3.net (4.69.133.245) 4 msec 0 msec 4 msec
> 4 ae-8-8.ebr2.Chicago1.Level3.net (4.69.133.242) 8 msec 4 msec 16 msec
> 5 ae-5.ebr2.Chicago2.Level3.net (4.69.140.194) 8 msec 8 msec 8 msec
> 6 * *
> ae-2-2.ebr2.Washington1.Level3.net (4.69.132.70) 32 msec
> 7 ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 36 msec
> ae-82-82.csw3.Washington1.Level3.net (4.69.134.154) 36 msec
> ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 32 msec
> 8 ae-2-79.edge2.Washington4.Level3.net (4.68.17.83) 28 msec
> ae-3-89.edge2.Washington4.Level3.net (4.68.17.147) 40 msec
> ae-1-69.edge2.Washington4.Level3.net (4.68.17.19) 24 msec
> 9 savvis-level3-te.Washington1.Level3.net (4.68.110.102) 32 msec
> 28 msec 40 msec
> 10 * * *
> 11 cr1-bundle-pos1.NewYork.savvis.net (204.70.196.137) 28 msec * 28 msec
> 12 206.24.199.2 48 msec 32 msec 32 msec
> 13 toroonnldr01.bb.telus.com (154.11.6.71) 32 msec 192 msec 36 msec
> 14 host82.145.115.209.in-addr.arpa (209.115.145.82) 40 msec 40 msec 40
msec
> 15 209.202.116.58 40 msec 48 msec 36 msec
> 16 www.govital.net (209.202.88.24) 36 msec 40 msec 36 msec
>
>
> At 01:05 PM 8/18/2009, Clayton Zekelman wrote:
>
> >I'm seeing some minor packet loss traversing Level3 in Chicago.
> >
> >We connect to their network in Detroit, and we're clean until the
> >first hop into Chicago.
> >
> >Anyone else seeing anything like that?
> >
> >
> >
> >---
> >Clayton Zekelman
> >Managed Network Systems Inc. (MNSi)
> >344-300 Tecumseh Rd. E.
> >Windsor, Ontario
> >N8X 5E8
> >
> >tel. 519-985-8410
> >fax. 519-985-8409
> >
> >_______________________________________________
> >outages mailing list
> >outages at outages.org
> >https://puck.nether.net/mailman/listinfo/outages
> >
> >
> >No virus found in this incoming message.
> >Checked by AVG - www.avg.com
> >Version: 8.5.409 / Virus Database: 270.13.60/2311 - Release Date:
> >08/18/09 06:03:00
>
> ---
> Clayton Zekelman
> Managed Network Systems Inc. (MNSi)
> 344-300 Tecumseh Rd. E.
> Windsor, Ontario
> N8X 5E8
>
> tel. 519-985-8410
> fax. 519-985-8409
>
> _______________________________________________
> outages mailing list
> outages at outages.org
> https://puck.nether.net/mailman/listinfo/outages
_______________________________________________
outages mailing list
outages at outages.org
https://puck.nether.net/mailman/listinfo/outages
Level3 Chicago [ In reply to ]
The closest thing I know of is PingPlotter for Windows.

I'll remind folks that TCP and UDP-based traceroute/mtr-like utilities
still rely on ICMP time exceeded (type 11) being sent out to obtain the
present path, so if folks are looking for a pure TCP or UDP-based
traceroute or equivalent, to avoid dropping of ICMP, they're barking up
the wrong tree. :-)

--
| Jeremy Chadwick jdc at parodius.com |
| Parodius Networking http://www.parodius.com/ |
| UNIX Systems Administrator Mountain View, CA, USA |
| Making life hard for others since 1977. PGP: 4BD6C0CB |

On Tue, Aug 18, 2009 at 10:42:14PM -0500, Frank Bulk wrote:
> Is there version of mtr that include support for TCP or UDP types? A google
> search didn't reveal anything to me, but would be the best of mtr with the
> benefits of tcptraceroute.
>
> Frank
>
> -----Original Message-----
> From: outages-bounces at outages.org [mailto:outages-bounces at outages.org] On
> Behalf Of Jeremy Chadwick
> Sent: Tuesday, August 18, 2009 12:50 PM
> To: outages at outages.org
> Subject: Re: [outages] Level3 Chicago
>
> It would be helpful if you could use something like mtr instead of
> traceroute in this case. The below trace could be indicating ICMP
> de-prioritisation on L3 routers, which is known to be enabled, but could
> also be an indicator of packet loss starting at hop #5 and "trickling
> down" through succeeding hops (possibly #10 or #11).
>
> mtr --order=LSRNABW can help in diagnosing this.
>
> --
> | Jeremy Chadwick jdc at parodius.com |
> | Parodius Networking http://www.parodius.com/ |
> | UNIX Systems Administrator Mountain View, CA, USA |
> | Making life hard for others since 1977. PGP: 4BD6C0CB |
>
> On Tue, Aug 18, 2009 at 01:36:58PM -0400, Clayton Zekelman wrote:
> >
> > Further to the issue below - I'm typically seeing packet loss at hop
> > 5. This particular trace doesn't show it though, although there are
> > timeouts on hop 6.
> >
> > 1 br0.win.mnsi.net (216.8.137.193) 0 msec 0 msec 0 msec
> > 2 ge-6-12-222.car2.Detroit1.Level3.net (64.152.144.77) 0 msec 4 msec 0
> msec
> > 3 ae-11-11.car1.Detroit1.Level3.net (4.69.133.245) 4 msec 0 msec 4 msec
> > 4 ae-8-8.ebr2.Chicago1.Level3.net (4.69.133.242) 8 msec 4 msec 16 msec
> > 5 ae-5.ebr2.Chicago2.Level3.net (4.69.140.194) 8 msec 8 msec 8 msec
> > 6 * *
> > ae-2-2.ebr2.Washington1.Level3.net (4.69.132.70) 32 msec
> > 7 ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 36 msec
> > ae-82-82.csw3.Washington1.Level3.net (4.69.134.154) 36 msec
> > ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 32 msec
> > 8 ae-2-79.edge2.Washington4.Level3.net (4.68.17.83) 28 msec
> > ae-3-89.edge2.Washington4.Level3.net (4.68.17.147) 40 msec
> > ae-1-69.edge2.Washington4.Level3.net (4.68.17.19) 24 msec
> > 9 savvis-level3-te.Washington1.Level3.net (4.68.110.102) 32 msec
> > 28 msec 40 msec
> > 10 * * *
> > 11 cr1-bundle-pos1.NewYork.savvis.net (204.70.196.137) 28 msec * 28 msec
> > 12 206.24.199.2 48 msec 32 msec 32 msec
> > 13 toroonnldr01.bb.telus.com (154.11.6.71) 32 msec 192 msec 36 msec
> > 14 host82.145.115.209.in-addr.arpa (209.115.145.82) 40 msec 40 msec 40
> msec
> > 15 209.202.116.58 40 msec 48 msec 36 msec
> > 16 www.govital.net (209.202.88.24) 36 msec 40 msec 36 msec
> >
> >
> > At 01:05 PM 8/18/2009, Clayton Zekelman wrote:
> >
> > >I'm seeing some minor packet loss traversing Level3 in Chicago.
> > >
> > >We connect to their network in Detroit, and we're clean until the
> > >first hop into Chicago.
> > >
> > >Anyone else seeing anything like that?
> > >
> > >
> > >
> > >---
> > >Clayton Zekelman
> > >Managed Network Systems Inc. (MNSi)
> > >344-300 Tecumseh Rd. E.
> > >Windsor, Ontario
> > >N8X 5E8
> > >
> > >tel. 519-985-8410
> > >fax. 519-985-8409
> > >
> > >_______________________________________________
> > >outages mailing list
> > >outages at outages.org
> > >https://puck.nether.net/mailman/listinfo/outages
> > >
> > >
> > >No virus found in this incoming message.
> > >Checked by AVG - www.avg.com
> > >Version: 8.5.409 / Virus Database: 270.13.60/2311 - Release Date:
> > >08/18/09 06:03:00
> >
> > ---
> > Clayton Zekelman
> > Managed Network Systems Inc. (MNSi)
> > 344-300 Tecumseh Rd. E.
> > Windsor, Ontario
> > N8X 5E8
> >
> > tel. 519-985-8410
> > fax. 519-985-8409
> >
> > _______________________________________________
> > outages mailing list
> > outages at outages.org
> > https://puck.nether.net/mailman/listinfo/outages
> _______________________________________________
> outages mailing list
> outages at outages.org
> https://puck.nether.net/mailman/listinfo/outages
Level3 Chicago [ In reply to ]
Good point. Is the ICMP rate-limiting of these largish routers ingress and
egress, or only ingress-only?

Frank

-----Original Message-----
From: outages-bounces@outages.org [mailto:outages-bounces@outages.org] On
Behalf Of Jeremy Chadwick
Sent: Wednesday, August 19, 2009 12:21 AM
To: outages at outages.org
Subject: Re: [outages] Level3 Chicago

The closest thing I know of is PingPlotter for Windows.

I'll remind folks that TCP and UDP-based traceroute/mtr-like utilities
still rely on ICMP time exceeded (type 11) being sent out to obtain the
present path, so if folks are looking for a pure TCP or UDP-based
traceroute or equivalent, to avoid dropping of ICMP, they're barking up
the wrong tree. :-)

--
| Jeremy Chadwick jdc at parodius.com |
| Parodius Networking http://www.parodius.com/ |
| UNIX Systems Administrator Mountain View, CA, USA |
| Making life hard for others since 1977. PGP: 4BD6C0CB |

On Tue, Aug 18, 2009 at 10:42:14PM -0500, Frank Bulk wrote:
> Is there version of mtr that include support for TCP or UDP types? A
google
> search didn't reveal anything to me, but would be the best of mtr with the
> benefits of tcptraceroute.
>
> Frank
>
> -----Original Message-----
> From: outages-bounces at outages.org [mailto:outages-bounces at outages.org] On
> Behalf Of Jeremy Chadwick
> Sent: Tuesday, August 18, 2009 12:50 PM
> To: outages at outages.org
> Subject: Re: [outages] Level3 Chicago
>
> It would be helpful if you could use something like mtr instead of
> traceroute in this case. The below trace could be indicating ICMP
> de-prioritisation on L3 routers, which is known to be enabled, but could
> also be an indicator of packet loss starting at hop #5 and "trickling
> down" through succeeding hops (possibly #10 or #11).
>
> mtr --order=LSRNABW can help in diagnosing this.
>
> --
> | Jeremy Chadwick jdc at parodius.com |
> | Parodius Networking http://www.parodius.com/ |
> | UNIX Systems Administrator Mountain View, CA, USA |
> | Making life hard for others since 1977. PGP: 4BD6C0CB |
>
> On Tue, Aug 18, 2009 at 01:36:58PM -0400, Clayton Zekelman wrote:
> >
> > Further to the issue below - I'm typically seeing packet loss at hop
> > 5. This particular trace doesn't show it though, although there are
> > timeouts on hop 6.
> >
> > 1 br0.win.mnsi.net (216.8.137.193) 0 msec 0 msec 0 msec
> > 2 ge-6-12-222.car2.Detroit1.Level3.net (64.152.144.77) 0 msec 4 msec 0
> msec
> > 3 ae-11-11.car1.Detroit1.Level3.net (4.69.133.245) 4 msec 0 msec 4
msec
> > 4 ae-8-8.ebr2.Chicago1.Level3.net (4.69.133.242) 8 msec 4 msec 16 msec
> > 5 ae-5.ebr2.Chicago2.Level3.net (4.69.140.194) 8 msec 8 msec 8 msec
> > 6 * *
> > ae-2-2.ebr2.Washington1.Level3.net (4.69.132.70) 32 msec
> > 7 ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 36 msec
> > ae-82-82.csw3.Washington1.Level3.net (4.69.134.154) 36 msec
> > ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 32 msec
> > 8 ae-2-79.edge2.Washington4.Level3.net (4.68.17.83) 28 msec
> > ae-3-89.edge2.Washington4.Level3.net (4.68.17.147) 40 msec
> > ae-1-69.edge2.Washington4.Level3.net (4.68.17.19) 24 msec
> > 9 savvis-level3-te.Washington1.Level3.net (4.68.110.102) 32 msec
> > 28 msec 40 msec
> > 10 * * *
> > 11 cr1-bundle-pos1.NewYork.savvis.net (204.70.196.137) 28 msec * 28
msec
> > 12 206.24.199.2 48 msec 32 msec 32 msec
> > 13 toroonnldr01.bb.telus.com (154.11.6.71) 32 msec 192 msec 36 msec
> > 14 host82.145.115.209.in-addr.arpa (209.115.145.82) 40 msec 40 msec 40
> msec
> > 15 209.202.116.58 40 msec 48 msec 36 msec
> > 16 www.govital.net (209.202.88.24) 36 msec 40 msec 36 msec
> >
> >
> > At 01:05 PM 8/18/2009, Clayton Zekelman wrote:
> >
> > >I'm seeing some minor packet loss traversing Level3 in Chicago.
> > >
> > >We connect to their network in Detroit, and we're clean until the
> > >first hop into Chicago.
> > >
> > >Anyone else seeing anything like that?
> > >
> > >
> > >
> > >---
> > >Clayton Zekelman
> > >Managed Network Systems Inc. (MNSi)
> > >344-300 Tecumseh Rd. E.
> > >Windsor, Ontario
> > >N8X 5E8
> > >
> > >tel. 519-985-8410
> > >fax. 519-985-8409
> > >
> > >_______________________________________________
> > >outages mailing list
> > >outages at outages.org
> > >https://puck.nether.net/mailman/listinfo/outages
> > >
> > >
> > >No virus found in this incoming message.
> > >Checked by AVG - www.avg.com
> > >Version: 8.5.409 / Virus Database: 270.13.60/2311 - Release Date:
> > >08/18/09 06:03:00
> >
> > ---
> > Clayton Zekelman
> > Managed Network Systems Inc. (MNSi)
> > 344-300 Tecumseh Rd. E.
> > Windsor, Ontario
> > N8X 5E8
> >
> > tel. 519-985-8410
> > fax. 519-985-8409
> >
> > _______________________________________________
> > outages mailing list
> > outages at outages.org
> > https://puck.nether.net/mailman/listinfo/outages
> _______________________________________________
> outages mailing list
> outages at outages.org
> https://puck.nether.net/mailman/listinfo/outages
_______________________________________________
outages mailing list
outages at outages.org
https://puck.nether.net/mailman/listinfo/outages
Level3 Chicago [ In reply to ]
That's an excellent question -- and one I've always wondered myself.

This is purely speculative, but I believe outbound ICMP (e.g. sent from
the router to whatever src solicited it) is what's de-prioritised.

Someone more familiar with Cisco and Juniper equipment might know for
certain.

--
| Jeremy Chadwick jdc at parodius.com |
| Parodius Networking http://www.parodius.com/ |
| UNIX Systems Administrator Mountain View, CA, USA |
| Making life hard for others since 1977. PGP: 4BD6C0CB |

On Wed, Aug 19, 2009 at 09:55:40AM -0500, Frank Bulk - iName.com wrote:
> Good point. Is the ICMP rate-limiting of these largish routers ingress and
> egress, or only ingress-only?
>
> Frank
>
> -----Original Message-----
> From: outages-bounces at outages.org [mailto:outages-bounces at outages.org] On
> Behalf Of Jeremy Chadwick
> Sent: Wednesday, August 19, 2009 12:21 AM
> To: outages at outages.org
> Subject: Re: [outages] Level3 Chicago
>
> The closest thing I know of is PingPlotter for Windows.
>
> I'll remind folks that TCP and UDP-based traceroute/mtr-like utilities
> still rely on ICMP time exceeded (type 11) being sent out to obtain the
> present path, so if folks are looking for a pure TCP or UDP-based
> traceroute or equivalent, to avoid dropping of ICMP, they're barking up
> the wrong tree. :-)
>
> --
> | Jeremy Chadwick jdc at parodius.com |
> | Parodius Networking http://www.parodius.com/ |
> | UNIX Systems Administrator Mountain View, CA, USA |
> | Making life hard for others since 1977. PGP: 4BD6C0CB |
>
> On Tue, Aug 18, 2009 at 10:42:14PM -0500, Frank Bulk wrote:
> > Is there version of mtr that include support for TCP or UDP types? A
> google
> > search didn't reveal anything to me, but would be the best of mtr with the
> > benefits of tcptraceroute.
> >
> > Frank
> >
> > -----Original Message-----
> > From: outages-bounces at outages.org [mailto:outages-bounces at outages.org] On
> > Behalf Of Jeremy Chadwick
> > Sent: Tuesday, August 18, 2009 12:50 PM
> > To: outages at outages.org
> > Subject: Re: [outages] Level3 Chicago
> >
> > It would be helpful if you could use something like mtr instead of
> > traceroute in this case. The below trace could be indicating ICMP
> > de-prioritisation on L3 routers, which is known to be enabled, but could
> > also be an indicator of packet loss starting at hop #5 and "trickling
> > down" through succeeding hops (possibly #10 or #11).
> >
> > mtr --order=LSRNABW can help in diagnosing this.
> >
> > --
> > | Jeremy Chadwick jdc at parodius.com |
> > | Parodius Networking http://www.parodius.com/ |
> > | UNIX Systems Administrator Mountain View, CA, USA |
> > | Making life hard for others since 1977. PGP: 4BD6C0CB |
> >
> > On Tue, Aug 18, 2009 at 01:36:58PM -0400, Clayton Zekelman wrote:
> > >
> > > Further to the issue below - I'm typically seeing packet loss at hop
> > > 5. This particular trace doesn't show it though, although there are
> > > timeouts on hop 6.
> > >
> > > 1 br0.win.mnsi.net (216.8.137.193) 0 msec 0 msec 0 msec
> > > 2 ge-6-12-222.car2.Detroit1.Level3.net (64.152.144.77) 0 msec 4 msec 0
> > msec
> > > 3 ae-11-11.car1.Detroit1.Level3.net (4.69.133.245) 4 msec 0 msec 4
> msec
> > > 4 ae-8-8.ebr2.Chicago1.Level3.net (4.69.133.242) 8 msec 4 msec 16 msec
> > > 5 ae-5.ebr2.Chicago2.Level3.net (4.69.140.194) 8 msec 8 msec 8 msec
> > > 6 * *
> > > ae-2-2.ebr2.Washington1.Level3.net (4.69.132.70) 32 msec
> > > 7 ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 36 msec
> > > ae-82-82.csw3.Washington1.Level3.net (4.69.134.154) 36 msec
> > > ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 32 msec
> > > 8 ae-2-79.edge2.Washington4.Level3.net (4.68.17.83) 28 msec
> > > ae-3-89.edge2.Washington4.Level3.net (4.68.17.147) 40 msec
> > > ae-1-69.edge2.Washington4.Level3.net (4.68.17.19) 24 msec
> > > 9 savvis-level3-te.Washington1.Level3.net (4.68.110.102) 32 msec
> > > 28 msec 40 msec
> > > 10 * * *
> > > 11 cr1-bundle-pos1.NewYork.savvis.net (204.70.196.137) 28 msec * 28
> msec
> > > 12 206.24.199.2 48 msec 32 msec 32 msec
> > > 13 toroonnldr01.bb.telus.com (154.11.6.71) 32 msec 192 msec 36 msec
> > > 14 host82.145.115.209.in-addr.arpa (209.115.145.82) 40 msec 40 msec 40
> > msec
> > > 15 209.202.116.58 40 msec 48 msec 36 msec
> > > 16 www.govital.net (209.202.88.24) 36 msec 40 msec 36 msec
> > >
> > >
> > > At 01:05 PM 8/18/2009, Clayton Zekelman wrote:
> > >
> > > >I'm seeing some minor packet loss traversing Level3 in Chicago.
> > > >
> > > >We connect to their network in Detroit, and we're clean until the
> > > >first hop into Chicago.
> > > >
> > > >Anyone else seeing anything like that?
> > > >
> > > >
> > > >
> > > >---
> > > >Clayton Zekelman
> > > >Managed Network Systems Inc. (MNSi)
> > > >344-300 Tecumseh Rd. E.
> > > >Windsor, Ontario
> > > >N8X 5E8
> > > >
> > > >tel. 519-985-8410
> > > >fax. 519-985-8409
> > > >
> > > >_______________________________________________
> > > >outages mailing list
> > > >outages at outages.org
> > > >https://puck.nether.net/mailman/listinfo/outages
> > > >
> > > >
> > > >No virus found in this incoming message.
> > > >Checked by AVG - www.avg.com
> > > >Version: 8.5.409 / Virus Database: 270.13.60/2311 - Release Date:
> > > >08/18/09 06:03:00
> > >
> > > ---
> > > Clayton Zekelman
> > > Managed Network Systems Inc. (MNSi)
> > > 344-300 Tecumseh Rd. E.
> > > Windsor, Ontario
> > > N8X 5E8
> > >
> > > tel. 519-985-8410
> > > fax. 519-985-8409
> > >
> > > _______________________________________________
> > > outages mailing list
> > > outages at outages.org
> > > https://puck.nether.net/mailman/listinfo/outages
> > _______________________________________________
> > outages mailing list
> > outages at outages.org
> > https://puck.nether.net/mailman/listinfo/outages
> _______________________________________________
> outages mailing list
> outages at outages.org
> https://puck.nether.net/mailman/listinfo/outages
Level3 Chicago [ In reply to ]
'Jeremy Chadwick' wrote:
> That's an excellent question -- and one I've always wondered myself.
>
> This is purely speculative, but I believe outbound ICMP (e.g. sent from
> the router to whatever src solicited it) is what's de-prioritised.
>
> Someone more familiar with Cisco and Juniper equipment might know for
> certain.

Usually packets destined to the control-plane of the system are
prioritized based on criteria. It is better to let routing control
protocols (e.g. ospf, bgp, isis) through first than someone pinging or
running a traceroute. Packets *through* the router take the normal
forwarding path and are not affected by this system.

There may be system defaults based on hardware/software, but Cisco has
CoPP and I believe Juniper uses a firewall on the lo0 interface (been a
while since I touched one) for user-defined rules.

--
Devon
Level3 Chicago [ In reply to ]
Yes, and when pure prioritization is not available, ISPs
will use rate limiting too which is one of the ways
Level 3 restricts ICMP to the cpu on the core devices.

Also, the car device is an edge router so there could be
congestion on a customer port too when higher response times
are seen on the other side of a hop. Response times that settle
could be the control plane/data plane issue or could be once
traffic gets to a far end there's an asymetric path that goes
a different return path rather than back across the link seen on
the forward traceroute. All these are why simple pings and
traceroutes don't always tell the story.

* Devon True was thought to have said:

> 'Jeremy Chadwick' wrote:
> > That's an excellent question -- and one I've always wondered myself.
> >
> > This is purely speculative, but I believe outbound ICMP (e.g. sent from
> > the router to whatever src solicited it) is what's de-prioritised.
> >
> > Someone more familiar with Cisco and Juniper equipment might know for
> > certain.
>
> Usually packets destined to the control-plane of the system are
> prioritized based on criteria. It is better to let routing control
> protocols (e.g. ospf, bgp, isis) through first than someone pinging or
> running a traceroute. Packets *through* the router take the normal
> forwarding path and are not affected by this system.
>
> There may be system defaults based on hardware/software, but Cisco has
> CoPP and I believe Juniper uses a firewall on the lo0 interface (been a
> while since I touched one) for user-defined rules.
>
> --
> Devon
>
> _______________________________________________
> outages mailing list
> outages at outages.org
> https://puck.nether.net/mailman/listinfo/outages