Mailing List Archive

Correctable ECC Error question...
Is there a way to map the information given by the ECC error
into a specific RAM module on my UltraSparc 2?

Jan 8 20:21:46 wormhole kernel: CPU[1]: Correctable ECC Error
AFSR[100000] AFAR[0000000046592940] UDBL[0] UDBH[0] TL>1[0]

Jan 8 21:57:17 wormhole kernel: CPU[1]: Correctable ECC Error
AFSR[100000] AFAR[0000000046592940] UDBL[0] UDBH[0] TL>1[0]

(message repeats, exactly, except for timestamps, 11 more times)

--
gentoo-sparc@lists.gentoo.org mailing list
Re: Correctable ECC Error question... [ In reply to ]
Leif Sawyer wrote:
> Is there a way to map the information given by the ECC error
> into a specific RAM module on my UltraSparc 2?
>
> Jan 8 20:21:46 wormhole kernel: CPU[1]: Correctable ECC Error
> AFSR[100000] AFAR[0000000046592940] UDBL[0] UDBH[0] TL>1[0]
>
> Jan 8 21:57:17 wormhole kernel: CPU[1]: Correctable ECC Error
> AFSR[100000] AFAR[0000000046592940] UDBL[0] UDBH[0] TL>1[0]
>
> (message repeats, exactly, except for timestamps, 11 more times)
>

http://dlc.sun.com/pdf/802-2561-11/802-2561-11.pdf

section 4.5

It took me a minute to find this on the sun site, it was weirdly named,
called "ultra2 service manual", go figure!, I can see why you didn't
find it yourself.




--
gentoo-sparc@lists.gentoo.org mailing list
RE: Correctable ECC Error question... [ In reply to ]
> From: Hamish Greig [mailto:hgreig@bigpond.net.au]
> Leif Sawyer wrote:
> > Is there a way to map the information given by the ECC error into a
> > specific RAM module on my UltraSparc 2?
> >
> > Jan 8 20:21:46 wormhole kernel: CPU[1]: Correctable ECC Error
> > AFSR[100000] AFAR[0000000046592940] UDBL[0] UDBH[0] TL>1[0]
> >
> > Jan 8 21:57:17 wormhole kernel: CPU[1]: Correctable ECC Error
> > AFSR[100000] AFAR[0000000046592940] UDBL[0] UDBH[0] TL>1[0]
> >
> > (message repeats, exactly, except for timestamps, 11 more times)
> >
>
> http://dlc.sun.com/pdf/802-2561-11/802-2561-11.pdf
>

Sure, service manual works great for Sun error codes, but I'm
trying to map the linux error code into the sparc line.

And what I -don't- see, aside from the register information,
is what the -address- of the fault was. Or maybe that's encoded
somehow in the statis bit?

I did try google to figure it out, but only found solaris references
that showed I should be seing and address, and linux folks asking
the same questions.

--
gentoo-sparc@lists.gentoo.org mailing list