Mailing List Archive

Abnormal processor temperature.
I have a system here running an Intel N97 processor, which is idling at 70-80C on Gentoo with all cores 99% idle. This is 40 degrees hotter than it runs on Ubuntu or Windows 10.

Powertop confirms that the CPU is spending nearly all of its time in idle mode. I have tried both the 6.1 and the 6.5 kernels, including the pre-compiled gentoo-kernel-bin version just to be sure it's not something in *my* config messing it up.

I'm not really sure where to look next. There doesn't seem to be anything actually *using* the CPU or GPU to any significant degree. Certainly not enough to explain the temperature difference. If anyone has advice on what diagnostics to try next to find out what's going on I'd appreciate it.

Thanks,
LMP
Re: Abnormal processor temperature. [ In reply to ]
On Tuesday, 21 November 2023 22:12:28 GMT Laurence Perkins wrote:
> I have a system here running an Intel N97 processor, which is idling at
> 70-80C on Gentoo with all cores 99% idle. This is 40 degrees hotter than
> it runs on Ubuntu or Windows 10.
>
> Powertop confirms that the CPU is spending nearly all of its time in idle
> mode. I have tried both the 6.1 and the 6.5 kernels, including the
> pre-compiled gentoo-kernel-bin version just to be sure it's not something
> in *my* config messing it up.
>
> I'm not really sure where to look next. There doesn't seem to be anything
> actually *using* the CPU or GPU to any significant degree. Certainly not
> enough to explain the temperature difference. If anyone has advice on what
> diagnostics to try next to find out what's going on I'd appreciate it.
>
> Thanks,
> LMP

The fact you get a report of 40° hotter temperature does not mean the CPU
temperature is actually higher. Do you get any other indications the PC is
running hot, e.g. the fans are spinning faster, the heatsink is hotter to the
touch, or when measured with an infrared thermometer?

Is the 'sensors' command output, after you have run sensors-detect, different
than Ubuntu's?

Is the content of /etc/sensors3.conf different between the two distros?

Are there different sensor modules loaded?

Some chip drivers report spurious results and need specifying a different
measurement method/sensor device. You can take a look here for you CPU/MoBo
chip:

https://www.kernel.org/doc/Documentation/hwmon/
RE: Abnormal processor temperature. [ In reply to ]
> -----Original Message-----
> From: Michael <confabulate@kintzios.com>
> Sent: Tuesday, November 21, 2023 4:15 PM
> To: gentoo-user@lists.gentoo.org
> Subject: Re: [gentoo-user] Abnormal processor temperature.
>
> On Tuesday, 21 November 2023 22:12:28 GMT Laurence Perkins wrote:
> > I have a system here running an Intel N97 processor, which is idling
> > at 70-80C on Gentoo with all cores 99% idle. This is 40 degrees
> > hotter than it runs on Ubuntu or Windows 10.
> >
> > Powertop confirms that the CPU is spending nearly all of its time in
> > idle mode. I have tried both the 6.1 and the 6.5 kernels, including
> > the pre-compiled gentoo-kernel-bin version just to be sure it's not
> > something in *my* config messing it up.
> >
> > I'm not really sure where to look next. There doesn't seem to be
> > anything actually *using* the CPU or GPU to any significant degree.
> > Certainly not enough to explain the temperature difference. If anyone
> > has advice on what diagnostics to try next to find out what's going on I'd appreciate it.
> >
> > Thanks,
> > LMP
>
> The fact you get a report of 40° hotter temperature does not mean the CPU temperature is actually higher. Do you get any other indications the PC is running hot, e.g. the fans are spinning faster, the heatsink is hotter to the touch, or when measured with an infrared thermometer?
>
> Is the 'sensors' command output, after you have run sensors-detect, different than Ubuntu's?
>
> Is the content of /etc/sensors3.conf different between the two distros?
>
> Are there different sensor modules loaded?
>
> Some chip drivers report spurious results and need specifying a different measurement method/sensor device. You can take a look here for you CPU/MoBo
> chip:
>
> https://www.kernel.org/doc/Documentation/hwmon/
>

Sorry, I forgot to mention that. The fans definitely run faster. Which is fortunate because checking with an IR thermometer on the CPU heatsink produces more varied results than the internal sensor, but still 30-40° hotter.

Driver modules loaded by each distro don't seem substantially different. Ubuntu has intel-spi loaded while Gentoo does not, but that's it for differences from lspci. I can look for other differences as well, but it doesn't seem to be a difference in sensor calibration unfortunately.

Also of note, this same stage4 OS snapshot runs on other systems without anything unusual happening, but this is the only configuration using the N97 processor.

LMP
Re: Abnormal processor temperature. [ In reply to ]
On Wednesday, 22 November 2023 01:34:00 GMT Laurence Perkins wrote:
> > -----Original Message-----
> > From: Michael <confabulate@kintzios.com>
> > Sent: Tuesday, November 21, 2023 4:15 PM
> > To: gentoo-user@lists.gentoo.org
> > Subject: Re: [gentoo-user] Abnormal processor temperature.
> >
> > On Tuesday, 21 November 2023 22:12:28 GMT Laurence Perkins wrote:
> >
> > > I have a system here running an Intel N97 processor, which is idling
> > > at 70-80C on Gentoo with all cores 99% idle. This is 40 degrees
> > > hotter than it runs on Ubuntu or Windows 10.
> > >
> > > Powertop confirms that the CPU is spending nearly all of its time in
> > > idle mode. I have tried both the 6.1 and the 6.5 kernels, including
> > > the pre-compiled gentoo-kernel-bin version just to be sure it's not
> > > something in *my* config messing it up.
> > >
> > > I'm not really sure where to look next. There doesn't seem to be
> > > anything actually *using* the CPU or GPU to any significant degree.
> > > Certainly not enough to explain the temperature difference. If anyone
> > > has advice on what diagnostics to try next to find out what's going on
> > > I'd appreciate it.

> > > Thanks,
> > > LMP
> >
> >
> > The fact you get a report of 40° hotter temperature does not mean the CPU
> > temperature is actually higher. Do you get any other indications the PC
> > is running hot, e.g. the fans are spinning faster, the heatsink is hotter
> > to the touch, or when measured with an infrared thermometer?

> > Is the 'sensors' command output, after you have run sensors-detect,
> > different than Ubuntu's?

> > Is the content of /etc/sensors3.conf different between the two distros?
> >
> > Are there different sensor modules loaded?
> >
> > Some chip drivers report spurious results and need specifying a different
> > measurement method/sensor device. You can take a look here for you
> > CPU/MoBo
chip:
> >
> > https://www.kernel.org/doc/Documentation/hwmon/
> >
> >
>
>
> Sorry, I forgot to mention that. The fans definitely run faster. Which is
> fortunate because checking with an IR thermometer on the CPU heatsink
> produces more varied results than the internal sensor, but still 30-40°
> hotter.

OK, it's definitely not a sensor misreading the temperature then.


> Driver modules loaded by each distro don't seem substantially different.
> Ubuntu has intel-spi loaded while Gentoo does not, but that's it for
> differences from lspci. I can look for other differences as well, but it
> doesn't seem to be a difference in sensor calibration unfortunately.

In the first instance I suggest you add SPI module(s) to your kernel. It
controls comms with sensors and peripherals, so it could well play an
important role in controlling the MoBo temperature.
Re: Abnormal processor temperature. [ In reply to ]
On 2023-11-21, Laurence Perkins <lperkins@openeye.net> wrote:

> I have a system here running an Intel N97 processor, which is idling
> at 70-80C on Gentoo with all cores 99% idle. This is 40 degrees
> hotter than it runs on Ubuntu or Windows 10.
>
> Powertop confirms that the CPU is spending nearly all of its time in
> idle mode.

Are clock speeds being scaled down when idle? Or does the N97's "idle
mode" preclude the need to scale down clock speed when not busy to
avoid high temps?
RE: Re: Abnormal processor temperature. [ In reply to ]
> -----Original Message-----
> From: Grant Edwards <grant.b.edwards@gmail.com>
> Sent: Wednesday, November 22, 2023 8:36 AM
> To: gentoo-user@lists.gentoo.org
> Subject: [gentoo-user] Re: Abnormal processor temperature.
>
> On 2023-11-21, Laurence Perkins <lperkins@openeye.net> wrote:
>
> > I have a system here running an Intel N97 processor, which is idling
> > at 70-80C on Gentoo with all cores 99% idle. This is 40 degrees
> > hotter than it runs on Ubuntu or Windows 10.
> >
> > Powertop confirms that the CPU is spending nearly all of its time in
> > idle mode.
>
> Are clock speeds being scaled down when idle? Or does the N97's "idle mode" preclude the need to scale down clock speed when not busy to avoid high temps?
>
>

That was part of my confusion, because even when I used cpupower to lock all cores to 800MHz, it kept running hot.

Michael's suggestion of adding the SPI modules to the kernel appears to have fixed it. I'm not sure why the gentoo-kernel-bin that I tried didn't work at that rate, but I'm going to have to activate the "somebody-else's-problem" field on that one.

LMP
Re: Re: Abnormal processor temperature. [ In reply to ]
On Monday, 27 November 2023 23:14:14 GMT Laurence Perkins wrote:
> > -----Original Message-----
> > From: Grant Edwards <grant.b.edwards@gmail.com>
> > Sent: Wednesday, November 22, 2023 8:36 AM
> > To: gentoo-user@lists.gentoo.org
> > Subject: [gentoo-user] Re: Abnormal processor temperature.
> >
> > On 2023-11-21, Laurence Perkins <lperkins@openeye.net> wrote:
> > > I have a system here running an Intel N97 processor, which is idling
> > > at 70-80C on Gentoo with all cores 99% idle. This is 40 degrees
> > > hotter than it runs on Ubuntu or Windows 10.
> > >
> > > Powertop confirms that the CPU is spending nearly all of its time in
> > > idle mode.
> >
> > Are clock speeds being scaled down when idle? Or does the N97's "idle
> > mode" preclude the need to scale down clock speed when not busy to avoid
> > high temps?
> That was part of my confusion, because even when I used cpupower to lock all
> cores to 800MHz, it kept running hot.
>
> Michael's suggestion of adding the SPI modules to the kernel appears to have
> fixed it. I'm not sure why the gentoo-kernel-bin that I tried didn't work
> at that rate, but I'm going to have to activate the
> "somebody-else's-problem" field on that one.
>
> LMP

I suggested enabling the SPI modules because they are used by the CPU to
communicate with various sensors, adjust clock frequency between components
and thereafter to receive signals a/synchronously to control temperatures.
Theoretically, a fail-safe system would thermally throttle, or over-cool the
CPU, when any of the critical SPI signals is lost. I can't say I understand
why without some temperature sensor feedback the CPU decided to overclock
itself. I would have thought these days oscillator crystals or equivalent
capacitor circuitry would be internal to the CPU die, so the CPU frequency
control would be self-contained. Either way, this seems to have been the
problem.
Re: Re: Abnormal processor temperature. [ In reply to ]
Michael,

On Tuesday, 2023-11-28 10:13:56 +0000, you wrote:

> ...
> I suggested enabling the SPI modules because they are used by the CPU to
> communicate with various sensors, adjust clock frequency between components
> and thereafter to receive signals a/synchronously to control temperatures.

What exactly do you suggest? My kernel (6.1.57) configuration contains:

$ grep SPI /usr/src/linux/.config|grep -v SPIN
CONFIG_SCSI_SPI_ATTRS=y
# CONFIG_SPI is not set
# SPI RTC drivers
CONFIG_RTC_I2C_AND_SPI=y
# SPI and I2C RTC drivers
$

I played a little with the "/" and "?" commands of "make menuconfig",
activated "CONFIG_SPI" and found a lot of controllers and drivers, but
nothing obvious to enable additionally. However, apparently activating
"CONFIG_SPI" also activated

CONFIG_NET_VENDOR_ADI=y
CONFIG_SND_SPI=y
CONFIG_SPI_DYNAMIC=y
CONFIG_SPI_MASTER=y

Is this what you are suggesting?

Sincerely,
Rainer
Re: Re: Abnormal processor temperature. [ In reply to ]
On Wednesday, 29 November 2023 17:34:48 GMT Dr Rainer Woitok wrote:
> Michael,
>
> On Tuesday, 2023-11-28 10:13:56 +0000, you wrote:
> > ...
> > I suggested enabling the SPI modules because they are used by the CPU to
> > communicate with various sensors, adjust clock frequency between
> > components
> > and thereafter to receive signals a/synchronously to control temperatures.
>
> What exactly do you suggest? My kernel (6.1.57) configuration contains:
>
> $ grep SPI /usr/src/linux/.config|grep -v SPIN
> CONFIG_SCSI_SPI_ATTRS=y
> # CONFIG_SPI is not set
> # SPI RTC drivers
> CONFIG_RTC_I2C_AND_SPI=y
> # SPI and I2C RTC drivers
> $
>
> I played a little with the "/" and "?" commands of "make menuconfig",
> activated "CONFIG_SPI" and found a lot of controllers and drivers, but
> nothing obvious to enable additionally. However, apparently activating
> "CONFIG_SPI" also activated
>
> CONFIG_NET_VENDOR_ADI=y
> CONFIG_SND_SPI=y
> CONFIG_SPI_DYNAMIC=y
> CONFIG_SPI_MASTER=y
>
> Is this what you are suggesting?
>
> Sincerely,
> Rainer

It depends on the hardware, this is what I have enabled on an AMD MoBo:

~ $ grep SPI /usr/src/linux/.config
CONFIG_UNINLINE_SPIN_UNLOCK=y
CONFIG_MUTEX_SPIN_ON_OWNER=y
CONFIG_RWSEM_SPIN_ON_OWNER=y
CONFIG_LOCK_SPIN_ON_OWNER=y
CONFIG_ARCH_USE_QUEUED_SPINLOCKS=y
CONFIG_QUEUED_SPINLOCKS=y
# CONFIG_INET6_ESPINTCP is not set
CONFIG_SCSI_SPI_ATTRS=y
CONFIG_SPI=y
CONFIG_SPI_MASTER=y
# CONFIG_SPI_MEM is not set
# SPI Master Controller Drivers
# CONFIG_SPI_ALTERA is not set
# CONFIG_SPI_AXI_SPI_ENGINE is not set
# CONFIG_SPI_BITBANG is not set
# CONFIG_SPI_CADENCE is not set
# CONFIG_SPI_DESIGNWARE is not set
# CONFIG_SPI_NXP_FLEXSPI is not set
# CONFIG_SPI_GPIO is not set
# CONFIG_SPI_MICROCHIP_CORE is not set
# CONFIG_SPI_MICROCHIP_CORE_QSPI is not set
# CONFIG_SPI_LANTIQ_SSC is not set
# CONFIG_SPI_OC_TINY is not set
# CONFIG_SPI_PXA2XX is not set
# CONFIG_SPI_ROCKCHIP is not set
# CONFIG_SPI_SC18IS602 is not set
# CONFIG_SPI_SIFIVE is not set
# CONFIG_SPI_MXIC is not set
# CONFIG_SPI_XCOMM is not set
# CONFIG_SPI_XILINX is not set
# CONFIG_SPI_ZYNQMP_GQSPI is not set
CONFIG_SPI_AMD=m
# SPI Multiplexer support
# CONFIG_SPI_MUX is not set
# SPI Protocol Masters
# CONFIG_SPI_SPIDEV is not set
# CONFIG_SPI_LOOPBACK_TEST is not set
# CONFIG_SPI_TLE62X0 is not set
# CONFIG_SPI_SLAVE is not set
CONFIG_SPI_DYNAMIC=y
# SPI GPIO expanders
# end of SPI GPIO expanders
# CONFIG_SENSORS_LTC2947_SPI is not set
# CONFIG_MFD_DA9052_SPI is not set
# CONFIG_MFD_MC13XXX_SPI is not set
# CONFIG_MFD_TPS65912_SPI is not set
# CONFIG_MFD_ARIZONA_SPI is not set
# CONFIG_MFD_WM831X_SPI is not set
# CONFIG_SND_SPI is not set
# SPI RTC drivers
CONFIG_RTC_I2C_AND_SPI=y
# SPI and I2C RTC drivers
# CONFIG_HWSPINLOCK is not set
Re: Re: Abnormal processor temperature. [ In reply to ]
Michael,

On Wednesday, 2023-11-29 17:56:37 +0000, you wrote:

> ...
> It depends on the hardware, this is what I have enabled on an AMD MoBo:
>
> ~ $ grep SPI /usr/src/linux/.config
> ...
> CONFIG_SCSI_SPI_ATTRS=y
> CONFIG_SPI=y
> CONFIG_SPI_MASTER=y
> ...
> CONFIG_SPI_AMD=m
> ...
> CONFIG_SPI_DYNAMIC=y
> ...
> CONFIG_RTC_I2C_AND_SPI=y

Thanks for the info. Apart from "CONFIG_SPI_AMD=m" this is what I've
now configured too.

Sincerely,
Rainer