Mailing List Archive: backend stops working for a couple hours every morning at 2?

backend stops working for a couple hours every morning at 2?

Jan 12, 2022, 6:47 AM

Post #1 of 3 (289 views)

Hey, all. Over the past week, I've noticed my backend stops working every
morning at 2am local time (0700 UTC). The symptoms are that I get emails
from my every-10-minutes cron job that runs /usr/sbin/mythtv-update-motd .
The emails repeat for roughly 2-3 hours, give or take, and say "Our child
has stopped talking to us, kill it off. Unknown error during retrieval of
status from the MythTV backend." During that time, recordings also silently
fail.

The problem is that I can't figure out what's causing this at all. I don't
seem to have any cron jobs or systemd timers that run daily at 2am. I can't
find anything unusual in the mythtv-backend systemd journal or in any of
the /var/log/mythtv/*.log files, and in fact, the evidence is that the
backend process/service is still running. Things that appear in the logs
every 15 minutes or so continue to appear during the time period when the
cron emails are going out and recordings aren't working, and systemd
reports that the service has been running since before last night's gap.

Can you help me figure out what's causing this, or at least help me figure
out where to look for information or how to increase logging appropriately?
My usual sysadmin log-reading skills are failing me here...

Thanks!
Josh

Re: backend stops working for a couple hours every morning at 2? [ In reply to ]

stephen_agent at jsw

Jan 12, 2022, 6:19 PM

Post #2 of 3 (288 views)

Permalink

On Wed, 12 Jan 2022 09:47:58 -0500, you wrote:

>Hey, all. Over the past week, I've noticed my backend stops working every
>morning at 2am local time (0700 UTC). The symptoms are that I get emails
>from my every-10-minutes cron job that runs /usr/sbin/mythtv-update-motd .
>The emails repeat for roughly 2-3 hours, give or take, and say "Our child
>has stopped talking to us, kill it off. Unknown error during retrieval of
>status from the MythTV backend." During that time, recordings also silently
>fail.
>
>The problem is that I can't figure out what's causing this at all. I don't
>seem to have any cron jobs or systemd timers that run daily at 2am. I can't
>find anything unusual in the mythtv-backend systemd journal or in any of
>the /var/log/mythtv/*.log files, and in fact, the evidence is that the
>backend process/service is still running. Things that appear in the logs
>every 15 minutes or so continue to appear during the time period when the
>cron emails are going out and recordings aren't working, and systemd
>reports that the service has been running since before last night's gap.
>
>Can you help me figure out what's causing this, or at least help me figure
>out where to look for information or how to increase logging appropriately?
>My usual sysadmin log-reading skills are failing me here...
>
>Thanks!
>Josh

Two hours is a very long time. Usually, the culprit with this sort of
thing is the database backup happening, but even with my huge database
that only takes 20 minutes or so. Mythbackend is quite multithreaded,
so bits of it can work when other bits do not. But if the database is
running very slowly as it can during backups or database checks, as
soon as mythbackend tries to use the database, that bit of it can be
stalled for a long time waiting for it. So it would pay to run a full
check on all database tables to see if there are any that are crashed.

A long time ago, I had a problem where mythbackend would not record
(or do much else) for a couple of weeks, and I eventually discovered
that the cause was two hard drives that were on an external dual USB
mount. They had been accidentally turned off without being unmounted
properly and their JFS partitions needed a full fsck before they would
mount. As they had partitions which were used by a storagegroup,
mythbackend was attempting to mount them all the time and getting a
huge timeout each time. When both drives' timeouts had happened,
mythbackend would operate normally for a second or two before
attempting to mount them again. The same sort of thing can happen if
mythbackend tries to access a network drive which will not connect
properly - badly configured NFS mounts are a frequent problem with
this. SAMBA mounts using the default configuration seem to be more
forgiving, but they can cause problems too.
_______________________________________________
mythtv-users mailing list
mythtv-users@mythtv.org
http://lists.mythtv.org/mailman/listinfo/mythtv-users
http://wiki.mythtv.org/Mailing_List_etiquette
MythTV Forums: https://forum.mythtv.org

Re: backend stops working for a couple hours every morning at 2? [ In reply to ]

mwedel at sonic

Jan 12, 2022, 9:37 PM

Post #3 of 3 (288 views)

Permalink

On 1/12/22 6:19 PM, Stephen Worthington wrote:
> On Wed, 12 Jan 2022 09:47:58 -0500, you wrote:
>
>> Hey, all. Over the past week, I've noticed my backend stops working every
>> morning at 2am local time (0700 UTC). The symptoms are that I get emails
>>from my every-10-minutes cron job that runs /usr/sbin/mythtv-update-motd .
>> The emails repeat for roughly 2-3 hours, give or take, and say "Our child
>> has stopped talking to us, kill it off. Unknown error during retrieval of
>> status from the MythTV backend." During that time, recordings also silently
>> fail.

My first thought would be see what other cron jobs may be running at that same time (or starting somewhat before it) - it could be some other job that is using a lot of resources, and thus starving mythtv (for example, if there is a job that traverses the entire filesystem looking for things to clean up, that may consume enough of the IO to make mythtv have issues).

Possibly look at /var/syslog or elsewhere to see if there are any messages indicating issues.

_______________________________________________
mythtv-users mailing list
mythtv-users@mythtv.org
http://lists.mythtv.org/mailman/listinfo/mythtv-users
http://wiki.mythtv.org/Mailing_List_etiquette
MythTV Forums: https://forum.mythtv.org