Bug#361261: [Pkg-nagios-devel] Bug#361261: nagios-pgsql: main monitoring process dies intermittently

Kurt Yoder ktydebbug at richard-group.com
Mon Apr 10 15:17:02 UTC 2006


On Apr 10, 2006, at 11:07 AM, sean finney wrote:

> hi kurt,
>
> On Mon, Apr 10, 2006 at 10:50:27AM -0400, Kurt Yoder wrote:
>> There didn't seem to be any core files, or at least none that
>> "locate" could find. I'm running strace right now so the next time it
>> dies I will be able to see what happened.
>
> i'm not sure if the default behaviour is to dump core or not.
> try
>
> ulimit -c 9999
>
> before running nagios, and see if that helps to produce a core maybe?
>

OK, I did this and restarted nagios.



>> BTW, in an attempt to fix this problem I reverted all packages back
>> to "stable". This includes glibc, nagios, etc. However, this morning
>> I found that the same problem had reoccurred. I can re-send the
>> package versions if you like.
>
> it's possible that the error message you're getting was an error all
> along, but only after glibc was upgraded did the library
> start to detect the use of this specific double free or something.

I'm running the strace again. One thing that looks a little odd to me,
even though the process is still running, is all the "Illegal seek"
and "Resource temporarily unavailable" messages I see:

fork()                                  = 28128
--- SIGCHLD (Child exited) @ 0 (0) ---
close(9)                                = 0
waitpid(28128, [{WIFEXITED(s) && WEXITSTATUS(s) == 0}], 0) = 28128
time([1144682006])                      = 1144682006
close(8)                                = 0
time([1144682006])                      = 1144682006
read(6, 0xbffff080, 504)                = -1 EAGAIN (Resource temporarily unavailable)
time([1144682006])                      = 1144682006
time([1144682006])                      = 1144682006
time([1144682006])                      = 1144682006
time([1144682006])                      = 1144682006
fork()                                  = 28130
--- SIGCHLD (Child exited) @ 0 (0) ---
waitpid(28130, NULL, 0)                 = 28130
time([1144682006])                      = 1144682006
time(NULL)                              = 1144682006
read(4, "", 4096)                       = 0
_llseek(4, 0, 0xbfffe908, SEEK_SET)     = -1 ESPIPE (Illegal seek)
rt_sigprocmask(SIG_BLOCK, [CHLD], [RTMIN], 8) = 0
rt_sigaction(SIGCHLD, NULL, {SIG_DFL}, 8) = 0
rt_sigprocmask(SIG_SETMASK, [RTMIN], NULL, 8) = 0
nanosleep({1, 0}, {1, 0})               = 0
time([1144682007])                      = 1144682007
time(NULL)                              = 1144682007
read(4, "", 4096)                       = 0
_llseek(4, 0, 0xbfffe908, SEEK_SET)     = -1 ESPIPE (Illegal seek)
rt_sigprocmask(SIG_BLOCK, [CHLD], [RTMIN], 8) = 0
rt_sigaction(SIGCHLD, NULL, {SIG_DFL}, 8) = 0
rt_sigprocmask(SIG_SETMASK, [RTMIN], NULL, 8) = 0
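
As a point of comparison, the two errno values in the trace are what Linux normally returns for ordinary pipe operations: EAGAIN when reading an empty pipe in non-blocking mode, and ESPIPE when seeking on a pipe. The sketch below reproduces both on a throwaway pipe. It assumes Linux and Python 3.5 or later, the function name is made up for illustration, and it says nothing about what descriptors 4 and 6 actually are inside nagios.

```python
import errno
import os


def pipe_errors():
    """Trigger EAGAIN and ESPIPE on a fresh pipe and report which ones occurred."""
    read_fd, write_fd = os.pipe()
    os.set_blocking(read_fd, False)
    seen = {"read_eagain": False, "lseek_espipe": False}
    try:
        try:
            # Nothing has been written yet, so a non-blocking read cannot succeed.
            os.read(read_fd, 504)
        except OSError as exc:
            seen["read_eagain"] = exc.errno == errno.EAGAIN
        try:
            # Pipes have no file offset, so any seek is rejected.
            os.lseek(read_fd, 0, os.SEEK_SET)
        except OSError as exc:
            seen["lseek_espipe"] = exc.errno == errno.ESPIPE
    finally:
        os.close(read_fd)
        os.close(write_fd)
    return seen


if __name__ == "__main__":
    print(pipe_errors())
```

If descriptors 4 and 6 do turn out to be pipes or FIFOs, these messages on their own would not necessarily indicate a fault; listing /proc/<pid>/fd for the running process would show what they point at.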








