[GE users] Scheduler died unexpectedly

Mulley, Nikhil Nikhil.Mulley at deshaw.com
Thu Jan 10 12:24:23 GMT 2008


Is there means of enabling the scheduler debugging ? 

-----Original Message-----
From: Mulley, Nikhil 
Sent: Thursday, January 10, 2008 1:46 PM
To: users at gridengine.sunsource.net
Subject: [GE users] Scheduler died unexpectedly

I want to look at why and how the scheduler died. I am using SGE
v6.0.11. Any (forensic) reports could be generated that why the
scheduler could have died in first place?

First thing that I came to notice that scheduler is died as the
schedd.pid was referring to non-existing pid number on my qmaster host
(from the act_qmaster file), I was wondering why is that shadowd did not
notice this and did not start the schedd/qmaster on one of the shadow
masters ? Is this mechanism can be expected from the host running
shadowd?

Thanks,
Nikhil

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list