[GE users] sge6.2u3 - scheduler dying intermittantly

bomb20 Harvey.Richardson at zeenty.com
Tue Aug 11 09:27:01 BST 2009


rpatterson wrote:
> Recently, I have been having trouble with the scheduler thread dying on
> our master. I assume that this is what's happening because the
> sge_qmaster process is still running, and running jobs continue on
> without a problem, but client requests (qsub/qstat) can no longer make a
> connection, and no new jobs are dispatched. Recently, this has been
> happening about once a week.

Do you have the $SGE_ROOT on NFS by any chance and is that reliable?
The reason I ask is that I have seen similar things when the directoy
goes away for a short time due to server or network issues.

Harvey

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=211794

To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].



More information about the gridengine-users mailing list