[GE users] Jobs still shown as running after process has died

robhorton r.horton at qmul.ac.uk
Fri Aug 13 12:57:54 BST 2010


Hi,

On Fri, 2010-08-13 at 12:09 +0200, reuti wrote:
> > I've got a "live" example at the moment if anyone has any debugging suggestions.
> 
> - was the $TMPDIR on the node already removed?

We don't create per-job a $TMPDIR

> - was the job's spool directory removed $SGE_ROOT/default/spool/<exechost>/active_jobs (or is it local like /var/spool/<exechost>/active_jobs, which would be better)?

The spool directory has gone.

> - the messages file of the qmaster has no entry also? (loglevel info)

No, but loglevel was set to warning - I've changed it and will see if I
can reproduce the error.

> - was the email send at the end of the job?
> - the nodes "messages" file contains a note about the email?

The job didn't request an email.

Rob

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=274265

To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].



More information about the gridengine-users mailing list