[GE users] sge_shepherd problems perhaps connected to nfs problems

Fred Youhanaie fly at anydata.co.uk
Wed Jun 27 21:35:43 BST 2007


    [ The following text is in the "ISO-8859-1" character set. ]
    [ Your display is set for the "ISO-8859-10" character set.  ]
    [ Some special characters may be displayed incorrectly. ]

Margaret Doll wrote:
> But the largest process id on the compute node is
> 
> root     13242 12922  0 16:05 pts/1    00:00:00 more

Those large pid's are 2's complements in 32 bit arithmetic, but
expressed as 64 bit numbers:

>>> kill(4294958680, SIGKILL)               = 0

4294958680 = -(2^32-4294958680)  = -8616
The shepherd is attempting to kill all the procs with process group id
of 8616.

>>> wait4(4294967295,

4294967295 = -(2^32-4294967295) = -1
The shepherd is waiting for any child.

It is worth checking if any child processes of the shepherd are still
active after a qdel.

Cheers
f.





---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list