[GE users] sge_shepherd problems perhaps connected to nfs problems

Margaret Doll Margaret_Doll at brown.edu
Thu Jun 28 16:59:08 BST 2007


I have a job that I started last night.

It is no longer running on top, but it shows up in qstat -f

ps -ef --forest | more^M
UID        PID  PPID  C STIME TTY          TIME CMD^M
sge       4228     1  0 Jun27 ?        00:02:53 /opt/gridengine/bin/ 
lx26-amd64/sge_execd^M
sge       5094  4228  0 Jun27 ?        00:00:00  \_ sge_shepherd-549 - 
bg^M
mad       5095  5094  0 Jun27 ?        00:00:00      \_ -csh /opt/ 
gridengine/default/spool/compute-0-1/job_scripts/549^M
mad       5168  5095 84 Jun27 ?        13:30:53          \_ /home/mad/ 
user1/mad^M


Again the script  was to run /home/mad/user1/mad


On Jun 27, 2007, at 6:00 PM, Fred Youhanaie wrote:

> Margaret Doll wrote:
>> There appears to be no child processes.
>> ps -ef | grep sge
>> sge       4231     1  0 Jun25 ?        00:06:14 /opt/gridengine/ 
>> bin/lx26-amd64/sge_execd
>> mad      11823     1  0 16:20 ?        00:00:00 sge_shepherd-546 -bg
>> ps -f --ppid=11823
>> UID        PID  PPID  C STIME TTY          TIME CMD
>
> sorry, wrong command (bad short cut!)
>
> try
> 	ps -ef --forest | more
>
> and look for patterns like this:
>
> sge      21123  5328  0 22:50 ?        00:00:00  \_ sge_shepherd-2 -bg
> user1    21124 21123  0 22:50 ?        00:00:00      \_ -sh /opt/ 
> sge/default/spool/sgehost/job_scripts/237 99
> user1    21131 21124  0 22:50 ?        00:00:00          \_ sleep 99
>
>
> cheers
> f.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail: users-help at gridengine.sunsource.net
>

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list