[GE users] wait3 returned -1 / now sending signal CONT to pid -3600

Sebastian Stark stark at tuebingen.mpg.de
Thu Dec 1 08:50:40 GMT 2005


On Wed, Nov 30, 2005 at 10:14:28PM +0100, Reuti wrote:
> Hi Sebastian,
> 
> can you please check with "ps -e f -o pid,ppid,pgrp,command" what is
> running on this node112? And/or is there something in the messages
> file like what is mentioned here:

There's nothing running anymore on this node; the job has already exited
for some reason. I'll try it the next time it happens.
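
For the next occurrence, a minimal sketch of how the ps output could be
narrowed down to one process group. It assumes the negative pid in the
subject line (-3600) denotes process group 3600, as it would for kill(2);
the function name filter_pgrp is made up for illustration.

```shell
# Hypothetical helper: keep the header line plus every process whose
# PGRP column (third field of "ps -o pid,ppid,pgrp,command") equals $1.
filter_pgrp() {
    awk -v pgrp="$1" 'NR == 1 || $3 == pgrp'
}
```

Usage on the affected node would be something like
"ps -e -o pid,ppid,pgrp,command | filter_pgrp 3600".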

The messages file does not say much about this particular job id:


neckar ~ % grep 649640 /usr/local/sge/default/spool/node112/messages
11/30/2005 15:35:52|execd|node112|E|abnormal termination of shepherd for job 649640.1: "exit_status" file is empty
11/30/2005 15:35:52|execd|node112|E|can't open usage file "active_jobs/649640.1/usage" for job 649640.1: No such file or directory


neckar ~ % grep 649640 /usr/local/sge/default/spool/qmaster/messages
11/26/2005 09:14:13|qmaster|neckar|W|job 649640.1 failed on host node107 before writing exit_status because: shepherd exited with exit status 19
11/26/2005 09:14:13|qmaster|neckar|W|rescheduling job 649640.1
11/30/2005 15:35:53|qmaster|neckar|W|job 649640.1 failed on host node112 before writing exit_status because: shepherd exited with exit status 19
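
The messages lines above are pipe-separated (timestamp, daemon, host,
severity, text), so the error-level entries for one job can be pulled out
of several messages files at once. This is only a sketch based on the
lines shown here; the function name sge_job_errors is an assumption, not
part of Grid Engine.

```shell
# Hypothetical helper: print timestamp, host and text of every
# error-level ("E") line that mentions the given job id.
# Usage: sge_job_errors JOBID FILE...
sge_job_errors() {
    job_id=$1
    shift
    awk -F'|' -v job="$job_id" '
        # field 4 is the severity, field 5 the message text
        $4 == "E" && index($5, "job " job ".") {
            print $1 " " $3 ": " $5
        }
    ' "$@"
}
```

For example: sge_job_errors 649640 /usr/local/sge/default/spool/node112/messages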

> http://gridengine.sunsource.net/issues/show_bug.cgi?id=1665
> 
> This is fixed in u6.

I should really upgrade...


thanks.

> 
> Cheers - Reuti
> 
> 
> Am 30.11.2005 um 16:02 schrieb Sebastian Stark:
> 
> >
> >I'm getting lots of these errors. What could be the cause? It only  
> >happens with some jobs.
> >
> >Thanks for any hints.
> >
> >
> >-Sebastian
> >

-- 
Sebastian Stark -- http://www.kyb.tuebingen.mpg.de/~stark
Max Planck Institute for Biological Cybernetics
