[GE users] qdel

Reuti reuti at staff.uni-marburg.de
Mon Nov 5 10:03:32 GMT 2007


Am 05.11.2007 um 06:34 schrieb John_Tai:

> Here is the output. I started the job as eldo mult.cir, stopped the  
> process with ctrl-c, but the process is still running.
> Thanks for helping.
>
>   PID  PPID  PGRP COMMAND
>  2538     1  2538 /home/sge/sge6.1/bin/lx24-amd64/sge_execd
> 11526     1 11526 /home/sge/sge6.1/utilbin/lx24-amd64/rshd -l
> 11527 11526 11527  \_ /home/sge/sge6.1/utilbin/lx24-amd64/ 
> qrsh_starter /data1/sge/spool/dsls1/active_jobs/219731.1
> 11560 11527 11560      \_ tcsh -c eldo mult.cir
> 11593 11560 11560          \_ /bin/sh /home/edamgr/linux/mentor/ 
> ams_2007.1-64bit/bin/eldo mult.cir
> 11864 11593 11560              \_ /bin/sh /home/edamgr/linux/mentor/ 
> ams_2007.1-64bit/com/eldo mult.cir
> 11877 11864 11560                  \_ /home/edamgr/linux/mentor/ 
> ams_2007.1-64bit/aol/bin/eldo_64.exe -i mult.cir
> 11886 11877 11560                      \_ /bin/sh /home/edamgr/ 
> linux/mentor/ams_2007.1-64bit/bin/run_wdb_server
> 12157 11886 11560                          \_ /bin/sh /home/edamgr/ 
> linux/mentor/ams_2007.1-64bit/aol/bin/run_wdb_server
> 12158 12157 11560                              \_ /home/edamgr/ 
> linux/mentor/ams_2007.1-64bit/jre/aol/bin/java -DMGC_TMPDIR=/tmp - 
> DAMS_WDBSERVER_INFO=/home/1

I wonder, where the sge_shepherd went to. Can you please post the  
same thing of a running job in qstat? Anything in /tmp as error  
message of the job?

This here you should be able to kill with a:

kill -9 -- -11560

i.e. killing the processgroup.

-- Reuti

>
>
>
> -----Original Message-----
> From: Reuti [mailto:reuti at staff.uni-marburg.de]
> Sent: Friday, November 02, 2007 5:45 PM
> To: users at gridengine.sunsource.net
> Subject: Re: [GE users] qdel
>
>
> Am 02.11.2007 um 06:36 schrieb John_Tai:
>
>> Sorry I am late. I tried that command but it doesn't work. Is this
>> what you were looking for?
>>
>>  ps -ef |grep eldo
>
> Yes, nearly. Blank between -e and f is important. It would also be
> useful to get the process group:
>
> ps -e f -o pid,ppid,pgrp,command
>
>> johnt    18024 17996  0 13:31 ?        00:00:00 tcsh -c eldo mult.cir
>> johnt    18054 18024  0 13:31 ?        00:00:00 /bin/sh /home/
>> edamgr/linux/mentor/ams_2007.1-64bit/bin/eldo mult.cir
>> johnt    18325 18054  0 13:31 ?        00:00:00 /bin/sh /home/
>> edamgr/linux/mentor/ams_2007.1-64bit/com/eldo mult.cir
>> johnt    18337 18325 84 13:31 ?        00:02:05 /home/edamgr/linux/
>> mentor/ams_2007.1-64bit/aol/bin/eldo_64.exe -i mult.cir
>
> -- Reuti
>
>>
>>
>>
>> -----Original Message-----
>> From: Reuti [mailto:reuti at staff.uni-marburg.de]
>> Sent: Tuesday, October 23, 2007 5:51 PM
>> To: users at gridengine.sunsource.net
>> Subject: Re: [GE users] qdel
>>
>>
>> Hi,
>>
>> Am 23.10.2007 um 08:39 schrieb John_Tai:
>>
>>> After upgrading to 6.1 qdel doesn't kill the process on the
>>> execution host anymore. The job is not in qstat, but the process
>>> itself is still running.
>>>
>>> The job is submitted using:
>>>
>>> qrsh -v eda=$cmd -cwd -now n <command>
>>>
>>> We used to be able to delete the job with qdel or even with ctrl-c,
>>> but now it doesn't work anymore.
>>>
>>> Any ideas why or how to debug?
>>
>> what is a:
>>
>> qrsh -v eda=$cmd -cwd ps -e f -o pid,ppid,pgrp,command
>>
>> showing? It should show all processes bound to the sge_shepherd.
>>
>> -- Reuti
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
>> For additional commands, e-mail: users-help at gridengine.sunsource.net
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
>> For additional commands, e-mail: users-help at gridengine.sunsource.net
>>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail: users-help at gridengine.sunsource.net
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail: users-help at gridengine.sunsource.net
>

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list