[GE users] qdel not remove all instances of jobs

Simon Gao gao at schrodinger.com
Fri Mar 9 00:01:43 GMT 2007


    [ The following text is in the "ISO-8859-1" character set. ]
    [ Your display is set for the "ISO-8859-10" character set.  ]
    [ Some special characters may be displayed incorrectly. ]

Reuti wrote:
> Hi,
>
> Am 07.03.2007 um 23:03 schrieb Simon Gao:
>
>> I am having problems with qdel to completely remove jobs from a 
>> compute node. After running "qdel <jobid>", the job spawned processes 
>> continue to run even though the job has been removed from queue.  
>> What could cause such problem?
>
> are the processes jumping out of the processgroup? Are they started by 
> a "&" in the jobscript? So, we need some more details about it. Maybe 
> a "ps -e f -o pid,ppid,pgroup,command" from these surviving processes.
>
> -- Reuti
Here is one example.

Initially:

[user1 at compute-0-39 ~]$ pstree pn | grep -A 1 user1
        
|-sshd(2875)---sshd(8263)---sshd(8265,user1)---tcsh(8266)-+-pstree(8441)
        |                                                            
`-grep(8442)
--
        
|-sge_execd(3170,sge)---sge_shepherd(8105)---perl(8106,user1)-+-perl(8208)
        |                                                                
`-sh(8227)---app1(8228)


Followed by qdel:

[user1 at compute-0-39 ~]$ pstree -upn | grep -A 1 gree
        
|-sshd(2875)---sshd(8263)---sshd(8265,user1)---tcsh(8266)-+-pstree(8479)
        |                                                            
`-grep(8480)
--
        |-sh(8227,user1)---app1(8228)

Followed by kill -9 8228 on the node:

[user1 at compute-0-39 ~]$ pstree -upn | grep -A 1 gree
        
|-sshd(2875)---sshd(8263)---sshd(8265,user1)---tcsh(8266)-+-pstree(8501)
        |                                                            
`-grep(8502)

Simon

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list