[GE users] tight intergration problem

Reuti reuti at staff.uni-marburg.de
Fri Jan 27 16:04:05 GMT 2006


    [ The following text is in the "ISO-8859-1" character set. ]
    [ Your display is set for the "ISO-8859-10" character set.  ]
    [ Some special characters may be displayed incorrectly. ]

Am 27.01.2006 um 16:55 schrieb Jean-Paul Minet:

> Reuti,
>
>>> Now, issuing a qdel of this running job will properly stop slave   
>>> process, but on master node, remains a defunct:
>>>
>>> root      5699     1 99 09:14 ?        00:13:11 /home/pan/minet/  
>>> abinit/parallel_eth/abinip_eth -p4pg /home/pan/minet/abinit/  
>>> parallel_eth/PI5615 -p4wd /home
>>> root      5700  5699  0 09:14 ?        00:00:00 /home/pan/minet/  
>>> abinit/parallel_eth/abinip_eth -p4pg /home/pan/minet/abinit/  
>>> parallel_eth/PI5615 -p4wd /home
>>> root      5701  5699  0 09:14 ?        00:00:00 [qrsh] <defunct>
>>>
>>> Have you an idea where does this come from ?  mpich?
>>>
>> Did you set, as mentioned in the Howto:
>> export MPICH_PROCESS_GROUP=no
>> in your jobscript (or as default request to SGE in $SGE_ROOT/ 
>> default/ common/sge_request: -v MPICH_PROCESS_GROUP=no) and set -V  
>> in the rsh- wrapper?
>
> OK, it works with the variable defined/exported (with -v) in the  
> job script, but I would prefer to use sge_request (to prevent  
> problems resulting from users forgetting it in their script).  When  
> I use the sge_request file, it doesn't seem to work.  Does it get  
> exported to the slave nodes as well when it is picked up from the  
> sge_request? (I modified the wrapper, adding -V where needed).
>

The lines you put in sge_request are just:

-v MPICH_PROCESS_GROUP=no
-v P4_RSHCOMMAND=rsh

(no export, no #$ necessary)? Are the users using a  
private .sge_request file? - Reuti


> jp
>> -- Reuti
>>> jp
>>>
>>> -------------------------------------------------------------------- 
>>> -
>>> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
>>> For additional commands, e-mail: users-help at gridengine.sunsource.net
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
>> For additional commands, e-mail: users-help at gridengine.sunsource.net
>
> -- 
> Jean-Paul Minet
> Gestionnaire CISM - Institut de Calcul Intensif et de Stockage de  
> Masse
> Université Catholique de Louvain
> Tel: (32) (0)10.47.35.67 - Fax: (32) (0)10.47.34.52
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail: users-help at gridengine.sunsource.net

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list