[GE users] tight intergration problem

Jean-Paul Minet minet at cism.ucl.ac.be
Fri Jan 27 15:55:14 GMT 2006


    [ The following text is in the "ISO-8859-1" character set. ]
    [ Your display is set for the "ISO-8859-10" character set.  ]
    [ Some special characters may be displayed incorrectly. ]

Reuti,

>> Now, issuing a qdel of this running job will properly stop slave  
>> process, but on master node, remains a defunct:
>>
>> root      5699     1 99 09:14 ?        00:13:11 /home/pan/minet/ 
>> abinit/parallel_eth/abinip_eth -p4pg /home/pan/minet/abinit/ 
>> parallel_eth/PI5615 -p4wd /home
>> root      5700  5699  0 09:14 ?        00:00:00 /home/pan/minet/ 
>> abinit/parallel_eth/abinip_eth -p4pg /home/pan/minet/abinit/ 
>> parallel_eth/PI5615 -p4wd /home
>> root      5701  5699  0 09:14 ?        00:00:00 [qrsh] <defunct>
>>
>> Have you an idea where does this come from ?  mpich?
>>
> 
> Did you set, as mentioned in the Howto:
> 
> export MPICH_PROCESS_GROUP=no
> 
> in your jobscript (or as default request to SGE in $SGE_ROOT/default/ 
> common/sge_request: -v MPICH_PROCESS_GROUP=no) and set -V in the rsh- 
> wrapper?

OK, it works with the variable defined/exported (with -v) in the job script, but 
I would prefer to use sge_request (to prevent problems resulting from users 
forgetting it in their script).  When I use the sge_request file, it doesn't 
seem to work.  Does it get exported to the slave nodes as well when it is picked 
up from the sge_request? (I modified the wrapper, adding -V where needed).

jp
> -- Reuti
> 
>> jp
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
>> For additional commands, e-mail: users-help at gridengine.sunsource.net
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail: users-help at gridengine.sunsource.net
> 
> 
> 

-- 
Jean-Paul Minet
Gestionnaire CISM - Institut de Calcul Intensif et de Stockage de Masse
Université Catholique de Louvain
Tel: (32) (0)10.47.35.67 - Fax: (32) (0)10.47.34.52

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list