[GE users] Mvapich processes not killed on qdel
mhanby at uab.edu
Wed May 9 16:43:34 BST 2007
I have GE 6.0u8 on a Rocks 4.2.1 cluster with Infiniband and the Topspin
roll (which includes mvapich).
When I qdel an mvapich job, the job is immediately removed from the
queue, but most of its processes on the nodes are not killed. The
mpirun_ssh process does appear to get killed, but the actual job
executables (sander.MPI) do not.
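To see exactly which processes survive on a node, something like the
following could be run against that node's process list. This is only a
rough sketch: count_stray is a made-up helper name, and it assumes input
with the columns pid, user, comm (as printed by "ps -eo pid,user,comm"
on Linux).

```shell
# Hypothetical helper (not part of GE or mvapich): count the processes
# owned by a given user whose command name matches exactly.
# Reads "pid user comm" lines on stdin, e.g. from: ps -eo pid,user,comm
count_stray() {
    awk -v user="$1" -v comm="$2" \
        '$2 == user && $3 == comm { n++ } END { print n + 0 }'
}

# Example with canned input (illustrative PIDs); prints 2
printf '%s\n' \
    '  PID USER     COMMAND' \
    ' 4101 mhanby   sander.MPI' \
    ' 4102 mhanby   sander.MPI' \
    ' 4200 root     sshd' |
    count_stray mhanby sander.MPI
```

On a live node the input would come from ps itself, for example
ps -eo pid,user,comm | count_stray "$USER" sander.MPI
run once after the qdel. A non-zero count means executables were left
behind.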
I followed the directions for tight integration of Mvapich.
The job runs fine, but again it doesn't kill off processes when qdel'd.
Here's the pe:
$ qconf -sp mvapich
start_proc_args /share/apps/gridengine/mvapich/startmpi.sh -catch_rsh
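Since tight integration depends on the whole PE definition and not only
on start_proc_args, it may help to pull single attributes out of the
qconf -sp output and check them one at a time. A small sketch, where
pe_attr is a made-up helper name and the sample input is illustrative,
not this cluster's real configuration. control_slaves is the PE
attribute that lets GE start and control the slave tasks itself.

```shell
# Hypothetical helper: print the value of one attribute from "qconf -sp"
# output read on stdin (one "name value..." pair per line).
pe_attr() {
    awk -v key="$1" '$1 == key { $1 = ""; sub(/^ +/, ""); print }'
}

# Illustrative input only, not the real PE from this cluster; prints TRUE
printf '%s\n' \
    'pe_name           mvapich' \
    'control_slaves    TRUE' |
    pe_attr control_slaves
```

Against the real PE this would be
qconf -sp mvapich | pe_attr control_slaves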
The only modification made to the startmpi.sh script was to change the
location of the hostname and rsh scripts from $SGE_ROOT to
Any suggestions on what I should look for?