[GE users] MVAPICH 0.9.9 and SGE 6.2u1

rgigon rgigon at slb.com
Wed Jul 29 17:32:28 BST 2009


    [ The following text is in the "Windows-1252" character set. ]
    [ Your display is set for the "ISO-8859-10" character set.  ]
    [ Some special characters may be displayed incorrectly. ]

Hi there,
I?m trying to achieve tight integration between MVAPICH 0.9.9 and SGE 6.2u1
I followed the directions in the HOWTO and am able to submit jobs, but when I use qdel, it still does not kill all the processes.

Results from ps on master:

6553  4370  6553  \_ sge_shepherd-1302 -bg
 6578  6553  6578      \_ -bash /opt/sge/default/spool/fire7/job_scripts/1302
 6669  6578  6578          \_ /usr/local/mpi/intel/mvapich-0.9.9/bin/mpirun_rsh -rsh -np 4 -hostfile /tmp/1302.1.normal.q/machines ./testtight.x
 6670  6669  6578              \_ /usr/bin/rsh fire7 cd /people4/jliu17/roberta/source; /usr/bin/env LD_LIBRARY_PATH=/usr/local/mpi/intel/mvapich-0.9.9/lib/shared:/usr/local/mpi/intel/mvapich-0.9.9/lib:/usr/local/mpi/intel/mvapich-0.9.9/lib/shared: MPIRUN_MPD=0 MPIRUN_HOST=fire7 MPIRUN_PORT=60546 MPIRUN_RANK=0 MPIRUN_NPROCS=4 MPIRUN_ID=6669    NOT_USE_TOTALVIEW=1  ./testtight.x
 6678  6670  6578              |   \_ [rsh] <defunct>
 6671  6669  6578              \_ /usr/bin/rsh fire7 cd /people4/jliu17/roberta/source; /usr/bin/env LD_LIBRARY_PATH=/usr/local/mpi/intel/mvapich-0.9.9/lib/shared:/usr/local/mpi/intel/mvapich-0.9.9/lib:/usr/local/mpi/intel/mvapich-0.9.9/lib/shared: MPIRUN_MPD=0 MPIRUN_HOST=fire7 MPIRUN_PORT=60546 MPIRUN_RANK=1 MPIRUN_NPROCS=4 MPIRUN_ID=6669    NOT_USE_TOTALVIEW=1  ./testtight.x
 6680  6671  6578              |   \_ [rsh] <defunct>
 6672  6669  6578              \_ /usr/bin/rsh fire8 cd /people4/jliu17/roberta/source; /usr/bin/env LD_LIBRARY_PATH=/usr/local/mpi/intel/mvapich-0.9.9/lib/shared:/usr/local/mpi/intel/mvapich-0.9.9/lib:/usr/local/mpi/intel/mvapich-0.9.9/lib/shared: MPIRUN_MPD=0 MPIRUN_HOST=fire7 MPIRUN_PORT=60546 MPIRUN_RANK=2 MPIRUN_NPROCS=4 MPIRUN_ID=6669    NOT_USE_TOTALVIEW=1  ./testtight.x
 6676  6672  6578              |   \_ [rsh] <defunct>
 6673  6669  6578              \_ /usr/bin/rsh fire8 cd /people4/jliu17/roberta/source; /usr/bin/env LD_LIBRARY_PATH=/usr/local/mpi/intel/mvapich-0.9.9/lib/shared:/usr/local/mpi/intel/mvapich-0.9.9/lib:/usr/local/mpi/intel/mvapich-0.9.9/lib/shared: MPIRUN_MPD=0 MPIRUN_HOST=fire7 MPIRUN_PORT=60546 MPIRUN_RANK=3 MPIRUN_NPROCS=4 MPIRUN_ID=6669    NOT_USE_TOTALVIEW=1  ./testtight.x
 6683  6673  6578                  \_ [rsh] <defunct>

Your thoughts are greatly appreciated.

Best regards,
Roberta

---------------------------------------------------------------------------------------------
Roberta M. Gigon
Schlumberger-Doll Research
One Hampshire Street, MD-B253
Cambridge, MA 02139
617.768.2099 - phone
617.768.2381 - fax

This message is considered Schlumberger CONFIDENTIAL.  Please treat the information contained herein accordingly.




More information about the gridengine-users mailing list