[GE users] MPICH 1.2.5.2 and Signals

Brian R. Smith brian at cypher.acomp.usf.edu
Wed Oct 27 21:52:08 BST 2004


Here you go:


 1565     0     0 root      1565 /usr/local/sge/bin/lx24-x86/sge_execd
 6205     0     0 root      6205  \_ sge_shepherd-162 -bg
 6228   500   100 brs       6228      \_ bash /usr/local/sge/rccf/spool/n004/job_scripts/162
 6229   500   100 brs       6228          \_ /bin/sh /usr/local/mpich-intel/bin/mpirun -np 2 -machinefile /tmp/162.1.all
 6313   500   100 brs       6228              \_ /home/student/b/brs/bbmark/bbmark01 -p4pg /home/student/b/brs/bbmark/PI
 6314   500   100 brs       6228                  \_ /home/student/b/brs/bbmark/bbmark01 -p4pg /home/student/b/brs/bbmar
 6315   500   100 brs       6228                  \_ /usr/local/sge/bin/lx24-x86/qrsh -V -inherit -nostdin n001 /home/st
 6323   500   100 brs       6228                      \_ /usr/local/sge/utilbin/lx24-x86/rsh -n -p 34146 n001 exec '/usr

That was during the run.  It's properly killed now after a 'qdel'.
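
For anyone who wants to repeat the check after a 'qdel', here is a minimal sketch.  It is not part of SGE or MPICH; count_in_pgrp is a made-up helper name, and it assumes the job's processes all share one process group, as they do in the listing above (6228).  It only counts the lines of "pid pgrp command" input whose pgrp matches.

```shell
# Hypothetical helper (not an SGE or MPICH tool): count processes that
# belong to a given process group.
# Input on stdin: lines of "pid pgrp command...", e.g. from
#   ps -eo pid,pgrp,command
# Argument $1: the process group id to look for.
count_in_pgrp() {
    # A header line ("PID PGRP COMMAND") never matches a numeric pgrp,
    # so it is skipped automatically. "n + 0" prints 0 when nothing matched.
    awk -v pg="$1" '$2 == pg { n++ } END { print n + 0 }'
}

# Example on the head node (6228 is the pgrp from the listing above):
#   ps -eo pid,pgrp,command | count_in_pgrp 6228
```

A result of 0 after the 'qdel' would suggest nothing from that process group is left on the node; it says nothing about processes on the other nodes.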


Brian


On Wed, 2004-10-27 at 22:44 +0200, Reuti wrote:
> Quoting "Brian R. Smith" <brian at cypher.acomp.usf.edu>:
> 
> > I did set P4_RSHCOMMAND to /usr/local/sge/mpi/rsh BUT that was AFTER
> > setting it to "rsh" yielded no results.  I will try to set it to "rsh"
> > again though.  As for my submit script, it works fine.  I have no
> > problems with using -nolocal as I do not want processes running on my
> > master node.  I can submit jobs similar to and more complex than the
> > source you provided.  The problem comes when I want to clean those jobs
> > up before they finish themselves.
> 
> Here -nolocal refers to the master node of the job, i.e. the first slave node
> in the list of nodes SGE selected. If you get two slots on one machine you 
> should get an error message like "Not enough machines for platform: LINUX" - 
> because there are no machines left in the list.
> 
> Just got your next email: can you please run
> 
> ps f -eo pid,uid,gid,user,pgrp,command --cols=120
> 
> during execution of the job on the head node and after the kill?
> 
> CU - Reuti
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail: users-help at gridengine.sunsource.net

