[GE users] run time intel compiler library libsvml not found

SLIM H.A. h.a.slim at durham.ac.uk
Thu Jan 17 17:29:25 GMT 2008


Dear Reuti

Apologies for the long delay to reply. I checked the web page you
referred to and the -V option solves the problem, thanks. 
However I noticed something curious: we use standard MPICH over ethernet
with sge/mpi/startmpi.sh -catch_rsh $pe_hostfile as the PE start script.
If I set 

job_is_first_task TRUE

in the definition of the PE, as sugested on the web page then MPICH
generates error messages. I do have to set 

control_slaves    TRUE
job_is_first_task FALSE

to get it to work. Why should this be?

Thanks

Henk

> 
> Aha, the slave task might not have the LD_LIBRARY_PATH. 
> Please add a - V to the rsh wrapper:
> 
> http://gridengine.sunsource.net/howto/mpich-integration.html
> 
> which will also solve other issues. And be sure to have a 
> Tight Integration, i.e. "setenv P4_RSHCOMMAND rsh" to use the 
> rsh-wrapper.
> 
> -- Reuti
>  

> -----Original Message-----
> From: Reuti [mailto:reuti at staff.uni-marburg.de] 
> Sent: 21 December 2007 13:36
> To: users at gridengine.sunsource.net
> Subject: Re: [GE users] run time intel compiler library 
> libsvml not found
> 
> Am 21.12.2007 um 13:04 schrieb SLIM H.A.:
> 
> > Maybe it clarifies if I show the script:
> >
> > #!/bin/csh
> > ... some standard sge options here
> > #$ -cwd
> > setenv MPICH_PROCESS_GROUP no
> > # request submission to a queue for parallel jobs #$ -q 
> par.q ##$ -S 
> > /bin/csh
> 
> This will be just a real comment, it's not #$ at the beginning.
> 
> > #   ^^ no effect
> > # set up the mpich version to use
> > # load the modules
> > module purge
> > module load intel/fce/9.0.032 mpich/ge/intel/64/1.2.7 
> sge/6.0u7_1 ldd 
> > ./monte echo LD_LIBRARY_PATH=$LD_LIBRARY_PATH # $ -v 
> > LD_LIBRARY_PATH=$LD_LIBRARY_PATH
> 
> This you can only use on the commandline, where 
> $LD_LIBRARY_PATH will be expanded by the shell. Here you 
> should see a literal $LD_LIBRARY_PATH echoed, unless -V is 
> used (space between # and $ is also not allowed)
> 
> > #   ^^ no effect
> > #$ -V
> > #   ^^ only works if the session shell has the module loaded as well
> 
> Seems okay.
> 
> > # execute command
> > mpirun -np $NSLOTS -machinefile $TMPDIR/machines ./monte
> >
> > I built monte with
> >
> > module purge
> > module load intel/fce/9.0.032 mpich/ge/intel/64/1.2.7 
> mpif90 monte.f90 
> > -o monte
> >
> > These are snippets from the output file ...
> >         libsvml.so =>
> > /usr/local/Cluster-Apps/intel/fce/9.0//lib/libsvml.so
> > (0x00002b21417de000)
> > ...
> > LD_LIBRARY_PATH=/usr/local/lib:/usr/X11R6/lib:/usr/local/Cluster-
> > Apps/in
> > tel/fce/9.0//lib:/usr/local/Cluster-App
> > s/mpich/ge/intel/64/1.2.7/lib/shared:/usr/local/Cluster-Apps/sge/
> > lib/lx2
> > 6-amd64
> > /usr/local/Cluster-Apps/sge/bin/lx24-amd64/qrsh -inherit -nostdin
> > node231 /data/hamilton/drk1has/hamilton_monte
> > pi/amd64_lnx_ifort/./monte node231 50375 \-p4amslave \-p4yourname
> > node231 \-p4rmrank 1
> > 
> /data/hamilton/drk1has/hamilton_montepi/amd64_lnx_ifort/./mont
> e: error 
> > while loading shared libraries: libsvml.
> > so: cannot open shared object file: No such file or directory ...
> 
> Aha, the slave task might not have the LD_LIBRARY_PATH. 
> Please add a - V to the rsh wrapper:
> 
> http://gridengine.sunsource.net/howto/mpich-integration.html
> 
> which will also solve other issues. And be sure to have a 
> Tight Integration, i.e. "setenv P4_RSHCOMMAND rsh" to use the 
> rsh-wrapper.
> 
> -- Reuti
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail: users-help at gridengine.sunsource.net
> 
> 

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list