[GE users] Problems with LAM tight integration

Reuti reuti at staff.uni-marburg.de
Sun Aug 6 16:11:11 BST 2006


On 04.08.2006 at 21:36, slaton wrote:

>> Okay, please add the -v -d to lamboot, maybe we see something there.
>
> OK. From the (pe) file:
>
> n-1<1731> ssi:boot:open: opening
> <snip>
> n-1<1731> ssi:boot:rsh: starting on n0 (qcn16): hboot -t -c lam-conf.lamd -d -v -sessionsuffix sge-110-undefined -I -H 10.0.1.16 -P 32771 -n 0 -o 0
> n-1<1731> ssi:boot:rsh: launching locally
> n-1<1731> ssi:boot:rsh: successfully launched on n0 (qcn16)
> n-1<1731> ssi:boot:base:server: expecting connection from finite list
> error: ERROR! invalid option argument "-n"
> -----------------------------------------------------------------------------
> The lamboot agent timed out while waiting for the newly-booted process
> to call back and indicated that it had successfully booted.
> [snip]

This all looks fine. The first lamd, i.e. the local one, is started
directly by the local hboot (without any rsh). Only the lamd-wrapper
makes a qrsh call. Can you put a few echo statements in your
lamd-wrapper:

echo "$@"
echo $ENVIRONMENT
echo $PE
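
A slightly more defensive variant of the lines above, as a minimal sketch only. The function name debug_wrapper is made up for illustration, and it assumes the wrapper is a plain sh script. It prints a visible marker when ENVIRONMENT or PE is not set at all, and writes to stderr so the output cannot get mixed into anything read from stdout:

```shell
#!/bin/sh
# Hypothetical debugging helper for a lamd-wrapper script (name is an assumption).
debug_wrapper() {
    # "$*" joins all arguments of the call into a single line
    echo "args: $*"
    # ${VAR:-<unset>} makes an empty or missing variable visible in the output
    echo "ENVIRONMENT: ${ENVIRONMENT:-<unset>}"
    echo "PE: ${PE:-<unset>}"
}

# Inside the wrapper, pass on the wrapper's own arguments and log to stderr
debug_wrapper "$@" >&2
```

If both variables show up as <unset>, the wrapper is most likely not running inside the job's environment.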

-- Reuti

> Curious that it never attempts to start lamd on n1 (qcn17). Maybe
> because it doesn't get a successful callback from qcn16.
>
> From the (po) file:
>
> [snip]
> tkill: setting prefix to (null)
> tkill: setting suffix to sge-110-undefined
> tkill: got killname back: /tmp/110.1.testing/lam-slaton@qcn16-sge-110-undefined/lam-killfile
> tkill: f_kill = "/tmp/110.1.testing/lam-slaton@qcn16-sge-110-undefined/lam-killfile"
> tkill: nothing to kill: "/tmp/110.1.testing/lam-slaton@qcn16-sge-110-undefined/lam-killfile"
> hboot: performing tkill
> hboot: tkill -sessionsuffix sge-110-undefined -d
> hboot: booting...
> hboot: fork /usr/local/lam/sge/pgi/bin/lamd
> [1]   1734 lamd -H 10.0.1.16 -P 32771 -n 0 -o 0 -d -sessionsuffix sge-110-undefined
> [snip]
>
>
> thanks
> slaton
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail: users-help at gridengine.sunsource.net




