[GE users] Automated Install Hangs

Chris Dagdigian dag at sonsorol.org
Thu Sep 18 00:18:32 BST 2008


Hi Gary,

This is what I do when debugging auto install failures:

Edit the inst_sge script and change the first line to "/bin/sh -x" to  
enable debug output. You'll get a massive flood of output but in my  
experience it usually shows exactly where and why the error is  
happening.

-Chirs



On Sep 17, 2008, at 6:59 PM, Gary Richardson wrote:

> Hey,
>
> I'm trying to get automated execution client installs working. When  
> I run the install, the execd installs and starts up, but the install  
> process hangs. If I run top or ps aux | grep inst_sge, it looks like  
> it's continually forking processes that are exiting:
>
> [root@<removed>] ps auxww | grep inst_
> root      9855 10.0  1.5  27456 26240 ttyp0    S    15:47   0:56 / 
> bin/sh ./inst_sge -x -auto /tmp/local.conf -noremote
> root      9653  9.7  1.4  27020 25836 ttyp0    R    15:47   0:55 / 
> bin/sh ./inst_sge -x -auto /tmp/local.conf
> root     17032  0.0  1.4  27456 25620 ttyp0    R    15:57   0:00 / 
> bin/sh ./inst_sge -x -auto /tmp/local.conf -noremote
>
> In my cluster, I have a master server that exports it's SGE_ROOT  
> directory as read only through NFS. Execution daemons mount this in  
> the same place on their system. I'm creating an local execution  
> spool dir.
>
> If I do an interactive install, everything happens properly.
>
> Any advice?
>
> Thanks!
>


---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list