[GE users] Automated Install Hangs

Gary Richardson gary.richardson at gmail.com
Thu Sep 18 00:32:07 BST 2008



Good call. It looks like it's trying to copy the log file to the install log
dir:

+ /bin/echo -e

+ ./utilbin/lx24-x86/infotext -log 'Command failed: %s' cp -f
/tmp/install.30775
/mnt/grid/l268/sgeadmin/default/common/install_logs/execd_install_ip-10-251-126-21_2008-09-17_16:26:33.log
Command failed:
/mnt/grid/l268/sgeadmin/default/common/install_logs/execd_install_ip-10-251-126-21_2008-09-17_16:26:33.logcp-f/tmp/install.30775
+ ./utilbin/lx24-x86/infotext -log 'Probably a permission problem. Please
check file access permissions.'
Probably a permission problem. Please check file access permissions.
+ ./utilbin/lx24-x86/infotext -log 'Check read/write permission. Check if
SGE daemons are running.'
Check read/write permission. Check if SGE daemons are running.

I don't see anywhere in the config template to specify a different logging
directory. Is there a way?
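
For anyone who wants to confirm the same failure on their own exec host, here is a minimal sketch of a write test. The function name check_log_dir and the probe file name are made up for illustration and are not part of inst_sge. It attempts a real write instead of inspecting mode bits, on the assumption that a read-only NFS export can still show permissive permissions:

```shell
#!/bin/sh
# Hypothetical helper (not part of SGE): report whether a directory
# accepts new files by actually creating one and removing it again.
check_log_dir() {
    dir=$1
    if [ ! -d "$dir" ]; then
        echo "missing: $dir"
        return 2
    fi
    # Probe file name is an arbitrary choice; $$ keeps it unique per shell.
    probe="$dir/.write_probe.$$"
    # The subshell keeps a failed redirection from aborting the caller.
    if ( : > "$probe" ) 2>/dev/null; then
        rm -f "$probe"
        echo "writable: $dir"
        return 0
    fi
    echo "read-only or permission denied: $dir"
    return 1
}
```

Calling it as check_log_dir /mnt/grid/l268/sgeadmin/default/common/install_logs (the path from the trace above) on the exec host should show whether the cp in the trace has any chance of succeeding there.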

Thanks!

On Wed, Sep 17, 2008 at 4:18 PM, Chris Dagdigian <dag at sonsorol.org> wrote:

> Hi Gary,
>
> This is what I do when debugging auto install failures:
>
> Edit the inst_sge script and change the first line to "/bin/sh -x" to
> enable debug output. You'll get a massive flood of output but in my
> experience it usually shows exactly where and why the error is happening.
>
> -Chris
>
>
>
>
> On Sep 17, 2008, at 6:59 PM, Gary Richardson wrote:
>
>  Hey,
>>
>> I'm trying to get automated execution client installs working. When I run
>> the install, the execd installs and starts up, but the install process
>> hangs. If I run top or ps aux | grep inst_sge, it looks like it's
>> continually forking processes that are exiting:
>>
>> [root@<removed>] ps auxww | grep inst_
>> root      9855 10.0  1.5  27456 26240 ttyp0    S    15:47   0:56 /bin/sh
>> ./inst_sge -x -auto /tmp/local.conf -noremote
>> root      9653  9.7  1.4  27020 25836 ttyp0    R    15:47   0:55 /bin/sh
>> ./inst_sge -x -auto /tmp/local.conf
>> root     17032  0.0  1.4  27456 25620 ttyp0    R    15:57   0:00 /bin/sh
>> ./inst_sge -x -auto /tmp/local.conf -noremote
>>
>> In my cluster, I have a master server that exports its SGE_ROOT directory
>> as read only through NFS. Execution daemons mount this in the same place on
>> their system. I'm creating a local execution spool dir.
>>
>> If I do an interactive install, everything happens properly.
>>
>> Any advice?
>>
>> Thanks!
>>
>>
>


