[GE users] Mac - can't resolve group error ?

Barry McInnes Barry.J.Mcinnes at noaa.gov
Tue Nov 6 16:47:19 GMT 2007


    [ The following text is in the "ISO-8859-1" character set. ]
    [ Your display is set for the "ISO-8859-10" character set.  ]
    [ Some special characters may be displayed incorrectly. ]

I updated from 6.1 to 6.1u2, this stops the server startup from logging
the group error - now I get no errors, but still no startup.
When I do
sh -v /opt/n1ge6/default/common/sgemaster start
I get the following... the sge_qmaster port is correct at 540

startup stuff deleted

CheckIfPrimaryQmasterHost $HOST
cat $fname
   starting sge_qmaster
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
$utilbin_dir/getservbyname -number sge_qmaster
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1
cat $SGE_ROOT/$SGE_CELL/common/act_qmaster
expr $loop + 1

sge_qmaster didn't start!
Please check the messages file

   starting sge_schedd
error: commlib error: can't connect to service (Connection refused)
error: getting configuration: unable to contact qmaster using port 540
on host "g5s1.cdc.noaa.gov"
error: can't get configuration from qmaster -- backgrounding
cat $pidfile
   starting sge_shadowd
[g5s1:/opt/n1ge6] admin%




On 11/6/07 8:10 AM, Barry McInnes wrote:
> I thought this was just a 10.5 client error, but when I updated 10.4.10
> server which is the qmaster, I get the same error in messages.
> qmaster|host|C|can't resolve group
> sge_schedd and sge_shadowd startup, but then any sge command
> gives unable  to contact qmaster using port 540 on host
> I have tried using a different port number to no avail.
> 
> The group error has never occured before 10.5 or 10.4.10 update ?
> 
> I have run disk and permission checks. When I revert to server 10.4.8
> SGE starts fine, but I would like to run 10.5 if possible,
> 
> thanks for any help
> 

-- 
---
Barry McInnes
325 Broadway
Boulder CO 80304
(303)4976231
barry.j.mcinnes at noaa.gov
---

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list