[GE users] Problems booting sge

Ron Chen ron_chen_123 at yahoo.com
Wed Jul 21 14:36:19 BST 2004


You can check:

$SGE_ROOT/default/common/local_conf/ztv121[*]
$SGE_ROOT/default/spool/qmaster/complexes
and many more :(

>From you error message, looks like a lot of the files
and directories are corrupted -- I hope you did backup
your files before.

 -Ron

--- "Basabe Echezarraga, Jose Luis"
<joseluis.basabe at itp.es> wrote:
> Thanks for your fast answer. 
> Which one are the more probable configuration files?
> 
> 
> Jose Luis
> 
> -----Mensaje original-----
> De: Ron Chen [mailto:ron_chen_123 at yahoo.com] 
> Enviado el: mi?rcoles, 21 de julio de 2004 14:48
> Para: users at gridengine.sunsource.net
> Asunto: Re: [GE users] Problems booting sge
> 
> Did the NFS server or the qmaster node crash? Looks
> like the configuation
> files got corrupted.
> 
> If you have them in backup, you can see if they are
> different than the
> current versions.
> 
>  -Ron
> 
> 
> --- "Basabe Echezarraga, Jose Luis"
> <joseluis.basabe at itp.es> wrote:
> > In the last times the SGE installation runs OK. 
> > (version 5.3)
> > But today when I restart the daemons, the qmaster
> doesn?t work.
> >  
> > bash-2.03# ps -ef | grep sge
> >     root  1470     1  0 13:01:52 ?        0:00
> > /aplic_nfs/sge53/bin/solaris64/sge_commd
> >     root  1473     1  0 13:01:58 ?        0:00
> > /aplic_nfs/sge53/bin/solaris64/sge_schedd
> > bash-2.03#
> > 
> > The message is the following one.
> >  
> > 
> > bash-2.03# /etc/rc2.d/S95rcsge stop
> >    Shutting down Grid Engine scheduler
> >    Shutting down Grid Engine qmaster
> > /aplic_nfs/sge53/default/spool/ztv121/active_jobs:
> > No such file or directory
> >    Shutting down Grid Engine communication daemon
> > error: CANNOT CONNECT
> > bash-2.03# /etc/rc2.d/S95rcsge start
> >    starting sge_qmaster
> > starting program:
> > /aplic_nfs/sge53/bin/solaris64/sge_commd
> > using service "sge_commd"
> > bound to port 536
> > found no local configuration for qmaster host
> "ztv121"
> > Reading in complexes:
> >         Complex "host".
> >         Complex "queue".
> > error: may be commd is locked:
> > error: can't resolve hostname "UNKNOW"
> > error: can't resolve hostname "UNKNOW"
> > critical error: setup failed
> >    starting sge_schedd
> > error: getting configuration: unable to contact
> qmaster via "" commd - 
> > qmaster not enrolled at commd
> > error: can't get configuration from qmaster --
> backgrounding 
> > bash-2.03#
> > 
> > 
> > Who can help me?
> >  
> > Thanks
> >  
> >  
> > 
> >  
> >  
> > 
> > 
> 
> 
> 
> 		
> __________________________________
> Do you Yahoo!?
> Yahoo! Mail Address AutoComplete - You start. We
> finish.
> http://promotions.yahoo.com/new_mail 
> 
>
---------------------------------------------------------------------
> To unsubscribe, e-mail:
> users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail:
> users-help at gridengine.sunsource.net
> 
>
---------------------------------------------------------------------
> To unsubscribe, e-mail:
> users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail:
> users-help at gridengine.sunsource.net
> 
> 



	
		
__________________________________
Do you Yahoo!?
Vote for the stars of Yahoo!'s next ad campaign!
http://advision.webevents.yahoo.com/yahoo/votelifeengine/

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list