[GE users] fatal error, run database recovery

Christian Bolliger christian.bolliger at id.unizh.ch
Tue Mar 15 10:02:43 GMT 2005


    [ The following text is in the "ISO-8859-1" character set. ]
    [ Your display is set for the "ISO-8859-10" character set.  ]
    [ Some special characters may be displayed incorrectly. ]

Sorry I didn't see this early enough.
Two things can be done in such a case:
- Trying to fix db with an apropriate program (python, perl etc). Takes 
a long time and you might run in to problems latter.
- Delete just the 'sge_job' file (after a db dump and copying the whole 
directory). You will lose all job information but not the  configuration.

Sorry that I couldn't help in that case.

Regards
Christian


Beadles, Jeff wrote:

> FYI, I ended up blasting everything, and reinstalling the grid master 
> & clients.  I was able to recover part of the database, but there were 
> several known things missing (like all.q), and who knows what else 
> unknown.
>  
> Regards, -Jeff
>
> ------------------------------------------------------------------------
> *From:* Beadles, Jeff [mailto:jeff_beadles at mentorg.com]
> *Sent:* Mon 3/14/2005 8:28 AM
> *To:* users at gridengine.sunsource.net
> *Subject:* [GE users] fatal error, run database recovery
>
> We had a disk fill on the grid master (SGE 6.0u1) over the weekend, 
> and are now seeing the following in the qmaster's messages file when 
> trying to startup the grid master:
>  
> 03/13/2005 21:04:49|qmaster|gmaster|E|couldn't open database 
> environment for server "local spooling", directory "/grid/spooldb": 
> (-30978) DB_RUNRECOVERY: Fatal error, run database recovery
> 03/13/2005 21:04:49|qmaster|gmaster|E|startup of rule "default rule" 
> in context "berkeleydb spooling" failed
> 03/13/2005 21:04:49|qmaster|gmaster|C|setup failed
> I've not seen a database recovery program, does such a program exist?
>  
> Any ideas, short of reinstalling everything on how to correct this?
>  
> FYI, this is version 6.0u1, with bdb spooling on the local (grid 
> master) host on a local disk.
>  
> Thanks in advance,
>   -Jeff
>  


-- 
=============================================================================
Christian Bolliger                 
IT Services                      | http://www.id.unizh.ch/
Central Systems / HPC   	 | http://www.matterhorn.unizh.ch/
University of  Zuerich           | E-Mail: christian.bolliger at id.unizh.ch
Winterthurerstr. 190             | Tel: +41 (0)1 63 56775
CH-8057 Zuerich; Switzerland     | Fax: +41 (0)1 63 54505
Mime/S CA:                https://www.ca.unizh.ch/client/




More information about the gridengine-users mailing list