[GE users] Failover tests

Rayson Ho raysonho at eseenet.com
Tue Mar 22 17:37:03 GMT 2005



>safest to most extreme?
>
>e.g. using the migrate command all the way up to a kill -9 on the
>sge_qmaster pid?

To play it *safe*, you can stop the qmaster the normal way, then go to
$SGE_ROOT/default/spool/qmaster/ (the cell is "default" in most cases) and
delete the lock file.
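
As a rough sketch of that step, assuming the lock file is simply called
"lock" inside the qmaster spool directory (check your own spool directory
before relying on that name), a small helper could look like the one below.
The function name release_qmaster_lock is made up for illustration, and
"qconf -km" is assumed to be how you normally stop the qmaster at your site.

```shell
#!/bin/sh
# Hypothetical helper: remove the qmaster lock file so that a shadow
# daemon is allowed to take over.
# ASSUMPTION: the lock file is named "lock" in the qmaster spool directory.
release_qmaster_lock() {
    spool_dir=$1
    lock_file="$spool_dir/lock"

    # Refuse to do anything if the spool directory is not there.
    if [ ! -d "$spool_dir" ]; then
        echo "spool directory not found: $spool_dir" >&2
        return 1
    fi

    # Delete the lock file if present, otherwise just report it.
    if [ -e "$lock_file" ]; then
        rm -f "$lock_file" && echo "removed $lock_file"
    else
        echo "no lock file in $spool_dir"
    fi
}

# Typical use (not executed here):
#   qconf -km     # stop the qmaster the normal way
#   release_qmaster_lock "${SGE_ROOT}/${SGE_CELL:-default}/spool/qmaster"
```

Only remove the lock file after the qmaster has actually exited, otherwise
you can end up with two masters working on the same spool directory.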

When the shadow daemon finds that the original master is not updating the
heartbeat file, and that there is no lock file, it will start a new
qmaster/schedd pair.
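
The two conditions above can be written down as a small check. This is only
an illustration of the decision, not the actual sge_shadowd code. The file
names "heartbeat" and "lock", the function name should_take_over and the
default wait of 60 seconds are all assumptions.

```shell
#!/bin/sh
# Sketch of the takeover decision described above (NOT the real shadowd logic).
# ASSUMPTION: files named "heartbeat" and "lock" live in the qmaster spool dir.
# Returns 0 if a takeover looks allowed, non-zero otherwise.
should_take_over() {
    spool_dir=$1
    wait_seconds=${2:-60}

    # A lock file means the shadow daemon must not start a new qmaster.
    if [ -e "$spool_dir/lock" ]; then
        return 1
    fi

    # Read the heartbeat twice, some time apart.
    # A missing heartbeat file is treated the same as one that is not updated.
    before=$(cat "$spool_dir/heartbeat" 2>/dev/null)
    sleep "$wait_seconds"
    after=$(cat "$spool_dir/heartbeat" 2>/dev/null)

    # Unchanged contents mean the master is not updating the heartbeat.
    [ "$before" = "$after" ]
}

# Typical use (not executed here):
#   if should_take_over "$SGE_ROOT/default/spool/qmaster" 60; then
#       echo "no heartbeat update and no lock file: takeover would happen"
#   fi
```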

Of course kill -9 works, but if qmaster is writing to the spool files at
that moment, killing it may corrupt the data.

Rayson





