[GE users] Documentation about SGE

Rayson Ho raysonho at eseenet.com
Fri Jul 23 18:05:09 BST 2004


    [ The following text is in the "iso-8859-1" character set. ]
    [ Your display is set for the "ISO-8859-10" character set.  ]
    [ Some special characters may be displayed incorrectly. ]

>o LSF *by far* has fault tolerance and resiliancy features that 
>  blow away all the competition. The SGE shadow master does not 
>  come close to LSF's ability to keep "electing" a new master node 
>  as systems fail or drop offline one by one.

Mostly agree with your other points, but the point above is not to fair to
SGE: LSF can do that because the "sbatchd" is a fat daemon with the
functionality of SGE's execd (the job management part, not the load
reporting part) and shadowd.

SGE can also keep on electing new master node also, but it is not done this
way likely because the SGE designers think that the cluster admin should
configure it when it is needed rather than setting every batch node to be
the fail-over master by default.

Rayson

---------------------------------------------------------
Get your FREE E-mail account at http://www.eseenet.com !

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list