[GE users] Application-Level Checkpointing
reuti at staff.uni-marburg.de
Mon Dec 17 22:01:20 GMT 2007
Am 17.12.2007 um 19:21 schrieb Dev:
> Is it that the running application should completely get killed
> before SGE decides to restart the job ?
yes, but you have to kill it on your own in the migrate script.
Otherwise you might end up with the same job running twice.
> Dev <dev_hyd2001 at yahoo.com> wrote:
> Using Application Level Checkpointing and providing a
> migrate script to it, whats the criteria for the job to be
> restarted by SGE, for example once it has been unsuspended ? My
> test job doing a sleep gets restarted by SGE but some other jobs
> don't seem to get restarted .( This is with SGE 6.0u6 though )
> Be a better friend, newshound, and know-it-all with Yahoo! Mobile.
> Try it now.
> Looking for last minute shopping deals? Find them fast with Yahoo!
More information about the gridengine-users