[GE users] Application-Level Checkpointing

Reuti reuti at staff.uni-marburg.de
Mon Dec 17 22:01:20 GMT 2007


Hi,

Am 17.12.2007 um 19:21 schrieb Dev:

> Is it that the running application should completely get killed  
> before SGE decides to restart the job ?

yes, but you have to kill it on your own in the migrate script.  
Otherwise you might end up with the same job running twice.

-- Reuti

>
>
> Dev <dev_hyd2001 at yahoo.com> wrote:
> Hi,
>
>        Using Application Level Checkpointing and providing a  
> migrate script to it, whats the criteria for the job to be  
> restarted by SGE, for example once it has been unsuspended ?  My  
> test job doing a sleep gets restarted by SGE but some other jobs  
> don't seem to get restarted .( This is with SGE 6.0u6 though )
>
> cheers
>
> /Dev
>
> Be a better friend, newshound, and know-it-all with Yahoo! Mobile.  
> Try it now.
>
>
> Looking for last minute shopping deals? Find them fast with Yahoo!  
> Search.




More information about the gridengine-users mailing list