[GE users] Help: Checkpoint Problem

Lee Amy openlinuxsource at gmail.com
Thu Oct 9 13:21:54 BST 2008


    [ The following text is in the "ISO-8859-1" character set. ]
    [ Your display is set for the "ISO-8859-10" character set.  ]
    [ Some special characters may be displayed incorrectly. ]

2008/10/9 Reuti <reuti at staff.uni-marburg.de>

> Hi,
>
> Am 09.10.2008 um 07:36 schrieb Lee Amy:
>
>  I run parallel bioinformatics software at a cluster. And MPI
>> implementation is Open MPI 1.2.7. I know that this bioinformatis software
>> dosen't have built-in checkpoint function. So my problem is can I use SGE to
>> achieve that? However I have read the great howtos written by Reuti at
>> http://gridengine.sunsource.net/howto/checkpointing.html
>>
>
> first goal in your case should be, to get checkpointing working without
> SGE. For now it's not in the stable version of Open MPI:
>
> http://www.open-mpi.org/faq/?category=ft
>
> You might try the developer version or LAM/MPI in combination with BCLR (
> http://ftg.lbl.gov/CheckpointRestart/CheckpointRestart.shtml). If you have
> this working, the checkpoint creation and migration can be triggered by SGE.
>
> SGE will support checkpointing, but doesn't provide it to the application
> on its own.
>
> -- Reuti
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail: users-help at gridengine.sunsource.net
>
> Reuti,

Thank you very much, that's quite clear.

Amy



More information about the gridengine-users mailing list