Opened 17 years ago

Last modified 9 years ago

#65 new enhancement

IZ361: Need better documenation in manual and man page for checkpointing

Reported by: andy Owned by:
Priority: normal Milestone:
Component: sge Version: 5.3
Severity: Keywords: doc
Cc:

Description

[Imported from gridengine issuezilla http://gridengine.sunsource.net/issues/show_bug.cgi?id=361]

        Issue #:      361              Platform:     All           Reporter: andy (andy)
       Component:     gridengine          OS:        All
     Subcomponent:    doc              Version:      5.3              CC:
                                                                             [_] reuti
                                                                             [_] Remove selected CCs
        Status:       NEW              Priority:     P3
      Resolution:                     Issue type:    ENHANCEMENT
                                   Target milestone: ---
      Assigned to:    markob (markob)
      QA Contact:     janicecritchlow
          URL:
       * Summary:     Need better documenation in manual and man page for checkpointing
   Status whiteboard:
      Attachments:

     Issue 361 blocks:
   Votes for issue 361:


   Opened: Wed Aug 21 05:31:00 -0700 2002 
------------------------


Need better documentation in manual and man page
for checkpointing.

Both describe the syntax and semantics of the
various checkpointing environments insufficiently
to be able to setup such an environment oneself an
an easy way.

   ------- Additional comments from reuti Sun Sep 5 11:04:58 -0700 2004 -------
-) There should be some phase diagram in the Administration Guide, like the
first pages of the Howto for Berkeley Checkpoting to explain how the
checkpointing procesdures are invoked (small mistake there: in the
userdefined checkpointing interface the signal is only sent for
min_cpu_interval, but not when it get migrated, i.e. requeued) .

-) The variables, which are available in the definition of the clean / restart /
migrate / checkpoint procedure of a checkpointing interface are nowhere
explained (currently you can only have a look at the example checkpoint
interfaces in the $SGE_ROOT/ckpt directory).

-) It should be pointed out, that a `qmod -s[j,q]` will migrate the job (by hand)
instead of suspending it (also in `man qmod` I think it would be useful).

   ------- Additional comments from andreas Thu Oct 21 07:04:08 -0700 2004 -------
Assign Admin/User/Install guide related issues to Mark O'Brien.

Change History (0)

Note: See TracTickets for help on using tickets.