Opened 13 years ago

Last modified 9 years ago

#342 new defect

IZ2031: shadow master fails to take over if scheduler dies

Reported by: jeffbeadles Owned by:
Priority: normal Milestone:
Component: sge Version: 6.0u7
Severity: Keywords: scheduling
Cc:

Description

[Imported from gridengine issuezilla http://gridengine.sunsource.net/issues/show_bug.cgi?id=2031]

        Issue #:      2031             Platform:     All      Reporter: jeffbeadles (jeffbeadles)
       Component:     gridengine          OS:        All
     Subcomponent:    scheduling       Version:      6.0u7       CC:    None defined
        Status:       NEW              Priority:     P3
      Resolution:                     Issue type:    DEFECT
                                   Target milestone: ---
      Assigned to:    sgrell (sgrell)
      QA Contact:     andreas
          URL:
       * Summary:     shadow master fails to take over if scheduler dies
   Status whiteboard:
      Attachments:

     Issue 2031 blocks:
   Votes for issue 2031:


   Opened: Wed Apr 12 13:09:00 -0700 2006 
------------------------


If the scheduler dies on the qmaster, the shadow master never takes over the
qmaster/scheduler responsibilities.

To duplicate, setup a small grid with two hosts, one being the qmaster, and a
second as a shadow master.  Then, kill -9 the scheduler pid, and monitor
$SGE_ROOT/$SGE_CELL/common/act_qmaster, and note that it never fails over to
the shadow host.

Change History (0)

Note: See TracTickets for help on using tickets.