Opened 13 years ago

Last modified 6 years ago

#168 new enhancement

IZ1017: qstat shows jobs running on an execd that has disappeared.

Reported by: omarhass
Owned by:
Priority: high
Milestone:
Component: sge
Version: 5.3
Severity:
Keywords: Sun Solaris clients
Cc:

Description

[Imported from gridengine issuezilla http://gridengine.sunsource.net/issues/show_bug.cgi?id=1017]

   Issue #:           1017
   Platform:          Sun
   OS:                Solaris
   Reporter:          omarhass (omarhass)
   Component:         gridengine
   Subcomponent:      clients
   Version:           5.3
   CC:                None defined
   Status:            NEW
   Priority:          P2
   Resolution:
   Issue type:        ENHANCEMENT
   Target milestone:  ---
   Assigned to:       andreas (andreas)
   QA Contact:        roland
   URL:
   Summary:           qstat shows jobs running on an execd that has disappeared.
   Status whiteboard:
   Attachments:

   Issue 1017 blocks:
   Votes for issue 1017:


   Opened: Tue May 4 07:32:00 -0700 2004 
------------------------


qstat and qmon show jobs as running on an execution host
that has crashed or is no longer available. I
believe that sge_qmaster should be able to tell when the
execution host is unavailable and change its jobs to
an UNKNOWN state. We could have an environment variable
that sets a timeout for how long sge_qmaster waits
to hear from sge_execd before it takes this action.
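
To make the request concrete, here is a minimal sketch of the proposed
behaviour. It is not Grid Engine code. The variable name SGE_EXECD_TIMEOUT,
the 300-second default, the state strings and both function names are
assumptions made up for illustration. The sketch also assumes qmaster keeps
a timestamp of the last report received from each execd.

```python
import os

# Hypothetical environment variable name; not an existing Grid Engine setting.
TIMEOUT_ENV = "SGE_EXECD_TIMEOUT"
DEFAULT_TIMEOUT = 300.0  # seconds; arbitrary default chosen for this sketch


def execd_timeout(environ=os.environ):
    """Read the timeout in seconds from the environment, else use the default."""
    try:
        value = float(environ.get(TIMEOUT_ENV, DEFAULT_TIMEOUT))
    except ValueError:
        # Unparsable value: fall back rather than fail.
        return DEFAULT_TIMEOUT
    return value if value > 0 else DEFAULT_TIMEOUT


def mark_unheard_jobs(jobs, last_heard, now, timeout):
    """Return a copy of jobs with RUNNING jobs on silent hosts set to UNKNOWN.

    jobs:       dict of job_id -> (host, state)
    last_heard: dict of host -> time of the last report from that host's execd
    """
    updated = {}
    for job_id, (host, state) in jobs.items():
        # A host that has never reported counts as silent forever.
        silent_for = now - last_heard.get(host, float("-inf"))
        if state == "RUNNING" and silent_for > timeout:
            state = "UNKNOWN"
        updated[job_id] = (host, state)
    return updated
```

With this in place, qstat and qmon would only need to display the UNKNOWN
state; jobs on hosts that are still reporting, and jobs that are not
running, are left untouched.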

   ------- Additional comments from sgrell Mon Dec 12 02:55:23 -0700 2005 -------
Changed the Subcomponent.

Stephan
