Opened 10 years ago

Last modified 9 years ago

#912 new defect

IZ616: External script startup errors are not reported

Reported by: rhierlmeier Owned by:
Priority: normal Milestone:
Component: hedeby Version: current
Severity: Keywords: Sun util
Cc:

Description

[Imported from gridengine issuezilla http://gridengine.sunsource.net/issues/show_bug.cgi?id=616]

        Issue #:      616          Platform:     Sun         Reporter: rhierlmeier (rhierlmeier)
       Component:     hedeby          OS:        All
     Subcomponent:    util         Version:      current        CC:    None defined
        Status:       NEW          Priority:     P3
      Resolution:                 Issue type:    DEFECT
                               Target milestone: 1.0u5next
      Assigned to:    aja (aja)
      QA Contact:     rhierlmeier
          URL:
       * Summary:     External script startup errors are not reported
   Status whiteboard:
      Attachments:


     Issue 616 blocks:


   Opened: Thu Jan 29 02:38:00 -0700 2009 
------------------------


   Description

   The SDM system executes external scripts in several places. If such a script
   cannot be executed, the real error message is not reported.

   I ran into this on a host that had run out of memory. I tried to start an SDM
   JVM on this host with:

   % sdmadm -d suj
   jvm         host   result message
   --------------------------------------------------------
   cs_vm       master ERROR  Fork of jvm cs_vm failed: arch
                             script has been interrupted
   com.sun.grid.grm.GrmException: Fork of jvm cs_vm failed: arch script has been interrupted
           at com.sun.grid.grm.bootstrap.ParentStartupService.start(ParentStartupService.java:302)
           at com.sun.grid.grm.ui.component.StartJVMCommand.execute(StartJVMCommand.java:91)
           at com.sun.grid.grm.ui.impl.CommandServiceImpl.execute(CommandServiceImpl.java:102)
           at com.sun.grid.grm.cli.cmd.components.StartJVMCliCommand.execute(StartJVMCliCommand.java:88)
           at com.sun.grid.grm.cli.AbstractCli.run(AbstractCli.java:278)
           at com.sun.grid.grm.cli.SdmAdm.main(SdmAdm.java:160)
           at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
           at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
           at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
           at java.lang.reflect.Method.invoke(Method.java:597)
           at com.sun.grid.grm.util.MainWrapper$SystemRunThread.run(MainWrapper.java:434)
   Caused by: java.io.IOException: arch script has been interrupted
           at com.sun.grid.grm.util.UnixPlatform.getArch(UnixPlatform.java:541)
           at com.sun.grid.grm.bootstrap.ParentStartupService.fork(ParentStartupService.java:339)
           at com.sun.grid.grm.bootstrap.ParentStartupService.start(ParentStartupService.java:285)
           ... 10 more

   The error message gives the user no hint as to what the real problem is.
   The error message “arch script has been interrupted” is definitely wrong.

   Evaluation

   This issue has only limited impact on the functionality of the system. However,
   for support cases it is vital to get correct error messages.


   Suggested Fix/Work Around

   We have no workaround for this issue; it must be fixed.

   Analysis

   The native method UnixPlatform_executeNativeScript does not report the error
   message if popen fails.
   We should allow this method to throw an IOException in this case. The message
   of the exception should be the popen error message from errno.
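
   A minimal sketch of the idea in plain C, not the actual Hedeby native code:
   the helper name run_script and its parameters are hypothetical. It keeps the
   errno text when popen fails and also reports a non-zero exit status from
   pclose, since popen starts the command through a shell and some startup
   failures only become visible there. In the JNI layer the resulting text
   could then be passed to ThrowNew together with the java/io/IOException
   class; that step is left out so the block compiles without a JDK.

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <errno.h>
#include <stdio.h>
#include <string.h>
#include <sys/wait.h>

/*
 * Hypothetical helper (an assumption, not the Hedeby implementation):
 * run `cmd` through popen(), copy its first output line into `out`,
 * and describe any failure in `err`.
 * Returns 0 on success and -1 on failure.
 */
static int run_script(const char *cmd, char *out, size_t outlen,
                      char *err, size_t errlen)
{
    assert(cmd != NULL && out != NULL && err != NULL);
    assert(outlen > 0 && errlen > 0);
    out[0] = '\0';
    err[0] = '\0';

    errno = 0;
    FILE *fp = popen(cmd, "r");
    if (fp == NULL) {
        /* popen() itself failed (fork or pipe error, for example ENOMEM):
         * keep the errno text instead of a generic message. */
        snprintf(err, errlen, "popen(%s) failed: %s", cmd, strerror(errno));
        return -1;
    }

    /* First line is the result; an empty result is judged by the exit status. */
    if (fgets(out, (int)outlen, fp) == NULL) {
        out[0] = '\0';
    }

    /* Drain any remaining output so the child can finish writing. */
    char discard[256];
    while (fgets(discard, sizeof discard, fp) != NULL) {
    }

    int status = pclose(fp);
    if (status == -1) {
        snprintf(err, errlen, "pclose(%s) failed: %s", cmd, strerror(errno));
        return -1;
    }
    if (WIFSIGNALED(status)) {
        snprintf(err, errlen, "%s was killed by signal %d", cmd, WTERMSIG(status));
        return -1;
    }
    if (WIFEXITED(status) && WEXITSTATUS(status) != 0) {
        snprintf(err, errlen, "%s exited with status %d", cmd, WEXITSTATUS(status));
        return -1;
    }
    return 0;
}
```

   A caller would check the return value and, on -1, use the text in err as
   the exception message instead of a fixed string.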


   How to test

   Remove the executable permission from the arch script (<dist>/util/arch).
   Start an SDM JVM.
   You will get the above error message.
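
   The steps above can be approximated outside SDM with a small C program. This
   is a sketch under assumptions: it uses a throwaway script in /tmp instead of
   the real <dist>/util/arch, and the function name is hypothetical. On POSIX
   systems popen hands the command to a shell, so for a script without execute
   permission popen usually succeeds and the failure shows up as the exit
   status returned by pclose (typically 126, "found but not executable").

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <sys/stat.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/*
 * Hypothetical reproduction helper: create a temporary script without
 * execute permission, start it through popen(), and return the exit
 * status seen by pclose().
 * Returns -1 if setup, popen() or pclose() fails, and -2 if the child
 * did not exit normally.
 */
static int exit_status_of_non_executable_script(void)
{
    char path[] = "/tmp/arch_testXXXXXX";   /* stand-in for <dist>/util/arch */
    int fd = mkstemp(path);
    if (fd == -1) {
        return -1;
    }

    const char *body = "#!/bin/sh\necho test-arch\n";
    ssize_t written = write(fd, body, strlen(body));
    /* Mode 0644: readable, but no execute bit for anyone. */
    int setup_ok = written == (ssize_t)strlen(body) && fchmod(fd, 0644) == 0;
    close(fd);
    if (!setup_ok) {
        unlink(path);
        return -1;
    }

    /* Hide the shell's own "Permission denied" text; only the status matters. */
    char cmd[128];
    snprintf(cmd, sizeof cmd, "%s 2>/dev/null", path);

    FILE *fp = popen(cmd, "r");
    if (fp == NULL) {
        unlink(path);
        return -1;
    }
    char line[64];
    while (fgets(line, sizeof line, fp) != NULL) {
        /* No output is expected because the script never starts. */
    }
    int status = pclose(fp);
    unlink(path);

    if (status == -1) {
        return -1;
    }
    return WIFEXITED(status) ? WEXITSTATUS(status) : -2;
}
```

   If the native method only checks whether popen returned NULL, this case
   would pass unnoticed, so a fix probably needs to look at the pclose status
   as well.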


   ATC: 0.5 PD
   ETC: 2 PD
               ------- Additional comments from aja Tue Oct 20 08:51:09 -0700 2009 -------
   accepting the issue
               ------- Additional comments from rhierlmeier Wed Nov 25 07:21:11 -0700 2009 -------
   Milestone changed
               ------- Additional comments from rhierlmeier Wed Nov 25 07:24:05 -0700 2009 -------
   Reassigned to aja
