[GE users] "usage" file?

Rayson Ho rayrayson at gmail.com
Fri May 2 21:42:53 BST 2008


    [ The following text is in the "ISO-8859-1" character set. ]
    [ Your display is set for the "ISO-8859-10" character set.  ]
    [ Some special characters may be displayed incorrectly. ]

The usage file (as well as the exit_status file) is designed for
shepherd to report information back to execd.

http://gridengine.sunsource.net/source/browse/*checkout*/gridengine/source/daemons/shepherd/shepherd.html

If your host spool directory is on an NFS partition, you may get this
kind of error when the NFS server is overloaded.

Rayson



On 5/2/08, John Marshall <John.Marshall at ec.gc.ca> wrote:
> Hi,
>
> What is the "usage" file, and why might it be causing a problem,
> namely, causing a queue error?
>
> This is under 6.1u3.
>
> I have attached a snippet from an email telling of the failure:
> -----
> failed before job:05/02/2008 19:57:22 [887:1336895831]: can't open file
> usage: Interrupted function call
> Shepherd trace:
> 05/02/2008 19:54:04 [887:1336895831]: shepherd called with uid = 0, euid =
> 887
> 05/02/2008 19:54:04 [887:1336895831]: starting up 6.1u3
> ...
> 5/02/2008 19:57:21 [887:1336895831]: wait3 returned -1
> 05/02/2008 19:57:21 [887:1336895831]: mapped signal TSTP to signal KILL
> 05/02/2008 19:57:21 [887:1336895831]: queued signal KILL
> 05/02/2008 19:57:21 [887:1336895831]: kill(-1300599732, KILL)
> 05/02/2008 19:57:21 [887:1336895831]: now sending signal KILL to pid
> -1300599732
> 05/02/2008 19:57:22 [887:1336895831]: wait3 returned 1300599732 (status: 9;
> WIFSIGNALED: 1,  WIFEXITED: 0, WEXITSTATUS: 0)
> 05/02/2008 19:57:22 [887:1336895831]: job exited with exit status 0
> 05/02/2008 19:57:22 [887:1336895831]: reaped "job" with pid 1300599732
> 05/02/2008 19:57:22 [887:1336895831]: job exited due to signal
> 05/02/2008 19:57:22 [887:1336895831]: job signaled: 9
> 05/02/2008 19:57:22 [887:1336895831]: ignored signal KILL to pid -1300599732
> 05/02/2008 19:57:22 [887:1336895831]: writing usage file to "usage"
> 05/02/2008 19:57:22 [887:1336895831]: can't open file usage: Interrupted
> function call
>
> Shepherd error:
> 05/02/2008 19:57:22 [887:1336895831]: can't open file usage: Interrupted
> function call
> -----
>
> Thanks,
> John
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail:
> users-unsubscribe at gridengine.sunsource.net
> For additional commands, e-mail:
> users-help at gridengine.sunsource.net
>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe at gridengine.sunsource.net
For additional commands, e-mail: users-help at gridengine.sunsource.net




More information about the gridengine-users mailing list