[GE users] Jobs getting rescheduled

reuti reuti at staff.uni-marburg.de
Mon Aug 16 18:29:52 BST 2010

Am 16.08.2010 um 19:19 schrieb amfortas:

> Many thinks for responding.
>> jobs were submit with "-r y" and/or the queue has the flag "rerun TRUE" set?
> Yes, that is set for the queue, to catch the occasional job that may need to be rescheduled owing to a problem on a work-node.

Is the job rescheduling itself, or just when a node gets "unheard" for some time?

> But what is surprising is that every job in the entire queue is getting rescheduled at the same time: even those that seem to be running quite happily. Is this the intended behaviour when "rerun TRUE" or '-r y' are set?


>> Was there any entry in the messages file of the qmaster (while "loglevel log_info" is set)?
> Log level was already set to 'log_info', but there is nothing informative in the qmaster 'messages' file.
>> Someone issued `qmod -rj "*"` by accident?
> I don't think so, no.

Just as a note: if someone who has manager right does this, all jobs will be rescheduled.

Anything in the accounting record? Usually there is written one when a job gets rescheduled.

-- Reuti

> Regards
> [NG]
> ------------------------------------------------------
> http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=274775
> To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].


To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].

More information about the gridengine-users mailing list