[GE users] sge 6.2u4 issue on Windows execution host

kostikbel kostikbel at ukr.net
Fri Dec 18 10:51:05 GMT 2009


We have an SGE 6.2u4 installation that includes a group of Windows XP
SP2/SFU 3.5 hosts. Quite often, queues go into the "E" (error) state. The
sge_execd log contains the following:

12/14/2009 16:33:09|  main|avexec4|E|ERROR: unlinking "jobs/00/0002/9728.1": Device busy
12/14/2009 16:33:09|  main|avexec4|E|can not remove file job spool file: jobs/00/0002/9728.1
12/14/2009 16:33:09|  main|avexec4|E|can't remove directory "active_jobs/29728.1": opendir(active_jobs/29728.1) failed: No such file or directory
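
To see how often this happens and on which hosts, the relevant entries can be pulled out of the execd log with a short script. This is only a minimal sketch: it assumes every log line follows the pipe-delimited layout visible in the excerpt above (date time|thread|host|level|message), and the function names are made up for illustration, not part of SGE.

```python
import re
from collections import Counter

# Layout assumed from the excerpt above: date time|thread|host|level|message
LINE_RE = re.compile(
    r"^(?P<date>\d{2}/\d{2}/\d{4}) (?P<time>\d{2}:\d{2}:\d{2})\|"
    r"\s*(?P<thread>[^|]*)\|(?P<host>[^|]*)\|(?P<level>[A-Z])\|(?P<msg>.*)$"
)

def parse_line(line):
    """Split one log line into named fields; return None if it does not match."""
    m = LINE_RE.match(line.rstrip("\n"))
    return m.groupdict() if m else None

def busy_errors(lines):
    """Return parsed level-E entries whose message mentions 'Device busy'."""
    hits = []
    for line in lines:
        entry = parse_line(line)
        if entry and entry["level"] == "E" and "Device busy" in entry["msg"]:
            hits.append(entry)
    return hits

def count_by_host(entries):
    """Count matching entries per execution host."""
    return Counter(entry["host"] for entry in entries)
```

busy_errors() accepts any iterable of lines, so an open file handle for the execd log can be passed directly, and count_by_host() then shows whether the errors cluster on particular hosts.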

The admin gets a notification by email (this one is for a different job ID,
shown just for illustration):
failed assumedly before job:can't open jobs/00/0002/.2808.1 for writing of job: Device busy

Note the "Device busy" part. I searched for "Device busy" in connection with
SFU and found http://www.suacommunity.com/forum/tm.aspx?m=5580&mpage=3#16800
but it does not seem to help.

Any idea what is going on here? What information should I gather to
diagnose the problem?

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=234075
