[GE users] Seemingly random node crashes

rayson rayrayson at gmail.com
Fri Apr 23 21:31:15 BST 2010


On 4/23/10, biostat <adam at greenhodge.net> wrote:
> It might be...our head node is the NFS server. But we aren't using the head node as an execution node, so it's not being taxed CPU- or memory- wise. Nor have any of the directories we are writing to come even close to filling up (a df reveals all our drives to be filled to 10%).

Almost filling up the NFS directory is no big deal, *but* doing a lot
of I/Os can kill the NFS server.

Rayson



>
> I forgot to mention it SEEMS to only happen when we run jobs that are owned by root, and I found that our symptoms seem to correlate quite strongly with those found here: http://discussions.apple.com/message.jspa?messageID=6594071#6594071
>
> ------------------------------------------------------
> http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=254657
>
> To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].
>

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=254674

To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].



More information about the gridengine-users mailing list