[GE users] qrsh problem
deadline at basement-supercomputing.com
Thu Aug 26 19:37:36 BST 2010
OS: Scientific Linux 5.4
Hardware: Intel x86_64, GigE
GE version 6.2u5
Problem: I have a smallish cluster and I want to use the head node to run
jobs. When I run parallel jobs, the nodes
will try to use the head node, but they will time out.
Sequential jobs run fine on all nodes (because
the are launched from the head node)
I narrowed it down using qrsh on the worker
"norbert" is head node running sge_qmaster and sge_execd
worker nodes are "n0" an "n2"
More information about the gridengine-users