[GE issues] [Issue 2900] qmaster fail-over results in very slow execd reconnect

crei crei at sun.com
Tue Feb 3 16:07:39 GMT 2009


User crei changed the following:

                What    |Old value                 |New value
             Assigned to|pollinger                 |crei
                Priority|P2                        |P4
              QA contact|pollinger                 |crei
            Subcomponent|execution                 |communication

------- Additional comments from crei at sunsource.net Tue Feb  3 08:07:37 -0800 2009 -------
I assume the problem is with the connected clients. They don't seem to get a
notification that the connection is no longer available.

After 600 seconds the connection alive timeout might shut down the connection.
That in turn will result in re-reading the act_qmaster file and re-connecting
to the qmaster host.
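
A rough illustration of the sequence described above, not the actual
execd/commlib code: once nothing has been heard on the connection for longer
than the alive timeout, the client drops it, re-reads the act_qmaster file and
uses the host name found there as the new target. The function names, the
plain "last heard" timestamp and the one-host-name-per-file layout are
assumptions made for this sketch; only the 600 second figure comes from the
comment itself.

```python
ALIVE_TIMEOUT_S = 600  # value quoted in the comment; treated as a constant here


def read_act_qmaster(path):
    """Return the first non-empty line of the file as the qmaster host name.

    Assumes the file simply holds the current master's host name.
    """
    with open(path) as fh:
        for line in fh:
            host = line.strip()
            if host:
                return host
    raise ValueError("no host name found in " + path)


def connection_timed_out(last_heard, now, timeout=ALIVE_TIMEOUT_S):
    """True once the peer has been silent for longer than the timeout."""
    return now - last_heard > timeout


def next_target(current_host, last_heard, now, act_qmaster_path):
    """Pick the host to talk to next (hypothetical helper).

    While the connection still counts as alive, keep the current host.
    After the timeout, re-read the file so a fail-over is picked up.
    """
    if connection_timed_out(last_heard, now):
        return read_act_qmaster(act_qmaster_path)
    return current_host
```

With this model a client that was last contacted at t=0 keeps using the old
host until t exceeds 600 seconds, which matches the slow re-connect seen in
this issue.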

Setting Priority to P4 since the execd will eventually re-connect, although it takes some time.

It might be related to the fact that the interface is only shut down with ifconfig.

