[GE issues] [Issue 2900] qmaster fail-over results in very slow execd reconnect

crei crei at sun.com
Tue Feb 3 16:07:39 GMT 2009


http://gridengine.sunsource.net/issues/show_bug.cgi?id=2900



User crei changed the following:

                What    |Old value                 |New value
================================================================================
             Assigned to|pollinger                 |crei
--------------------------------------------------------------------------------
                Priority|P2                        |P4
--------------------------------------------------------------------------------
              QA contact|pollinger                 |crei
--------------------------------------------------------------------------------
            Subcomponent|execution                 |communication
--------------------------------------------------------------------------------




------- Additional comments from crei at sunsource.net Tue Feb  3 08:07:37 -0800 2009 -------
I assume the problem is with the connected clients. They don't seem to get a
notification that the connection is no longer available.

After 600 seconds the connection alive timeout might shut down the connection.
This will then result in re-reading the act_qmaster file and a re-connect to
the qmaster host.
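
The timeout behaviour described above can be sketched roughly as follows. This
is only an illustration, not the actual execd or commlib code: the class
QmasterLink, its methods, and the helper read_act_qmaster are hypothetical
names, and it assumes the act_qmaster file holds the current qmaster host name
on its first non-empty line. The 600 second value is the one quoted above.

```python
ALIVE_TIMEOUT_SECONDS = 600  # connection alive timeout quoted in the comment


def read_act_qmaster(path):
    """Return the first non-empty line of the file (assumed: the qmaster host)."""
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            host = line.strip()
            if host:
                return host
    raise ValueError(f"no host name found in {path}")


class QmasterLink:
    """Hypothetical stand-in for a client-side connection to the qmaster."""

    def __init__(self, act_qmaster_path, now):
        self.act_qmaster_path = act_qmaster_path
        self.host = read_act_qmaster(act_qmaster_path)
        self.last_heard = now  # time of the last message from the qmaster

    def message_received(self, now):
        """Record that the peer was heard from, resetting the idle period."""
        self.last_heard = now

    def check(self, now):
        """Return True if the alive timeout fired and the host was re-read."""
        if now - self.last_heard < ALIVE_TIMEOUT_SECONDS:
            return False
        # Timeout: treat the old connection as dead, re-read the file and
        # use whatever host it names now as the re-connect target.
        self.host = read_act_qmaster(self.act_qmaster_path)
        self.last_heard = now
        return True
```

With this model, a client that never receives a notification about the lost
connection keeps pointing at the old host until check() is called at least
600 seconds after the last message, which would match the slow re-connect
seen in this issue.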

Setting Priority to P4 since the execd will eventually re-connect, although it takes some time.

It might be related to the fact that the interface is only shut down with ifconfig.

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=36&dsMessageId=101707



