[GE users] Message "can't register at "qmaster"

Vertley Hopson v.v.hopson at larc.nasa.gov
Fri Aug 25 19:13:31 BST 2006


Could you please help us understand why we are getting the following
message, which is preventing SGE from accepting any more jobs?

  can't register at "qmaster": unable to send message to qmaster using
  port 1808 on host "tabitha.larc.nasa.gov"

We have to kill the SGE processes and restart them to get going again.
The problem seems to occur at random.

Thanks

>Subject: sge_sgi_tabitha
>
>
>05/18/2006 06:03:26|execd|tabitha|I|starting up 6.0u7 (csp)
>05/20/2006 09:27:28|execd|tabitha|I|starting up 6.0u7 (csp)
>05/23/2006 08:51:12|execd|tabitha|I|starting up 6.0u7 (csp)
>05/23/2006 15:36:15|execd|tabitha|E|commlib error: ssl accept handshake 
>timeout (ssl accept timeout for client "princess.larc.nasa.gov")
>05/23/2006 15:53:00|execd|tabitha|E|commlib error: ssl accept error (ssl 
>accept error for client "princess.larc.nasa.gov")
>05/23/2006 15:53:00|execd|tabitha|E|commlib error: ssl error 
>([ID=336027900] in module "SSL routines": "unknown protocol")
>05/23/2006 15:53:42|execd|tabitha|E|commlib error: ssl accept handshake 
>timeout (ssl accept timeout for client "princess.larc.nasa.gov")
>05/26/2006 10:46:32|execd|tabitha|I|starting up 6.0u7 (csp)
>06/22/2006 13:48:05|execd|tabitha|I|controlled shutdown 6.0u7 (csp)
>06/22/2006 13:48:38|execd|tabitha|I|starting up 6.0u7 (csp)
>06/27/2006 11:13:17|execd|tabitha|E|commlib error: ssl accept handshake 
>timeout (ssl accept timeout for client "bullsi.larc.nasa.gov")
>06/27/2006 11:40:10|execd|tabitha|E|commlib error: ssl accept error (ssl 
>accept error for client "bullsi.larc.nasa.gov")
>06/27/2006 11:40:10|execd|tabitha|E|commlib error: ssl error 
>([ID=336027900] in module "SSL routines": "unknown protocol")
>06/27/2006 11:40:50|execd|tabitha|E|commlib error: ssl accept handshake 
>timeout (ssl accept timeout for client "bullsi.larc.nasa.gov")
>06/29/2006 11:10:52|execd|tabitha|W|can't register at "qmaster": unable to 
>send message to qmaster using port 1808 on host "tabitha.larc.nasa.gov": 
>got message ackno
>06/30/2006 13:22:34|execd|tabitha|E|commlib error: got read timeout 
>(closing "tabitha.larc.nasa.gov/qmaster/1")
>06/30/2006 13:22:35|execd|tabitha|E|commlib error: can't connect to 
>service (Connection refused)
>06/30/2006 13:22:35|execd|tabitha|I|controlled shutdown 6.0u7 (csp)
>06/30/2006 13:23:35|execd|tabitha|I|starting up 6.0u7 (csp)
>07/10/2006 20:23:49|execd|tabitha|W|can't register at "qmaster": unable to 
>send message to qmaster using port 1808 on host "tabitha.larc.nasa.gov": 
>got message ackno
>07/11/2006 09:16:14|execd|tabitha|I|starting up 6.0u7 (csp)
>07/12/2006 15:13:22|execd|tabitha|I|starting up 6.0u7 (csp)
>07/18/2006 09:24:31|execd|tabitha|I|starting up 6.0u7 (csp)
>08/01/2006 02:13:29|execd|tabitha|W|can't register at "qmaster": unable to 
>send message to qmaster using port 1808 on host "tabitha.larc.nasa.gov": 
>got message ackno
>08/01/2006 08:12:08|execd|tabitha|I|starting up 6.0u7 (csp)
>08/17/2006 15:09:22|execd|tabitha|W|can't register at "qmaster": unable to 
>send message to qmaster using port 1808 on host "tabitha.larc.nasa.gov": 
>got message ackno
>08/18/2006 03:50:48|execd|tabitha|I|starting up 6.0u7 (csp)
>08/24/2006 11:20:58|execd|tabitha|W|can't register at "qmaster": unable to 
>send message to qmaster using port 1808 on host "tabitha.larc.nasa.gov": 
>got message ackno
>08/24/2006 11:56:43|execd|tabitha|I|starting up 6.0u7 (csp)
>08/24/2006 11:59:36|execd|tabitha|I|controlled shutdown 6.0u7 (csp)
>08/24/2006 12:01:12|execd|tabitha|I|starting up 6.0u7 (csp)
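
To see how often the registration warning appears relative to the restarts, a log excerpt like the one above could be summarized with a short script. This is only a rough sketch: it assumes each entry has the form date time|daemon|host|level|message, as the lines above suggest, and that any line not starting with a timestamp is a wrapped continuation of the previous entry. The names parse_execd_log and sample are made up for illustration and are not part of SGE.

```python
import re
from collections import Counter
from datetime import datetime

# Assumption: an entry starts with "MM/DD/YYYY HH:MM:SS|". Anything else is
# treated as a continuation of the previous entry (mail-wrapped long lines).
ENTRY_START = re.compile(r"^\d{2}/\d{2}/\d{4} \d{2}:\d{2}:\d{2}\|")


def parse_execd_log(text):
    """Return a list of (timestamp, daemon, host, level, message) tuples."""
    entries = []
    for raw in text.splitlines():
        line = raw.lstrip(">").strip()  # drop the mail quote marker, if any
        if not line:
            continue
        parts = line.split("|", 4)
        if ENTRY_START.match(line) and len(parts) == 5:
            stamp, daemon, host, level, message = parts
            when = datetime.strptime(stamp, "%m/%d/%Y %H:%M:%S")
            entries.append([when, daemon, host, level, message])
        elif entries:
            # Re-join a wrapped line onto the previous entry's message.
            entries[-1][4] += " " + line
    return [tuple(e) for e in entries]


# A few lines copied from the quoted log, wrapping included.
sample = """\
>06/27/2006 11:40:50|execd|tabitha|E|commlib error: ssl accept handshake
>timeout (ssl accept timeout for client "bullsi.larc.nasa.gov")
>06/29/2006 11:10:52|execd|tabitha|W|can't register at "qmaster": unable to
>send message to qmaster using port 1808 on host "tabitha.larc.nasa.gov":
>got message ackno
>06/30/2006 13:22:35|execd|tabitha|I|controlled shutdown 6.0u7 (csp)
>06/30/2006 13:23:35|execd|tabitha|I|starting up 6.0u7 (csp)
"""

entries = parse_execd_log(sample)
levels = Counter(e[3] for e in entries)  # entries per level letter
register_failures = [e[0] for e in entries if "can't register" in e[4]]

print(levels)
for when in register_failures:
    print("registration warning at", when)
```

Run against the full messages file, the list of warning timestamps can be compared with the "starting up" entries to check whether every warning is followed by a manual restart.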


________________________________________________________________________

Vertley V. Hopson                       | Phone: (757) 864-7446
SAIC/Atmospheric Sciences Data Center   | Fax:   (757) 864-8807
NASA/LaRC - Mail Stop 157D              | E-mail: v.v.hopson at larc.nasa.gov
2 S. Wright St., Bld 1268C, Rm 2307     |
Hampton, VA 23681-2199                  | URL: http://eosweb.larc.nasa.gov
_________________________________________________________________________


