[GE users] qmaster not starting

sangamesh forum.san at gmail.com
Wed Aug 12 06:59:07 BST 2009


Hello all,

       Is there a solution for the error described below that avoids reinstalling Grid Engine?

On Mon, Aug 10, 2009 at 12:57 AM, Sangamesh B <forum.san at gmail.com> wrote:
Dear SGE users,

   We have been using Grid Engine 6.2u2 on Rocks 5.1 for a long time.
Now it suddenly fails to start. The errors are as follows:

# echo $SGE_DEBUG_LEVEL
2 0 0 0 0 0 0 0

# /etc/init.d/sgemaster start
   starting sge_qmaster
     0   7544 46991528010816     ****** starting localization procedure ... **********
     1   7544 46991528010816     could not get environment variable "GRIDPACKAGE"
     2   7544 46991528010816     could not get environment variable "GRIDLOCALEDIR"
     3   7544 46991528010816     setlocale() returns "en_US.iso885915"
     4   7544 46991528010816     cutting of language string after "_":
     5   7544 46991528010816     locale directory: >/opt/n1ge62/locale<
     6   7544 46991528010816     package file:     >lx24-amd64/gridengine.mo<
     7   7544 46991528010816     language (LANG):  >en<
     8   7544 46991528010816     loading message file: /opt/n1ge62/locale/en/LC_MESSAGES/lx24-amd64/gridengine.mo
     9   7544 46991528010816     could not open message file - error
    10   7544 46991528010816     setlocale() returns "en_US.iso885915"
    11   7544 46991528010816     bindtextdomain() returns "/opt/n1ge62/locale"
    12   7544 46991528010816     textdomain() returns "lx24-amd64/gridengine"
    13   7544 46991528010816     error id output     : disabled
    14   7544 46991528010816     ****** starting localization procedure ... failed  **

sge_qmaster didn't start!
Please check the messages file

#

# echo $SGE_QMASTER_PORT
538
# echo $SGE_EXECD_PORT
539
# echo $SGE_CELL
default62
# cat /tmp/sge_messages

08/07/2009 12:39:11|  main|master|C|abort qmaster startup due to communication errors
08/07/2009 12:46:35|  main|master|C|abort qmaster startup due to communication errors
08/07/2009 12:55:42|  main|master|C|abort qmaster startup due to communication errors
08/07/2009 16:50:56|  main|master|C|abort qmaster startup due to communication errors
08/10/2009 09:31:37|  main|master|C|abort qmaster startup due to communication errors
08/10/2009 09:45:43|  main|master|C|abort qmaster startup due to communication errors
08/10/2009 09:47:33|  main|master|C|abort qmaster startup due to communication errors
08/10/2009 09:50:02|  main|master|C|abort qmaster startup due to communication errors

Please let us know how to resolve this issue.

Thank you



