[GE users] Setup of windows execd against Linux qmaster

sneumann sneumann at ipb-halle.de
Thu Feb 8 16:36:09 GMT 2007


    [ The following text is in the "X-UNKNOWN" character set. ]
    [ Your display is set for the "ISO-8859-10" character set.  ]
    [ Some characters may be displayed incorrectly. ]

Hi,

we have a running n1ge6u9 installation for our linux environment.
I am now trying to add a Windows Execution host. During qmaster
installation I have set up the sgeCA and WIN_DOMAIN_ACCESS="false"

I am somewhat lost in the security settings, so I'll first 
give the error message and then post the configuration.

What is wrong, if the qmaster message log shows 
"02/08/2007 |qmaster|lathan|C|denied: request for user "MSBI" does not match credentials for connection <windows.ipb-sub.ipb-halle.de,execd,1>"

This is just the registration of the execd, no jobs have yet been submitted.

I am probably missing the linkage between the sgeCA certificate business,
and the security provieded by sgepasswd. We're in a closed environment,
so I dont want security to be a hurdle in the setup, I'd be willing 
to disable as much as possible. (The logfile below says security_mode >none<)

Or did I miss something even more obvious ?
What else can I debug ?

Yours,
Steffen

---------------------------------------------------------

Linux/qmaster:
	sge installed and running as user sge, directories read/write by sge.

	Has both a user msbi and MSBI (with same uid and home etc, added in desparation ;-)
	Has an default/common/sgepasswd file with an entry for MSBI
	User msbi and MSBI have a /home/msbi/.sge/port1072/default/private/
	
Windows: 
	No domain, Computername MSBIVMXP, workgroup MSBI, single User MSBI

SFU Mapping maps:
\\MSBIVMXP\Administrator to root,
\\MSBIVMXP\sge to sge 
\\MSBIVMXP\MSBI to msbi

sge_execd process is owned by MSBI 
	local spool directory sge_spool writable by MSBI

---------------------------------------------------------
		
Debugging (according to http://gridengine.sunsource.net/servlets/ReadMsg?list=users&msgNo=16386)
for the sge_execd started on Windows:

     0   1551 1     me.who                      >19<
     1   1551 1     me.sge_formal_prog_name     >execd<
     2   1551 1     me.qualified_hostname       >msbivmxp.ipb-sub.ipb-halle.de<
     3   1551 1     me.unqualified_hostname     >msbivmxp<
     4   1551 1     me.uid                      >197611<
     5   1551 1     me.gid                      >197121<
     6   1551 1     me.daemonized               >0<
     7   1551 1     me.user_name                >MSBI<
     8   1551 1     me.default_cell             >default<
     9   1551 1     sge_root            >/vol/sge<
    10   1551 1     cell_root           >/vol/sge/default<
    11   1551 1     conf_file           >/vol/sge/default/common/bootstrap<
    12   1551 1     bootstrap_file      >/vol/sge/default/common/configuration<
    13   1551 1     act_qmaster_file    >/vol/sge/default/common/act_qmaster<
    14   1551 1     acct_file           >/vol/sge/default/common/accounting<
    15   1551 1     reporting_file      >/vol/sge/default/common/reporting<
    16   1551 1     local_conf_dir      >/vol/sge/default/common/local_conf<
    17   1551 1     shadow_masters_file >/vol/sge/default/common/shadow_masters<
    18   1551 1     admin_user          >sge<
    19   1551 1     default_domain      >none<
    20   1551 1     ignore_fqdn         >true<
    21   1551 1     spooling_method     >berkeleydb<
    22   1551 1     spooling_lib        >libspoolb<
    23   1551 1     spooling_params     >/var/spool/sge/default/spooldb/<
    24   1551 1     binary_path         >/vol/sge/bin<
    25   1551 1     qmaster_spool_dir   >/vol/sge/default/spool/qmaster/<
    26   1551 1     security_mode        >none<
    27   1551 1     ../libs/gdi/sge_any_request.c 509 starting up communication without threads
    28   1551 1     me.qualified_hostname: msbivmxp.ipb-sub.ipb-halle.de
    29   1551 1     secure dummy string: AIMK_SECURE_OPTION_ENABLED
    30   1551 1     re-read actual qmaster file (prepare_enroll)
    31   1551 1     returning port value: 1071
    32   1551 1     qualified hostname: msbivmxp.ipb-sub.ipb-halle.de
    33   1551 1     get_configuration: unique for msbivmxp.ipb-sub.ipb-halle.de: msbivmxp.ipb-sub.ipb-halle.de
    34   1551 1     requesting global and msbivmxp.ipb-sub.ipb-halle.de
    35   1551 1     ../libs/gdi/sge_any_request.c 192 cl_endpoint_list_get_autoclose_mode() [cl_endpoint_list.c/303] initiator thread    => setting autoclose to: 3
    36   1551 1     ../libs/gdi/sge_security.c 135 >>>>>>>>>>>>>>>>>>>>
    37   1551 1     ../libs/gdi/sge_security.c 136 gdi_snd: sending message to lathan.ipb-sub.ipb-halle.de/qmaster/1:
    38   1551 1     ../libs/gdi/sge_security.c 137 gdi_snd: cl_xml_ack_type_t: ack
    39   1551 1     ../libs/gdi/sge_security.c 138 gdi_snd: message tag:       TAG_GDI_REQUEST
    40   1551 1     ../libs/gdi/sge_security.c 140 gdi_snd: message id:        1
    41   1551 1     ../libs/gdi/sge_security.c 144 gdi_snd: send time:         02/08/2007 08:17:07
    42   1551 1     ../libs/gdi/sge_security.c 145 >>>>>>>>>>>>>>>>>>>>
    43   1551 1     ../libs/gdi/sge_security.c 115 <<<<<<<<<<<<<<<<<<<<
    44   1551 1     ../libs/gdi/sge_security.c 116 gdi_rcv: reseived message from lathan.ipb-sub.ipb-halle.de/qmaster/1:
    45   1551 1     ../libs/gdi/sge_security.c 117 gdi_rcv: cl_xml_ack_type_t: nak
    46   1551 1     ../libs/gdi/sge_security.c 118 gdi_rcv: message tag:       TAG_GDI_REQUEST
    47   1551 1     ../libs/gdi/sge_security.c 119 gdi_rcv: message id:        2
    48   1551 1     ../libs/gdi/sge_security.c 120 gdi_rcv: receive time:      02/08/2007 08:17:07
    49   1551 1     ../libs/gdi/sge_security.c 121 <<<<<<<<<<<<<<<<<<<<
    50   1551 1     ../libs/gdi/sge_any_request.c 947 received from: lathan.ipb-sub.ipb-halle.de,1
    51   1551 1     ../libs/gdi/sge_gdi_request.c 1344 can't unpack gdi request
    52   1551 1     ../libs/gdi/sge_gdi_request.c 1209 error unpacking gdi request: bad argument
    53   1551 1     ../libs/gdi/sge_any_request.c 199 cl_commlib_get_endpoint_status() [cl_commlib.c/5975] initiator thread    => waiting for SIRM with id 2
    54   1551 1     ../libs/gdi/sge_any_request.c 199 cl_commlib_get_endpoint_status() [cl_commlib.c/6050] initiator thread    => no SRIM for SIM with id 2
    55   1551 1     ../libs/gdi/sge_any_request.c 199 cl_commlib_get_endpoint_status() [cl_commlib.c/6050] initiator thread    => no SRIM for SIM with id 2
    56   1551 1     ../libs/gdi/sge_any_request.c 199 cl_commlib_get_endpoint_status() [cl_commlib.c/6032] initiator thread    => got SIRM for SIM with id: 2
    57   1551 1     ../libs/gdi/sge_any_request.c 1055 qmaster is still running
    58   1551 1     ../libs/gdi/sge_any_request.c 1060 endpoint is up since 60 seconds and has status 0
    59   1551 1     ../libs/gdi/gdi_conf.c 170 getting configuration: failed receiving gdi request


-- 
IPB Halle                    AG Massenspektrometrie & Bioinformatik
Dr. Steffen Neumann          http://www.IPB-Halle.DE
Weinberg 3                   http://msbi.bic-gh.de
06120 Halle                  New phone number !
                             Tel. +49 (0) 345 5582 - 1470
                                  +49 (0) 345 5582 - 0
sneumann(at)IPB-Halle.DE     Fax. +49 (0) 345 5582 - 1409



    [ Part 2, "This is a digitally signed message part" ]
    [ Application/PGP-SIGNATURE (Name: "signature.asc") 198 bytes. ]
    [ Unable to print this part. ]



More information about the gridengine-users mailing list