[Hedeby users] Re: [GE users] SDM issues

cbyun cbyun at ll.mit.edu
Fri Jul 24 20:56:26 BST 2009


Hi Richard and Torsten,

I have removed the resources from GE and spare_pool services.
Now I see there is no alarm in the resource.  However, I am still having difficulties to register the hosts.

Is the dot (.) allowed in the hostname?
My blade machines has their hostname as "blade-<number>-<number>.local"

See more details below:

# sdmadm sr
service id              state    type flags usage annotation
------------------------------------------------------------
power   blade-0-0.local ASSIGNED host       2
        blade-0-1.local ASSIGNED host       2
        blade-0-2.local ASSIGNED host       2
        blade-0-3.local ASSIGNED host       2
        blade-0-4.local ASSIGNED host       2
        blade-0-5.local ASSIGNED host       2
        blade-0-6.local ASSIGNED host       2
        blade-0-7.local ASSIGNED host       2
        blade-0-8.local ASSIGNED host       2

# sdmadm sslo
service    slo                 quantity urgency request
--------------------------------------------------------------------------------------------------
gesvc2     fixed_usage         0        0       SLO has no needs
           maxPendingJobs      0        0       SLO has no needs
power      PermanentRequestSLO 10       2       type = "host" & owner = "power"
spare_pool PermanentRequestSLO 5        1       type = "host"


However, I am still getting the following error in the cs_vm-0.log (BTW, I am using simple installation option)

07/24/2009 13:37:51|680|vice.impl.cloud.CloudSnapshot.checkCloudState|W|Service power:Problem: VPN server is corrupted! Registered but server-less resources: [[hostname: blade-0-0.local, instanceId: i-blade-0-0, launchTime: 2009-07-21T09:56:03.000Z] , ...

One thing I noticed is that, although the instanceID is supposed to be i-<hostname>, what is shown from the log is that it cuts out the ".local". It says: instanceId: i-blade-0-0

Is this an issue?

I turned all hosts off. Also I stopped and restarted the power cloud service and got the following error:

07/24/2009 14:45:34|703|.cloud.CloudServiceAdapterImpl.doStartService|I|Service power:Started cloud service adapter.
07/24/2009 14:45:35|704|.grm.util.EventListenerSupport$Worker.deliver|E|Event delivery problem: Timer already cancelled.
07/24/2009 14:49:35|705|vice.impl.cloud.CloudSnapshot.checkCloudState|W|Service power:The registered set of cloud host does not match the reported set! Registered mismatches [[hostname: blade-0-9.local, instanceId: i-blade-0-9, launchTime: 2009-07-21T09:56:03.000Z] , [hostname: blade-0-6.local, instanceId: i-blade-0-6, launchTime: 2009-07-21T09:56:03.000Z] ]. Reported mismatches []
07/24/2009 14:49:35|705|vice.impl.cloud.CloudSnapshot.checkCloudState|W|Service power:Problem: VPN server is corrupted! Registered but server-less resources: [[hostname: blade-0-6.local, instanceId: i-blade-0-6, launchTime: 2009-07-21T09:56:03.000Z] , [hostname: blade-0-9.local, instanceId: i-blade-0-9, launchTime: 2009-07-21T09:56:03.000Z] ].
07/24/2009 14:49:35|705|e.impl.cloud.CloudResourceAutoRecoverTask.run|W|Service power:Case NOT_REPORTED__NOT_REGISTERED__RESOURCE: This should not happen! The resource NOT_REPORTED__NOT_REGISTERED__RESOURCE does not seem to be a cloud resource at all. It is unknown to the cloud and not registered by the cloud service adapter! Please check your configuration!

And the sdmadm sr  shows:

# sdmadm sr
service id              state type flags usage annotation                                                                                                  
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
power   blade-0-8.local ERROR host       2     Service power:Resource does not seem to be a cloud resource! It is unknown to the cloud and not registered by the cloud service adapter!


Thanks,
- Chansup

------------------------------------------------------
http://gridengine.sunsource.net/ds/viewMessage.do?dsForumId=38&dsMessageId=209387

To unsubscribe from this discussion, e-mail: [users-unsubscribe at gridengine.sunsource.net].



More information about the gridengine-users mailing list