[GE users] implementation opinion / suggestions

jching jching at bbn.com
Wed Jan 27 19:51:34 GMT 2010


We are currently planning our next SGE implementation and would like the community's opinion on local BDB vs. RPC BDB. The setup will be ~2000 cores (500 nodes) with a combination of short and long jobs that will run in the queue <insert approximate # of jobs here>.

After reviewing some of the valuable performance data provided by Mark Dixon in a previous post, it looks like local BDB gives a significant performance gain over RPC BDB, but the RPC BDB option gives us an additional failover option with the shadow master. We would love to hear any opinions or experience people have. We also have a few questions for the large-cluster (200+ nodes) community:

1. What is your implementation? (Local or remote BDB with shadow master? Type of physical hardware? Network?)
2. How many nodes?
3. Types of jobs? (short or long runtimes)
4. Any performance issues?
5. Do you run DRBD?

Thanks in advance for any valuable feedback!

