I’m happy to announce that the cluster is up and running. In these two
days we installed the new switch which should fix most of the problems
we were encountering in the past. The new switch also allows us to
have all nodes up and running.
New things:
– Most of the new c3 nodes are up and running. This means 256 new
slots for the users. Some of them are currently disabled as we have a
small problem with the hardware. You can explicitly submit on the c3
nodes using “#$ -l opteron6128″.
– Since we have more nodes we have introduced a new type of queue, the
high memory queue. The hi_mem queue is running on the c0 nodes and
allows you to run programs that use more than 450MB of memory. The
memory limit for this queue is 960MB(soft)/980MB(hard). There are 32
slots available for the hi_mem queue. You can submit jobs on this
queue using “#$ -l hi_mem” in your scripts. The short and long queues
on the c0s are now disabled. Note that if you don’t specify any queue
(that is, you don’t put any -l in your scripts) the jobs will run in
the first available slot.