[Moabusers] Standing Reservation for GRES devices
Gareth.Williams at csiro.au
Gareth.Williams at csiro.au
Mon Jan 26 18:38:36 MST 2009
> -----Original Message-----
> From: moabusers-bounces at supercluster.org [mailto:moabusers-
> bounces at supercluster.org] On Behalf Of Derek Gottlieb
> Sent: Friday, 23 January 2009 6:45 AM
> To: moabusers at supercluster.org
> Subject: [Moabusers] Standing Reservation for GRES devices
>
> I've got a few GPUs in one of our clusters and I'm trying to figure out
> how to set up a reservation that would give GPU jobs priority access to
> those GPUs and associated CPUs. I currently have 2 GPUs per node
> attached to nodes with 8 CPU cores and 64GB of memory. Is there a way
> in Moab to specify that I'd like to reserve 2 CPUs and 16GB of memory on
> each node that has GPUs and keep these reserved for jobs that explicitly
> request GPUs?
>
> Here's what I've been starting with:
>
> NODETOJOBATTRMAP gpu
> SRCFG[gpusr] HOSTLIST=^gpuhost1$,^gpuhost2$
> SRCFG[gpusr] PERIOD=INFINITY
> SRCFG[gpusr] JOBATTRLIST=gpu
> #SRCFG[gpusr] TASKCOUNT=2 RESOURCES=PROCS:1,MEM:8192
> #SRCFG[gpusr] TASKCOUNT=4 RESOURCES=PROCS:1,MEM:8192
>
> NODECFG[gpuhost1] FEATURES+=gpu GRES=tesla:2
> NODECFG[gpuhost2] FEATURES+=gpu GRES=tesla:2
>
>
> Problem is I'm not real sure how to reserve 2 CPUs on each of these
> nodes without setting up multiple SRs. I would think if I put in the
> TASKCOUNT=4 line that I'd end up with 4 CPUs on one node and not reserve
> anything on the other. I'd like to only reserve 1 CPU/1 GPU as opposed
> to reserving all CPUS on my GPU nodes for GPU jobs and leave them
> largely idle.
>
> Any input would be appreciated,
> Derek
Hi Derek,
Our group may have a similar requirement within the next year.
It would be nice if one could add a GRES:tesla requirement to the resources
Perhaps something like:
SRCFG[gpusr] TASKCOUNT=4 RESOURCES=PROCS:1,MEM:8192,GRES:tesla:1
However the documentation does not indicate that such a thing is possible. You could make a feature request to cluster resources.
I guess you could set up SRs for each node separately:
SRCFG[gpusr1] HOSTLIST=^gpuhost1$
SRCFG[gpusr1] JOBATTRLIST=gpu
SRCFG[gpusr1] TASKCOUNT=2 RESOURCES=PROCS:1,MEM:8192
SRCFG[gpusr2] HOSTLIST=^gpuhost2$
SRCFG[gpusr2] JOBATTRLIST=gpu
SRCFG[gpusr2] TASKCOUNT=2 RESOURCES=PROCS:1,MEM:8192
.
.
.
Ugly but perhaps workable!
Cheers,
Gareth Williams
Outreach Manager
CSIRO IM&T - ASC
Ph +61 3 8601 3804 (CSIRO)
Ph +61 3 9669 8114 (BoM)
http://intra.hpsc.csiro.au
calendar: http://tinyurl.com/3mkl5d
> _______________________________________________
> moabusers mailing list
> moabusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/moabusers
More information about the moabusers
mailing list