[Moabusers] Torque/Moab node allocation issue

VS Ang vs_ang at yahoo.com
Sun Dec 2 14:39:13 MST 2007


Hello,

I am looking for some help with Torque/Moab configuration. We are using Torque and Moab on a cluster with nodes having 4 cores each. We are trying to schedule jobs such that the nodes run multiple jobs in shared mode. However, the scheduler is not allocating nodes based on the processors. We specified the following policies in the moab.cfg file:

NODEACCESSPOLICY            SHARED
JOBNODEMATCHPOLICY        EXACTNODE

The Torque queue is configured as below:

create queue batch
set queue batch queue_type = Execution
set queue batch resources_max.nodect = 132
set queue batch resources_max.nodes = 132:ppn=4
set queue batch resources_default.nodect = 1
set queue batch resources_default.nodes = 1:ppn=1
set queue batch resources_default.walltime = 01:00:00
set queue batch enabled = True
set queue batch started = True

Also I configured:
set server node_pack = True

The "server_priv/nodes" file has node entries as below:
blade1 np=4
blade2 np=4
blade3 np=4
blade4 np=4
blade5 np=4
..

However, no matter how the job is submitted, users always get 4 different nodes, as opposed to a single node with 4 CPUs.

>msub -I -l nodes=4
>cat $PBS_NODEFILE
blade1
blade2
blade3
blade4

>msub -l nodes=4,taskdistpolicy=pack

Also, another odd behavior is, if I specify:
>msub -l ncpus=4

I am actually getting all nodes in the cluster, as opposed to 4.

So, what is wrong here? Am I missing any other policy parameters in Torque or Moab?

Thanks in advance!

Srihari

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/moabusers/attachments/20071202/4b6eb8b7/attachment.html


More information about the moabusers mailing list