[Moabusers] Re: Default Nodesets being ignored.

wightman wightman at clusterresources.com
Wed Aug 30 14:04:52 MDT 2006


We have not been able to reproduce this locally.  When a job is
submitted with the nodesets flag it is properly scheduled.  There may be
something unique about your environment that we have not correctly
duplicated, however.

Can you send the output of the following commands, as well as your
moab.cfg file:

mdiag -n -v
mdiag -t

Thanks,

- Douglas

On Mon, 2006-08-28 at 16:27 -0400, Justin Bronder wrote:
> We are attempting to use default nodesets on a per-queue basis to get
> certain queues to favor certain machines.  The following output should
> clarify the issue.  This method used to work in patch 4.  We are
> currently using patch 7.
> 
> 
> jbronder@panopticon:~/src$ qsub -I -l nodes=1:ppn=1 -l walltime=25:00
> -q darwin-admin -A systemTest -l nodeset=oneof:feature:31
> qsub: waiting for job 1062.echelon.acrl.clusters.umaine.edu to start
> qsub: job 1062.echelon.acrl.clusters.umaine.edu ready
> jbronder@node128 ~ $
> 
> So I'm on node128, however:
> 
> root@echelon:/usr/src/moab-4.5.0p7/include# checknode node128
> node node128
> 
> State:   Running  (in current state for 00:00:26)
> Expected State:     Idle   SyncDeadline: Mon Aug 28 16:33:34
> Configured Resources: PROCS: 2  MEM: 1973M  SWAP: 3979M  DISK: 1M
> Utilized   Resources: PROCS: 1
> Dedicated  Resources: PROCS: 2  MEM: 1973M  SWAP: 3979M  DISK: 1M
> Opsys:      linux     Arch:      ppc64 
> Speed:      1.00      CPULoad:   0.000
> Network Load: 0.01 kB/s
> Flags:      rmdetected
> Network:    DEFAULT
> Features:   pRACK1,RACK1,17
> Attributes: [Batch]
> Classes:    [default 2:2][linux-spool 2:2][darwin-spool
> 2:2][linux-batch 2:2][darwin-batch 2:2][linux-admin 2:2][darwin-admin
> 1:2][kearney 2:2]
> RM[base]:   TYPE=PBS
> NodeAccessPolicy: SINGLEJOB
> 
> Total Time: 5:21:17:12  Up: 3:19:29:44 (64.76%)  Active: 00:02:05
> (0.02%)
> 
> Reservations:
>   1062x1  Job:Running  -00:00:56 -> 00:24:04 (00:25:00)
> Jobs:        1062
> ALERT:  node has 2 procs dedicated but load is low (0.000)
> 
> As you can see, node128 does not have the 31 feature.  There are plenty
> of nodes (20, to be exact) that do have this feature, and checknode
> confirms that they are available.  Is there something else that needs
> to be done to force the nodesets to be used?
> 
> Just in case it matters, here is the checkjob output:
> 
> root@echelon:/usr/src/moab-4.5.0p7/include# checkjob 1062
> job 1062
> 
> AName: STDIN
> State: Running 
> Creds:  user:jbronder  group:clusteradmin  account:systemTest
> class:darwin-admin
> WallTime:   00:02:42 of 00:25:00
> SubmitTime: Mon Aug 28 16:23:34
>   (Time Queued  Total: 00:00:00  Eligible: -00:00:01)
> 
> StartTime: Mon Aug 28 16:23:34
> Total Requested Tasks: 1
> 
> Req[0]  TaskCount: 1  Partition: RACK1
> Memory >= 0  Disk >= 0  Swap >= 0
> Opsys:   ---  Arch: ---  Features: ---
> NodeSet=ONEOF:FEATURE:31
> 
> Allocated Nodes:
> [node128:1]
> 
> 
> StartCount:     1
> StartPriority:  2
> Reservation '1062' (-00:02:47 -> 00:22:13  Duration: 00:25:00)
> 
> 
> 
> Thanks,
> Justin.
> 
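
The mismatch described in the quoted message (the job requests
NodeSet=ONEOF:FEATURE:31, but the allocated node lists only
pRACK1,RACK1,17) can be checked mechanically. The following is a rough
sketch, not part of Moab: the helper names node_features and
satisfies_nodeset are hypothetical, and it assumes the checknode output
carries a single "Features:" line with comma-separated values, as in the
output quoted above.

```python
def node_features(checknode_output):
    """Return the set of features on the 'Features:' line of checknode output.

    Assumes one 'Features:' line with comma-separated values, as in the
    checknode output quoted in this thread.
    """
    for line in checknode_output.splitlines():
        # Drop any mail-quoting prefix ("> ") and surrounding whitespace.
        line = line.lstrip("> ").strip()
        if line.startswith("Features:"):
            value = line[len("Features:"):].strip()
            return {f.strip() for f in value.split(",") if f.strip()}
    return set()


def satisfies_nodeset(checknode_output, feature):
    """True if the node lists the feature named in a ONEOF:FEATURE nodeset."""
    return feature in node_features(checknode_output)


# Excerpt of the checknode output for node128 from the quoted message.
sample = """\
node node128

Opsys:      linux     Arch:      ppc64
Features:   pRACK1,RACK1,17
Attributes: [Batch]
"""

print(satisfies_nodeset(sample, "31"))  # False: node128 lacks feature 31
```

Running the same check over the checknode output of every allocated node
would show whether any job landed outside its requested nodeset.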


