[Moabusers] Re: Default Nodesets being ignored.

wightman wightman at clusterresources.com
Mon Aug 28 15:31:03 MDT 2006


Could you file a bug report at support.clusterresources.com?

That will help us track this bug to resolution.

Thanks,

- Douglas

On Mon, 2006-08-28 at 16:27 -0400, Justin Bronder wrote:
> We are attempting to use defaut nodesets on a per queue basis to get
> certain
> queues to favor certain machines.  The following output should clarify
> the issue.
> This method used to work in patch 4.  We are currently using patch 7.
> 
> 
> jbronder at panopticon:~/src$ qsub -I -l nodes=1:ppn=1 -l walltime=25:00
> -q darwin-admin -A systemTest -l nodeset=oneof:feature:31  
> \qsub: waiting for job 1062.echelon.acrl.clusters.umaine.edu to start
> qsub: job 1062.echelon.acrl.clusters.umaine.edu ready
> jbronder at node128 ~ $  
> 
> So I'm on node128, however:
> 
> root at echelon:/usr/src/moab-4.5.0p7/include# checknode node128
> node node128
> 
> State:   Running  (in current state for 00:00:26)
> Expected State:     Idle   SyncDeadline: Mon Aug 28 16:33:34
> Configured Resources: PROCS: 2  MEM: 1973M  SWAP: 3979M  DISK: 1M
> Utilized   Resources: PROCS: 1
> Dedicated  Resources: PROCS: 2  MEM: 1973M  SWAP: 3979M  DISK: 1M
> Opsys:      linux     Arch:      ppc64 
> Speed:      1.00      CPULoad:   0.000
> Network Load: 0.01 kB/s
> Flags:      rmdetected
> Network:    DEFAULT
> Features:   pRACK1,RACK1,17
> Attributes: [Batch]
> Classes:    [default 2:2][linux-spool 2:2][darwin-spool
> 2:2][linux-batch 2:2][darwin-batch 2:2][linux-admin 2:2][darwin-admin
> 1:2][kearney 2:2]
> RM[base]:   TYPE=PBS
> NodeAccessPolicy: SINGLEJOB
> 
> Total Time: 5:21:17:12  Up: 3:19:29:44 (64.76%)  Active: 00:02:05
> (0.02%)
> 
> Reservations:
>   1062x1  Job:Running  -00:00:56 -> 00:24:04 (00:25:00)
> Jobs:        1062
> ALERT:  node has 2 procs dedicated but load is low (0.000)
> 
> As you can see, node128 does not have the 31 feature.  There are
> plenty of nodes
> (20 to be exact) that do have this feature and can be verified to be
> available using
> checknode.  Is there something else that needs to be done to force the
> nodesets to
> be used?
> 
> Just in case it matters, here is the checkjob output:
> 
> root at echelon:/usr/src/moab-4.5.0p7/include# checkjob 1062
> job 1062
> 
> AName: STDIN
> State: Running 
> Creds:  user:jbronder  group:clusteradmin  account:systemTest
> class:darwin-admin
> WallTime:   00:02:42 of 00:25:00
> SubmitTime: Mon Aug 28 16:23:34
>   (Time Queued  Total: 00:00:00  Eligible: -00:00:01)
> 
> StartTime: Mon Aug 28 16:23:34
> Total Requested Tasks: 1
> 
> Req[0]  TaskCount: 1  Partition: RACK1
> Memory >= 0  Disk >= 0  Swap >= 0
> Opsys:   ---  Arch: ---  Features: ---
> NodeSet=ONEOF:FEATURE:31
> 
> Allocated Nodes:
> [node128:1]
> 
> 
> StartCount:     1
> StartPriority:  2
> Reservation '1062' (-00:02:47 -> 00:22:13  Duration: 00:25:00)
> 
> 
> 
> Thanks,
> Justin.
> 



More information about the moabusers mailing list