[Moabusers] Default Nodesets being ignored.

Justin Bronder jsbronder at gmail.com
Mon Aug 28 14:27:21 MDT 2006


We are attempting to use defaut nodesets on a per queue basis to get certain
queues to favor certain machines.  The following output should clarify the
issue.
This method used to work in patch 4.  We are currently using patch 7.


jbronder at panopticon:~/src$ qsub -I -l nodes=1:ppn=1 -l walltime=25:00 -q
darwin-admin -A systemTest -l nodeset=oneof:feature:31
\qsub: waiting for job 1062.echelon.acrl.clusters.umaine.edu to start
qsub: job 1062.echelon.acrl.clusters.umaine.edu ready
jbronder at node128 ~ $

So I'm on node128, however:

root at echelon:/usr/src/moab-4.5.0p7/include# checknode node128
node node128

State:   Running  (in current state for 00:00:26)
Expected State:     Idle   SyncDeadline: Mon Aug 28 16:33:34
Configured Resources: PROCS: 2  MEM: 1973M  SWAP: 3979M  DISK: 1M
Utilized   Resources: PROCS: 1
Dedicated  Resources: PROCS: 2  MEM: 1973M  SWAP: 3979M  DISK: 1M
Opsys:      linux     Arch:      ppc64
Speed:      1.00      CPULoad:   0.000
Network Load: 0.01 kB/s
Flags:      rmdetected
Network:    DEFAULT
Features:   pRACK1,RACK1,17
Attributes: [Batch]
Classes:    [default 2:2][linux-spool 2:2][darwin-spool 2:2][linux-batch
2:2][darwin-batch 2:2][linux-admin 2:2][darwin-admin 1:2][kearney 2:2]
RM[base]:   TYPE=PBS
NodeAccessPolicy: SINGLEJOB

Total Time: 5:21:17:12  Up: 3:19:29:44 (64.76%)  Active: 00:02:05 (0.02%)

Reservations:
  1062x1  Job:Running  -00:00:56 -> 00:24:04 (00:25:00)
Jobs:        1062
ALERT:  node has 2 procs dedicated but load is low (0.000)

As you can see, node128 does not have the 31 feature.  There are plenty of
nodes
(20 to be exact) that do have this feature and can be verified to be
available using
checknode.  Is there something else that needs to be done to force the
nodesets to
be used?

Just in case it matters, here is the checkjob output:

root at echelon:/usr/src/moab-4.5.0p7/include# checkjob 1062
job 1062

AName: STDIN
State: Running
Creds:  user:jbronder  group:clusteradmin  account:systemTest
class:darwin-admin
WallTime:   00:02:42 of 00:25:00
SubmitTime: Mon Aug 28 16:23:34
  (Time Queued  Total: 00:00:00  Eligible: -00:00:01)

StartTime: Mon Aug 28 16:23:34
Total Requested Tasks: 1

Req[0]  TaskCount: 1  Partition: RACK1
Memory >= 0  Disk >= 0  Swap >= 0
Opsys:   ---  Arch: ---  Features: ---
NodeSet=ONEOF:FEATURE:31

Allocated Nodes:
[node128:1]


StartCount:     1
StartPriority:  2
Reservation '1062' (-00:02:47 -> 00:22:13  Duration: 00:25:00)



Thanks,
Justin.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/moabusers/attachments/20060828/62d0a901/attachment.html


More information about the moabusers mailing list