<div>The processors allocated when request 'nodes=1:ppn=3' is right, both when use maui 3.3.1 and 3.2.6p21.</div><div> </div><div>I try to add "JOBNODEMATCHPOLICY EXACTNODE" in maui 3.3.1 config file, but the processor allocation for "procs" syntax is still one. I compare the config file of maui 3.3.1 and maui 3.2.6p21, and see nothing is different.</div>
<div> </div><div>Probably I should use maui 3.2.6p21 for the moment if i want to submit job with "procs" syntax.</div><div> </div><div>BTW, i'm concerning the strength and weakness of the usage of "procs", since I don't want to care about the hardware configuration and its current usage, maybe this laziness is at some cost of performance degradation.</div>
<div> </div><div>Thanks for you Gustavo!</div><div> </div><div>Xiangqian</div><div> </div><div> </div><div class="gmail_quote">2012/1/16 Gustavo Correa <span dir="ltr"><<a href="mailto:gus@ldeo.columbia.edu">gus@ldeo.columbia.edu</a>></span><br>
<blockquote style="margin:0px 0px 0px 0.8ex;padding-left:1ex;border-left-color:rgb(204,204,204);border-left-width:1px;border-left-style:solid" class="gmail_quote">PS - Hi Xiangqian.<br>
<br>
Maybe you need to add this line to your maui.cfg [and restart maui],<br>
for the 'proc=Z' syntax to work as you expect:<br>
<br>
JOBNODEMATCHPOLICY EXACTNODE<br>
<br>
I *think* the default is<br>
<br>
JOBNODEMATCHPOLICY EXACTPROC<br>
<br>
which expects your node to have the exact number of processors you requested [i.e. 3].<br>
<br>
See appendix F of the Maui Admininstrator Guide for details.<br>
<br>
I am not sure, but my recollection is that somebody reported a problem similar to yours<br>
in the list before, and the solution suggested was this one.<br>
<div class="im HOEnZb"><br>
I hope this helps,<br>
Gus Correa<br>
<br>
</div><div class="HOEnZb"><div class="h5">On Jan 16, 2012, at 10:21 AM, Gustavo Correa wrote:<br>
<br>
> Hi Xiangqian<br>
><br>
> For what it is worth, I use Maui 3.2.6p21, and I don't have the problem you described.<br>
> I don't know the behavior in Maui 3.3.1, but as you reported 3.2.6p1 also works correctly for you,<br>
> with the nodes-1:ppn=3 syntax.<br>
> I am happy with 3.2.6p21.<br>
><br>
> There is still a chance that a change in maui.cfg 3.3.1 may fix this glitch,<br>
> but I don't know what it would be. Most likely it has to do with the node allocation policies,<br>
> and how it translates 'procs' into nodes and ppn.<br>
> Somebody else more savvy in the list may clarify this point.<br>
><br>
> I confess I prefer the more detailed syntax 'nodes=X:ppn=Y',<br>
> because it specifies more detail about the resources you are requesting,<br>
> and apparently avoids the issue that hit you.<br>
><br>
> Have you tried the 'nodes=1:ppn=3' syntax in Maui 3.3.1?<br>
> I wonder if it would work there too.<br>
><br>
> I hope this helps,<br>
> Gus Correa<br>
><br>
><br>
> On Jan 16, 2012, at 1:43 AM, Xiangqian Wang wrote:<br>
><br>
>> thanks, Gustavo<br>
>><br>
>> sorry for the misspelling in the previous email, i recheck it and correct it as following:<br>
>><br>
>> i tested torque 2.5.8 and maui 3.3.1 on a centos 6.0 node, the job script is:<br>
>><br>
>> #!/bin/sh<br>
>> #PBS -N procsjob<br>
>> #PBS -l procs=3<br>
>> #PBS -q batch<br>
>> ping localhost -c 100<br>
>><br>
>> and qstat output "exec_host = snode02/0".<br>
>> i replace with the new job script, as<br>
>><br>
>> #!/bin/sh<br>
>> #PBS -N procsjob<br>
>> #PBS -l nodes=1:ppn=3<br>
>> #PBS -q batch<br>
>> ping localhost -c 100<br>
>> and qstat output "exec_host = snode02/2+snode02/1+snode02/0".<br>
>><br>
>> i change maui 3.3.1 to maui 3.2.6p21 and test again, qstat output "exec_host = snode02/2+snode02/1+snode02/0" for both script. maybe it's a maui 3.3.1 problem?<br>
>><br>
>><br>
>> 2012/1/14 Gustavo Correa <<a href="mailto:gus@ldeo.columbia.edu">gus@ldeo.columbia.edu</a>><br>
>> Hi Xiangqian<br>
>><br>
>> Is it a typo in your email or did you comment out this line in your Torque/PBS script?<br>
>> [Note the double hash ##.]<br>
>><br>
>>> ##PBS -l procs=3<br>
>><br>
>> Have you tried this form instead?<br>
>><br>
>> #PBS -l nodes=1:ppn=3<br>
>><br>
>> For more details check 'man qsub' and 'man pbs_resources'.<br>
>><br>
>> I hope it helps,<br>
>> Gus Correa<br>
>><br>
>> On Jan 13, 2012, at 4:10 AM, Xiangqian Wang wrote:<br>
>><br>
>>> my demo torque+maui cluster has one node with np=4 set fot it. i want to submit a job requesting 3 processors, but when it start to run, i see only one processor is used (qstat shows "exec_host = snode02/0").<br>
>>><br>
>>> i use torque 2.5.6 and maui 3.3.1. anyone can help me out, it'll be greatly appreciated<br>
>>><br>
>>> the submit script is something like:<br>
>>><br>
>>> #!/bin/sh<br>
>>> #PBS -N procsjob<br>
>>> ##PBS -l procs=3<br>
>>> #PBS -q batch<br>
>>> the output of checkjob is :<br>
>>><br>
>>> checking job 33<br>
>>> State: Running<br>
>>> Creds: user:wangxq group:wangxq class:batch qos:DEFAULT<br>
>>> WallTime: 00:00:00 of 1:00:00<br>
>>> SubmitTime: Fri Jan 13 17:07:43<br>
>>> (Time Queued Total: 00:00:01 Eligible: 00:00:01)<br>
>>> StartTime: Fri Jan 13 17:07:44<br>
>>> Total Tasks: 1<br>
>>> Req[0] TaskCount: 1 Partition: DEFAULT<br>
>>> Network: [NONE] Memory >= 0 Disk >= 0 Swap >= 0<br>
>>> Opsys: [NONE] Arch: [NONE] Features: [NONE]<br>
>>> Exec: '' ExecSize: 0 ImageSize: 0<br>
>>> Dedicated Resources Per Task: PROCS: 1<br>
>>> Utilized Resources Per Task: [NONE]<br>
>>> Avg Util Resources Per Task: [NONE]<br>
>>> Max Util Resources Per Task: [NONE]<br>
>>> NodeAccess: SHARED<br>
>>> NodeCount: 0<br>
>>> Allocated Nodes:<br>
>>> [snode02:1]<br>
>>> Task Distribution: snode02<br>
>>><br>
>>> IWD: [NONE] Executable: [NONE]<br>
>>> Bypass: 0 StartCount: 1<br>
>>> PartitionMask: [ALL]<br>
>>> Flags: RESTARTABLE<br>
>>> Reservation '33' (00:00:00 -> 1:00:00 Duration: 1:00:00)<br>
>>> PE: 1.00 StartPriority: 1<br>
>>> _______________________________________________<br>
>>> torqueusers mailing list<br>
>>> <a href="mailto:torqueusers@supercluster.org">torqueusers@supercluster.org</a><br>
>>> <a href="http://www.supercluster.org/mailman/listinfo/torqueusers" target="_blank">http://www.supercluster.org/mailman/listinfo/torqueusers</a><br>
>><br>
>> _______________________________________________<br>
>> torqueusers mailing list<br>
>> <a href="mailto:torqueusers@supercluster.org">torqueusers@supercluster.org</a><br>
>> <a href="http://www.supercluster.org/mailman/listinfo/torqueusers" target="_blank">http://www.supercluster.org/mailman/listinfo/torqueusers</a><br>
>><br>
>> _______________________________________________<br>
>> torqueusers mailing list<br>
>> <a href="mailto:torqueusers@supercluster.org">torqueusers@supercluster.org</a><br>
>> <a href="http://www.supercluster.org/mailman/listinfo/torqueusers" target="_blank">http://www.supercluster.org/mailman/listinfo/torqueusers</a><br>
><br>
> _______________________________________________<br>
> torqueusers mailing list<br>
> <a href="mailto:torqueusers@supercluster.org">torqueusers@supercluster.org</a><br>
> <a href="http://www.supercluster.org/mailman/listinfo/torqueusers" target="_blank">http://www.supercluster.org/mailman/listinfo/torqueusers</a><br>
<br>
_______________________________________________<br>
torqueusers mailing list<br>
<a href="mailto:torqueusers@supercluster.org">torqueusers@supercluster.org</a><br>
<a href="http://www.supercluster.org/mailman/listinfo/torqueusers" target="_blank">http://www.supercluster.org/mailman/listinfo/torqueusers</a><br>
</div></div></blockquote></div><br>