<font face="verdana,sans-serif">Hi Gus,</font><div><font face="verdana,sans-serif"><br></font></div><div><font face="verdana,sans-serif">I have unset </font></div><div><font face="verdana,sans-serif"><span class="Apple-style-span" style="font-family: arial, sans-serif; font-size: 13px; border-collapse: collapse; color: rgb(80, 0, 80); ">> set server resources_default.neednodes = 1<br>
> set server resources_default.nodect = 1</span></font></div><div><font face="verdana,sans-serif"><span class="Apple-style-span" style="font-family: arial, sans-serif; font-size: 13px; border-collapse: collapse; color: rgb(80, 0, 80); "><br>
</span></font></div><div><font face="verdana,sans-serif"><span class="Apple-style-span" style="font-family: arial, sans-serif; font-size: 13px; border-collapse: collapse; color: rgb(80, 0, 80); ">I have 30 nodes listed in cluster and each one has 4 core</span></font></div>
<div><font face="verdana,sans-serif"><span class="Apple-style-span" style="font-family: arial, sans-serif; font-size: 13px; border-collapse: collapse; color: rgb(80, 0, 80); ">But still the job is running on one node.</span></font></div>
<div><font face="verdana,sans-serif"><span class="Apple-style-span" style="font-family: arial, sans-serif; font-size: 13px; border-collapse: collapse; color: rgb(80, 0, 80); "><br></span></font></div><div><font face="verdana,sans-serif"><span class="Apple-style-span" style="font-family: arial, sans-serif; font-size: 13px; border-collapse: collapse; color: rgb(80, 0, 80); ">Thanks</span></font></div>
<div><font face="verdana,sans-serif"><span class="Apple-style-span" style="font-family: arial, sans-serif; font-size: 13px; border-collapse: collapse; color: rgb(80, 0, 80); ">Govind</span></font></div><div><font face="verdana,sans-serif"><font class="Apple-style-span" color="#500050" face="arial, sans-serif"><span class="Apple-style-span" style="border-collapse: collapse;"><br>
</span></font></font><br><div class="gmail_quote">On 15 June 2010 18:50, Gus Correa <span dir="ltr"><<a href="mailto:gus@ldeo.columbia.edu">gus@ldeo.columbia.edu</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">
Hi Govind<br>
<br>
Please, read what Garrick said about your server configuration.<br>
To unset stuff do:<br>
qmgr -c 'unset server resources_default.neednodes'<br>
and so on ...<br>
<br>
Besides, your *Torque* (not OpenMPI) nodes file should list<br>
*all* nodes on your cluster with the respective<br>
number of processors/cores.<br>
If I understood what you said right,<br>
you only have one node listed there.<br>
<br>
Something like this:<br>
<br>
node01 np=4<br>
node02 np=4<br>
... (and so on ...)<br>
<div class="im"><br>
I hope this helps,<br>
Gus Correa<br>
<br>
<br>
</div><div class="im">Govind Songara wrote:<br>
> Hi Gus,<br>
><br>
> Thanks for your reply, with your advise I build openMPI.<br>
> I have standard nodes list as<br>
> node01 np=4<br>
><br>
> Thanks<br>
> Govind<br>
><br>
> On 15 June 2010 17:20, Gus Correa <<a href="mailto:gus@ldeo.columbia.edu">gus@ldeo.columbia.edu</a><br>
</div><div><div></div><div class="h5">> <mailto:<a href="mailto:gus@ldeo.columbia.edu">gus@ldeo.columbia.edu</a>>> wrote:<br>
><br>
> Hi Govind<br>
><br>
> What is the content of your Torque nodes file?<br>
> (located at /var/torque/server_priv/nodes or equivalent)<br>
> It should list all nodes and the number of processors/cores on each,<br>
> Something like this:<br>
><br>
> node01 np=2<br>
> node02 np=8<br>
> ...<br>
> node47 np=4<br>
> ...<br>
><br>
> Maybe I asked you this before in the OpenMPI list, not sure.<br>
><br>
> I hope this helps,<br>
> Gus Correa<br>
> ---------------------------------------------------------------------<br>
> Gustavo Correa<br>
> Lamont-Doherty Earth Observatory - Columbia University<br>
> Palisades, NY, 10964-8000 - USA<br>
> ---------------------------------------------------------------------<br>
><br>
><br>
> Govind wrote:<br>
> > Hi,<br>
> ><br>
> > I have am openmpi build with tm support<br>
> > When i run the mpi job requesting for two nodes and 4 core it run<br>
> only<br>
> > on single node.<br>
> ><br>
> > >cat mpipbs-script.sh<br>
> > #PBS -N mpipbs-script<br>
> > #PBS -q short<br>
> > ### Number of nodes: resources per node<br>
> > ### (4 cores/node, so ppn=4 is ALL resources on the node)<br>
> > #PBS -l nodes=2:ppn=4<br>
> > echo `cat $PBS_NODEFILE`<br>
> > NPROCS=`wc -l < $PBS_NODEFILE`<br>
> > echo This job has allocated $NPROCS processors<br>
> > /opt/openmpi-1.4.2/bin/mpirun /scratch0/gsongara/mpitest/hello<br>
> ><br>
> > It show only one node here is the output<br>
> > ===============<br>
> > node47.beowulf.cluster node47.beowulf.cluster node47.beowulf.cluster<br>
> > node47.beowulf.cluster<br>
> > This job has allocated 4 processors<br>
> > Hello World! from process 1 out of 4 on node47.beowulf.cluster<br>
> > Hello World! from process 2 out of 4 on node47.beowulf.cluster<br>
> > Hello World! from process 3 out of 4 on node47.beowulf.cluster<br>
> > Hello World! from process 0 out of 4 on node47.beowulf.cluster<br>
> > ===============<br>
> ><br>
> > torque config<br>
> > set queue short resources_max.nodes = 4<br>
> > set queue short resources_default.nodes = 1<br>
> > set server resources_default.neednodes = 1<br>
> > set server resources_default.nodect = 1<br>
> > set server resources_default.nodes = 1<br>
> ><br>
> > I also trying adding this parameter to maui as advised in some old<br>
> > thread but does not help<br>
> > JOBNODEMATCHPOLICY EXACTNODE<br>
> > ENABLEMULTINODEJOBS TRUE<br>
> > NODEACCESSPOLICY SHARED<br>
> ><br>
> ><br>
> > Can someone please advise if i missing anything here.<br>
> ><br>
> > Regards<br>
> > Govind<br>
> ><br>
> ><br>
> ><br>
> ------------------------------------------------------------------------<br>
> ><br>
> > _______________________________________________<br>
> > torqueusers mailing list<br>
</div></div>> > <a href="mailto:torqueusers@supercluster.org">torqueusers@supercluster.org</a> <mailto:<a href="mailto:torqueusers@supercluster.org">torqueusers@supercluster.org</a>><br>
<div class="im">> > <a href="http://www.supercluster.org/mailman/listinfo/torqueusers" target="_blank">http://www.supercluster.org/mailman/listinfo/torqueusers</a><br>
><br>
> _______________________________________________<br>
> torqueusers mailing list<br>
</div>> <a href="mailto:torqueusers@supercluster.org">torqueusers@supercluster.org</a> <mailto:<a href="mailto:torqueusers@supercluster.org">torqueusers@supercluster.org</a>><br>
<div><div></div><div class="h5">> <a href="http://www.supercluster.org/mailman/listinfo/torqueusers" target="_blank">http://www.supercluster.org/mailman/listinfo/torqueusers</a><br>
><br>
><br>
><br>
> ------------------------------------------------------------------------<br>
><br>
> _______________________________________________<br>
> torqueusers mailing list<br>
> <a href="mailto:torqueusers@supercluster.org">torqueusers@supercluster.org</a><br>
> <a href="http://www.supercluster.org/mailman/listinfo/torqueusers" target="_blank">http://www.supercluster.org/mailman/listinfo/torqueusers</a><br>
<br>
_______________________________________________<br>
torqueusers mailing list<br>
<a href="mailto:torqueusers@supercluster.org">torqueusers@supercluster.org</a><br>
<a href="http://www.supercluster.org/mailman/listinfo/torqueusers" target="_blank">http://www.supercluster.org/mailman/listinfo/torqueusers</a><br>
</div></div></blockquote></div><br></div>