Hi Adil,<br><br>I think that you could enable port 15001 in iptables. That way you'll have working<br>firewall and torque as well.<br><br>Jozef<br><br><div class="gmail_quote">On Mon, Feb 25, 2008 at 2:44 PM, Adil Mughal <<a href="mailto:adil.m.mughal@gmail.com">adil.m.mughal@gmail.com</a>> wrote:<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">I feel silly for answering my own problem but I found that<br>
<br>
> service iptables stop<br>
<br>
solved my problems!!<br>
<div><div></div><div class="Wj3C7c"><br>
On Mon, Feb 25, 2008 at 1:35 PM, Adil Mughal <<a href="mailto:adil.m.mughal@gmail.com">adil.m.mughal@gmail.com</a>> wrote:<br>
> I had a closer look at my mom_log file on one of the slaves and there<br>
> is the following repeated error message:<br>
><br>
> pbs_mom;Req;jobobit;No contact with server at hostaddr 907c3092, port<br>
> 15001, jobid 165.dphpc1011.dph.$<br>
> $1.dph.aber.ac.uk errno 113<br>
><br>
><br>
> Does that help?<br>
><br>
> Adil<br>
><br>
><br>
><br>
> On Mon, Feb 25, 2008 at 1:17 PM, Adil Mughal <<a href="mailto:adil.m.mughal@gmail.com">adil.m.mughal@gmail.com</a>> wrote:<br>
> > Dear Experts<br>
> ><br>
> > I recently had to reboot my master computer.<br>
> ><br>
> > After rebooting I went through the usual steps to set up - i.e.<br>
> ><br>
> > >qterm<br>
> > > pbs_server<br>
> > >pbs_sched<br>
> ><br>
> > The problem is that now when I submit a basic job like:<br>
> ><br>
> > echo "sleep 5" | qsub<br>
> ><br>
> > or<br>
> ><br>
> > echo "touch testfile" | qsub<br>
> ><br>
> > the job remains in the run state, that is typing qstat gives something<br>
> > like this:<br>
> ><br>
> > Job id Name User Time Use S Queue<br>
> > ------------------- ---------------- --------------- -------- - -----<br>
> > 165.dphpc1011 STDIN guest1 0 R batch<br>
> > 166.dphpc1011 STDIN guest1 00:00:00 R batch<br>
> > 167.dphpc1011 STDIN guest1 0 R batch<br>
> > 168.dphpc1011 STDIN guest1 00:00:00 R batch<br>
> ><br>
> > Wheras prevously the jobs were running and then dequeuing<br>
> ><br>
> > Any ideas what I might have missed<br>
> ><br>
> > adil<br>
> ><br>
><br>
_______________________________________________<br>
torqueusers mailing list<br>
<a href="mailto:torqueusers@supercluster.org">torqueusers@supercluster.org</a><br>
<a href="http://www.supercluster.org/mailman/listinfo/torqueusers" target="_blank">http://www.supercluster.org/mailman/listinfo/torqueusers</a><br>
</div></div></blockquote></div><br>