<br><font size=2 face="Arial">I think the situation Sam has is slightly
different and I can confirm (practically and with a simple test) the problem.
The difference is to wait 2 minutes between the submission of 81384
and 81385.</font>
<br>
<br><font size=2 face="Arial">This stems from something I've griped about
before, in order for a dependency to be recognized correctly by Torque,
the job that is being depended on must currently be in the queue (I
could be off here, I'm not sure what the all of the legal states are, but
I don't believe C is one of them). If the jobid dependency isn't
currently in the queue, the submitted job get's held. I'm not willing
to say what the correct behavior is there, though.</font>
<br>
<br><font size=2 face="Arial">Nate</font>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<br>
<table width=100%>
<tr valign=top>
<td width=40%><font size=1 face="sans-serif"><b>"Garrick Staples"
<garrick@clusterresources.com></b> </font>
<br><font size=1 face="sans-serif">Sent by: torqueusers-bounces@supercluster.org</font>
<p><font size=1 face="sans-serif">05-Sep-2006 17:39</font>
<td width=59%><font size=1 face="sans-serif">
</font>
<table width=100%>
<tr>
<td>
<div align=right><font size=1 face="sans-serif">To</font></div>
<td valign=top><font size=1 face="sans-serif">torqueusers@supercluster.org</font>
<tr>
<td>
<div align=right><font size=1 face="sans-serif">cc</font></div>
<td valign=top>
<tr>
<td>
<div align=right><font size=1 face="sans-serif">Subject</font></div>
<td valign=top><font size=1 face="sans-serif">Re: [torqueusers] dependencies
on completed jobs</font></table>
<br>
<table>
<tr valign=top>
<td>
<td></table>
<br></table>
<br>
<br>
<br><font size=2><tt>On Tue, Sep 05, 2006 at 12:45:27PM -0700, Sam Rash
alleged:<br>
> So we've noticed that if we submit job A, then submit job B which
depends on<br>
> A (-W depend=afterok:B_job_id), and A has already completed (we have<br>
> keep_completed set to at least 30 min), B get stuck in the hold state.
Is<br>
> this intentional? Or a bug?<br>
> <br>
> <br>
> <br>
> It seems like B should surely run.<br>
> <br>
> (maybe A updates its dependents when it completes and B won't check<br>
> explicitly?)<br>
<br>
The simple test works fine for me.<br>
<br>
[garrick@hpcjr-master garrick]$ echo sleep 60 | qsub<br>
81384.hpcjr-master.usc.edu<br>
[garrick@hpcjr-master garrick]$ echo sleep 60 | qsub -W depend=afterok:81384<br>
81385.hpcjr-master.usc.edu<br>
[garrick@hpcjr-master garrick]$ echo sleep 60 | qsub -W depend=afterok:81385<br>
81386.hpcjr-master.usc.edu<br>
<br>
After 1.5 minutes:<br>
81384.hpcjr-master.u garrick batch STDIN
10643 1 -- -- 01:00 C 00:00<br>
81385.hpcjr-master.u garrick batch STDIN
10804 1 -- -- 01:00 R --<br>
81386.hpcjr-master.u garrick batch STDIN
-- 1 -- -- 01:00 H
--<br>
<br>
And after 2.5 minutes:<br>
81384.hpcjr-master.u garrick batch STDIN
10643 1 -- -- 01:00 C 00:00<br>
81385.hpcjr-master.u garrick batch STDIN
10804 1 -- -- 01:00 C 00:00<br>
81386.hpcjr-master.u garrick batch STDIN
10910 1 -- -- 01:00 R --<br>
<br>
<br>
_______________________________________________<br>
torqueusers mailing list<br>
torqueusers@supercluster.org<br>
http://www.supercluster.org/mailman/listinfo/torqueusers<br>
<br>
</tt></font>
<br>