[Mauiusers] suspend / resume
Bernward Platz
Bernward.Platz@clucon.de
Thu, 29 Jul 2004 18:34:23 +0200
I think this is a problem in req_signal.c: when a job is suspended, the
nodes allocated to the job are not released.
I wrote a short patch to solve this problem. The important call in
req_signal.c is "free_nodes".
The patch is not well tested yet, but I have used it several times without
problems.
Regards
Bernward
diff -urN -X exclude torque-1.0.1.org/src/server/req_signal.c torque-1.0.1/src/server/req_signal.c
--- torque-1.0.1.org/src/server/req_signal.c 2004-02-13 20:01:00.000000000 +0100
+++ torque-1.0.1/src/server/req_signal.c 2004-03-20 10:01:13.000000000 +0100
@@ -206,8 +206,10 @@
  pjob->ji_qs.ji_svrflags |= JOB_SVFLG_Suspend;
  set_statechar(pjob);
  job_save(pjob, SAVEJOB_QUICK);
+ free_nodes(pjob);
  } else if (strcmp(preq->rq_ind.rq_signal.rq_signame,
  SIG_RESUME) == 0) {
+ set_old_nodes(pjob);
  pjob->ji_qs.ji_svrflags &= ~JOB_SVFLG_Suspend;
  set_statechar(pjob);
  job_save(pjob, SAVEJOB_QUICK);
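
The intended effect of the two added calls can be sketched with a small
stand-alone model. This is only an illustration and not TORQUE code: every
toy_* name below is hypothetical, and the behaviour attributed to
free_nodes() and set_old_nodes() (release the job's nodes on suspend, mark
the same nodes as in use again on resume) is an assumption based on how the
patch uses them. The check that refuses to resume while another job holds
one of the nodes is an addition of the sketch; the patch itself calls
set_old_nodes() unconditionally.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-ins for illustration; not the TORQUE data structures. */
enum toy_node_state { TOY_NODE_FREE, TOY_NODE_JOB_EXCLUSIVE };

#define TOY_MAX_NODES 8

struct toy_node {
    enum toy_node_state state;
    int owner;                        /* id of the owning job, or -1 */
};

struct toy_job {
    int id;
    bool suspended;
    size_t nnodes;                    /* number of entries used in node_idx */
    size_t node_idx[TOY_MAX_NODES];   /* remembered node allocation */
};

/* Assumed effect of free_nodes(pjob): give the job's nodes back. */
static void toy_release_nodes(struct toy_node *cluster,
                              const struct toy_job *job)
{
    for (size_t i = 0; i < job->nnodes; i++) {
        struct toy_node *n = &cluster[job->node_idx[i]];
        if (n->owner == job->id) {
            n->state = TOY_NODE_FREE;
            n->owner = -1;
        }
    }
}

/* Assumed effect of set_old_nodes(pjob): mark the remembered nodes as in
 * use again. Returns false, changing nothing, if another job holds one. */
static bool toy_reclaim_nodes(struct toy_node *cluster,
                              const struct toy_job *job)
{
    for (size_t i = 0; i < job->nnodes; i++) {
        const struct toy_node *n = &cluster[job->node_idx[i]];
        if (n->state != TOY_NODE_FREE && n->owner != job->id)
            return false;
    }
    for (size_t i = 0; i < job->nnodes; i++) {
        struct toy_node *n = &cluster[job->node_idx[i]];
        n->state = TOY_NODE_JOB_EXCLUSIVE;
        n->owner = job->id;
    }
    return true;
}

/* Suspend: set the flag first, then release the nodes (order as in the patch). */
static void toy_suspend(struct toy_node *cluster, struct toy_job *job)
{
    assert(cluster != NULL && job != NULL);
    job->suspended = true;
    toy_release_nodes(cluster, job);
}

/* Resume: take the nodes back first, then clear the flag. */
static bool toy_resume(struct toy_node *cluster, struct toy_job *job)
{
    assert(cluster != NULL && job != NULL);
    if (!toy_reclaim_nodes(cluster, job))
        return false;
    job->suspended = false;
    return true;
}
```

In this model a suspended job keeps its node list but no longer owns the
nodes, so they can be given to other jobs in the meantime. The sketch also
makes visible a case the patch does not handle: a resume request that
arrives while another job is running on one of the released nodes.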
On Wednesday 28 July 2004 10:50, Sébastien Georget wrote:
> Hi,
>
> I am trying to use maui/torque suspend feature. Right now I can
> suspend/resume jobs using qsig -s suspend/resume JOBID or mjobctl -s/-r
> JOBID.
> The problem is that the nodes where the suspended job runs are still in
> the state 'job-exclusive' and cannot be used to submit new jobs. I
> wonder which one of maui or torque has a faulty behaviour here.
> Should torque change the state of the node to free when the job is
> suspended, or should it be maui? Can it be configured somewhere?
>
> thx,
> Sébastien
--
------------------------------------------------------------
clucon - cluster concepts
Bernward Platz
Dipl.-Inform. Bernward Platz
Geisenbrunner Str. 72a
81475 Muenchen
Phone: +49 89 7593838
Fax: +49 89 75201462
Mobile: +49 175 5247883
Mail: Bernward.Platz@clucon.de
Web: http://www.clucon.de