Hello,<br><br>I'm having a problem draining my cluster's batch queues using setres. Adapting one of the examples given in the clusterresources documentation I attempt to reserve every node and processor in my cluster during our scheduled downtime:<br>
<br># setres -s 18:00:00_11/18 -e 08:00:00_11/19 -n electricityDowntime ALL<br><br><br>I then check the reservations and see that the downtime is there and is applied to all processors along with a few other reservations for pre-existing jobs.<br>
<br>To test the reservation I submit a job which requests an excessive amount of wallclock time (Job 918). It's scheduled to start when the downtime finishes which is what I expect:<br><br>[root@lphesrv1 spool]# showres<br>
Reservations<br><br>ReservationID Type S Start End Duration N/P StartTime<br><br>909 Job R -1:21:00 83:06:39:00 83:08:00:00 1/1 Mon Nov 10 15:40:32<br>912 Job R -1:09:05 4:12:50:55 4:14:00:00 1/1 Mon Nov 10 15:52:27<br>
913 Job R -00:49:57 4:13:10:03 4:14:00:00 1/1 Mon Nov 10 16:11:35<br>918 Job I 8:14:58:28 91:22:58:28 83:08:00:00 1/1 Wed Nov 19 08:00:00<br>electricityDowntime.0 User - 8:00:58:28 8:14:58:28 14:00:00 60/480 Tue Nov 18 18:00:00<br>
<br>17 reservations located<br><br><br>Unfortunately, upon checking the reservations ~ 10minutes later I find that Job 918 has started despite the reservations overlapping:<br><br><br>[root@lphesrv1 spool]# showres<br>Reservations<br>
<br>ReservationID Type S Start End Duration N/P StartTime<br><br>909 Job R -1:27:25 83:06:32:35 83:08:00:00 1/1 Mon Nov 10 15:40:32<br>912 Job R -1:15:30 4:12:44:30 4:14:00:00 1/1 Mon Nov 10 15:52:27<br>
913 Job R -00:56:22 4:13:03:38 4:14:00:00 1/1 Mon Nov 10 16:11:35<br>918 Job R -00:06:15 83:07:53:45 83:08:00:00 1/1 Mon Nov 10 17:01:42<br>electricityDowntime.0 User - 8:00:52:03 8:14:52:03 14:00:00 60/480 Tue Nov 18 18:00:00<br>
<br>17 reservations located<br><br><br>Can anyone help me debug this/explain this behaviour? I can't find anything in my maui dir logs and only:<br><br>11/10/2008 17:01:42;0100;PBS_Server;Req;;Type StatusJob request received from <a href="mailto:root@lphesrv1.epfl.ch">root@lphesrv1.epfl.ch</a>, sock=14<br>
11/10/2008 17:01:42;0100;PBS_Server;Req;;Type ModifyJob request received from <a href="mailto:root@lphesrv1.epfl.ch">root@lphesrv1.epfl.ch</a>, sock=14<br>11/10/2008 17:01:42;0008;PBS_Server;Job;<a href="http://918.lphesrv1.epfl.ch">918.lphesrv1.epfl.ch</a>;Job Modified at request of <a href="mailto:root@lphesrv1.epfl.ch">root@lphesrv1.epfl.ch</a><br>
11/10/2008 17:01:42;0100;PBS_Server;Req;;Type RunJob request received from <a href="mailto:root@lphesrv1.epfl.ch">root@lphesrv1.epfl.ch</a>, sock=14<br>11/10/2008 17:01:42;0008;PBS_Server;Job;<a href="http://918.lphesrv1.epfl.ch">918.lphesrv1.epfl.ch</a>;Job Run at request of <a href="mailto:root@lphesrv1.epfl.ch">root@lphesrv1.epfl.ch</a><br>
<br>in the pbs logs.<br><br>I'm using:<br><br>[root@lphesrv1 spool]# qmgr -c "p s"|grep pbs_ver<br>set server pbs_version = 2.3.0-snap.200801151629<br>[root@lphesrv1 spool]# setres -v<br>maui client version 3.2.6p20<br>
<br>Will post my maui.cfg if relevant.<br><br>Thanks,<br><br>Paul.<br><br>-- <br>-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=<br> Paul Szczypka, EPFL SB IPEP LPHE1, BSP 614, CH-1015 Lausanne<br> paul.szczypka@cern.ch Tel: +41 21 69 30495<br>
-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=<br> Please avoid sending me Word or PowerPoint attachments.<br> See <a href="http://www.gnu.org/philosophy/no-word-attachments.html">http://www.gnu.org/philosophy/no-word-attachments.html</a><br>
<br><br><br>