4.1.4 e - When in cray mode, write physmem and availmem in addition to totmem so that Moab correctly reads memory info. e - Specifying size, nodes, and mppwidth and all mutually exclusize, so reject job submissions that attempt to specify more than one of these. TRQ-1185. b - Merged changes for revision 7000 by hand because the merge was not clean. This fixes problems with a deadlock when doing job dependencies using synccount/syncwith. TRQ-1374 b - Fix a segfault in req_jobobit due to an off-by-one error. TRQ-1361. e - Add the svn revision to --version outputs. TRQ-1357. b - Fix a race condition in mom hierarchy reporting. TRQ-1378. b - Fixed pbs_mom so epilogue will only run once. TRQ-1134 b - Fix some debug output escaping into job output. TRQ-1360. b - Fix a Cray-mode bug with jobs ending immediately when spanning nodes of different proc counts when specifying -l procs. TRQ-1365. b - Don't fail to make the tmpdir for sister moms. bugzilla #220, TRQ-1403. e - Changed momctl to do retries to get connections to make it more robust on busy systems. TRQ-1328. e - Added new option to torque.cfg named HOST_NAME_SUFFIX which allows qsub to add a suffix to a hostname on job submission. TRQ-1332 c - Fix crashes due to unprotected array accesses. TRQ-1395. b - Fixed a deadlock in get_parent_dest_queues when the queue_parent_name and queue_dest_name are the same. TRQ-1413. 11/7/12 b - Fixed segfault in req_movejob where the job ji_qhdr was NULL. TRQ-1416 b - Fixed an End of File problem between Moab and TORQUE. This one had to do with SO_KEEPALIVE getting set on the accept socket for port 15001. Because we already check connections with the tcp_timeout we do not need the keep alive. The setsockopt commands to set the keep alive have been removed. TRQ-1211 b - Fix a conflict in the code for herogeneous jobs and regular jobs. b - For alps jobs, use the login nodes evenly even when one goes down. TRQ-1317. b - Display the correct 'Assigned Cpu Count' in momctl output. TRQ-1307. b - Make pbs_original_connect() no longer hang if the host is down. TRQ-1388. b - Make epilogues run only once and be executed by the child and not the main pbs_mom process. TRQ-937. b - Reduce the error messages in HA mode from moms. They now only log errors if no server could be contacted. TRQ-1385. b - Fixed a seg-fault in send_depend_req. Also fixed a deadlock in the depend_on_term TRQ-1430 and TRQ-1436 b - Fixed a null pointer dereference seg-fault when checking for disallowed types TRQ-1408. b - Fix a counting problem when running multi-req ALPS jobs (cray only). TRQ-1431. b - Remove red herring error messages 'did not find work task for local request'. These tasks are no longer created since issue_Drequest blocks until it gets the reply and then processes it. TRQ-1423. b - Fixed a problem where qsub was not applying the submit filter when given in the torque.cfg file. TRQ-1446 e - When the mom has no jobs, check the aux path to make sure it is clean and that we aren't leaving any files there. TRQ-1240. b - Made it so that threads taken up by poll job tasks cannot consume all available threads in the thread pool. This will make it so other work can continue if poll jobs get stuck for whatever reason and that the server will recover. TRQ-1433 b - Fix a deadlock when recording alps reservations. TRQ-1421. b - Fixed a segfault in req_jobobit caused by NULL pointer assignment to variable pa. TRQ-1467 b - Fixed deadlock in remove_array. remove_array was calling get_arry with allarrays_mutex locked. TRQ-1466 b - Fixed a problem with an end of file error when running momctl -dx. TRQ-1432. b - Fix a deadlock in rare cases on job insertion. TRQ-1472. b - Fix a deadlock after restarting pbs_server when it was SIGKILL'd before a job array was done cloning. TRQ-1474. b - Fix a Cray-related deadlock. Always lock the reporter mom before a compute node. TRQ-1445 b - Additional fix for TRQ-1472. In rm_request on the mom pbs_tcp_timeout was getting set to 0 which made it so the MOM would fail reading incoming data if it had not already arrived. This would cause momctl -to fail with an end of file message. e - Add a safety net to resend any obits for exiting jobs on the mom that still haven't cleaned up after five minutes. TRQ-1458. b - Fix cray running jobs being cancelled after a restart due to jobs not being set to the login nodes. TRQ-1482. b - Make qsub -I -x work again. TRQ-1483. c - Fix a potential crash when getting the status of a login node in cray mode. TRQ-1491.