[ET Trac] [Einstein Toolkit] #1751: [Pull request: CactusUtils/WatchDog] new thorn to automatically terminate jobs that hang

Einstein Toolkit trac-noreply at einsteintoolkit.org
Mon Mar 2 13:30:16 CST 2015


#1751: [Pull request: CactusUtils/WatchDog] new thorn to automatically terminate
jobs that hang
-----------------------------------+----------------------------------------
 Reporter:  dradice@…              |       Owner:                     
     Type:  enhancement            |      Status:  new                
 Priority:  unset                  |   Milestone:                     
Component:  EinsteinToolkit thorn  |     Version:  development version
 Keywords:                         |  
-----------------------------------+----------------------------------------
 WatchDog is a thorn that terminates jobs that do not make progress over a
 user-defined time frame. Internally, WatchDog updates a timer at
 CCTK_ANALYSIS and uses the pthread library to spawn a watcher thread that
 periodically checks whether the timer has been updated. If the timer has
 not been updated for more than the user-defined time frame, the thread
 calls "exit()" to terminate the process (and the job).
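
 The mechanism described above can be sketched roughly as follows. This is
 a minimal illustration, not the thorn's actual source: every name here
 (wd_heartbeat, wd_is_stale, wd_watch, wd_start, and the two parameters
 timeout_s and check_every_s) is hypothetical, and wall-clock time from
 time() is assumed to be an adequate timer.

```c
#include <pthread.h>
#include <stdio.h>
#include <stdlib.h>
#include <time.h>
#include <unistd.h>

/* All identifiers below are hypothetical; this is not the WatchDog source. */

static pthread_mutex_t wd_lock = PTHREAD_MUTEX_INITIALIZER;
static time_t wd_last_update;            /* time of the most recent heartbeat */
static double wd_timeout_s = 3600.0;     /* assumed user-defined time frame   */
static unsigned wd_check_every_s = 60;   /* assumed polling interval          */

/* Called by the main thread whenever it makes progress
 * (in the thorn, this would happen at CCTK_ANALYSIS). */
void wd_heartbeat(void)
{
    pthread_mutex_lock(&wd_lock);
    wd_last_update = time(NULL);
    pthread_mutex_unlock(&wd_lock);
}

/* Returns nonzero if more than timeout_s seconds separate the two times. */
int wd_is_stale(time_t last_update, time_t now, double timeout_s)
{
    return difftime(now, last_update) > timeout_s;
}

/* Watcher thread: sleep, read the timer, terminate the process if stale. */
static void *wd_watch(void *arg)
{
    (void)arg;
    for (;;) {
        sleep(wd_check_every_s);

        pthread_mutex_lock(&wd_lock);
        time_t last = wd_last_update;
        pthread_mutex_unlock(&wd_lock);

        if (wd_is_stale(last, time(NULL), wd_timeout_s)) {
            fprintf(stderr,
                    "watchdog: no progress for more than %g s, exiting\n",
                    wd_timeout_s);
            exit(EXIT_FAILURE);
        }
    }
}

/* Records an initial heartbeat and spawns the detached watcher thread.
 * Returns 0 on success, nonzero on failure. */
int wd_start(double timeout_s, unsigned check_every_s)
{
    pthread_t tid;

    wd_timeout_s = timeout_s;
    wd_check_every_s = check_every_s;
    wd_heartbeat();

    if (pthread_create(&tid, NULL, wd_watch, NULL) != 0)
        return -1;
    return pthread_detach(tid);
}
```

 In this sketch a job would call wd_start() once at startup and
 wd_heartbeat() at each analysis step. Keeping the staleness test in its
 own function makes it possible to check the timeout logic without waiting
 for a real hang.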

-- 
Ticket URL: <https://trac.einsteintoolkit.org/ticket/1751>
Einstein Toolkit <http://einsteintoolkit.org>