[ET Trac] [Einstein Toolkit] #1751: [Pull request: CactusUtils/WatchDog] new thorn to automatically terminate jobs that hang
Einstein Toolkit
trac-noreply at einsteintoolkit.org
Mon Mar 2 13:30:16 CST 2015
#1751: [Pull request: CactusUtils/WatchDog] new thorn to automatically terminate
jobs that hang
-----------------------------------+----------------------------------------
Reporter: dradice@… | Owner:
Type: enhancement | Status: new
Priority: unset | Milestone:
Component: EinsteinToolkit thorn | Version: development version
Keywords: |
-----------------------------------+----------------------------------------
WatchDog is thorn that terminates jobs that do not make progress over a
user-defined time frame. Internally, WatchDog updates an internal timer at
CCTK_ANALYSIS and uses the pthread library to spawn watcher thread that
periodically checks if the timer has been updated. If the timer has not
been updated for more than a user-defined time frame, the thread calls
"exit()" to terminate the process (and the job).
--
Ticket URL: <https://trac.einsteintoolkit.org/ticket/1751>
Einstein Toolkit <http://einsteintoolkit.org>
The Einstein Toolkit
More information about the Trac
mailing list