[ET Trac] [Einstein Toolkit] #1286: SimFactory should not run queued chained jobs if a previous job fails
Einstein Toolkit
trac-noreply at einsteintoolkit.org
Mon Mar 11 09:54:53 CDT 2013
#1286: SimFactory should not run queued chained jobs if a previous job fails
--------------------------+-------------------------------------------------
Reporter: hinder | Owner: eschnett
Type: enhancement | Status: new
Priority: major | Milestone:
Component: SimFactory | Version:
Resolution: | Keywords:
--------------------------+-------------------------------------------------
Comment (by eschnett):
That may be too simplistic. If a simulation runs out of queue time, would
that count as "success"? Would you expect Simfactory to continue chaining
jobs in this case? What if different MPI processes return different exit
codes? What is, in general, the exit code of mpirun anyway? What would you
do if a simulation runs out of time? Out of memory? Out of disk space? What
if there is a file permission error and the simulation can't write? What
if the Cactus executable never actually starts because something is wrong?
What if the user used qdel to stop a simulation? What if the user used the
web interface or a termination trigger to stop the simulation?
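
To make the cases above concrete, here is a minimal sketch of what a
chaining policy might have to decide. Everything in it is hypothetical:
the Outcome categories, the function names, and the policy itself are
assumptions made for illustration, not part of Simfactory, and detecting
which outcome actually occurred is the hard part that this sketch does
not attempt.

```python
from enum import Enum, auto


class Outcome(Enum):
    """Hypothetical ways a queued job can end (not a Simfactory API)."""
    FINISHED = auto()           # simulation reached its final time
    OUT_OF_QUEUE_TIME = auto()  # queue time ran out, simulation incomplete
    USER_STOPPED = auto()       # qdel, web interface, or termination trigger
    RESOURCE_ERROR = auto()     # out of memory, out of disk space, permissions
    NEVER_STARTED = auto()      # the executable never actually started
    UNKNOWN = auto()            # e.g. MPI processes disagree on the exit code


# Assumed policy: only a job that ran out of queue time should be followed
# by the next job in the chain. Other sites or users may want a different set.
CONTINUE_CHAIN = {Outcome.OUT_OF_QUEUE_TIME}


def combine_exit_codes(codes):
    """Return the common exit code of all processes, or None if they differ
    (or if no codes were reported)."""
    distinct = set(codes)
    return distinct.pop() if len(distinct) == 1 else None


def should_run_next_job(outcome):
    """Decide whether the next queued job in the chain should run."""
    return outcome in CONTINUE_CHAIN
```

Under this assumed policy, should_run_next_job(Outcome.OUT_OF_QUEUE_TIME)
is true, while a job stopped by the user or by a resource error ends the
chain. A single exit code would not be enough to tell these cases apart.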
--
Ticket URL: <https://trac.einsteintoolkit.org/ticket/1286#comment:3>
Einstein Toolkit <http://einsteintoolkit.org>
The Einstein Toolkit