[Commits] [Einstein Toolkit] #60: Checkpoint and recovery does not work
Einstein Toolkit
trac at einsteintoolkit.org
Tue Oct 19 10:02:07 CDT 2010
#60: Checkpoint and recovery does not work
------------------------+---------------------------------------------------
Reporter: hinder | Owner: mthomas
Type: defect | Status: new
Priority: blocker | Milestone:
Component: SimFactory | Version:
Keywords: |
------------------------+---------------------------------------------------
This issue was discussed on the mailing list (8 October 2010 21:55:08
GMT+02:00) but never made it into a bug report. I have reproduced the
problem on Damiana and Queen Bee. If I submit a simulation and let it run
and make some checkpoint files, and then submit the same simulation again,
a second restart is correctly created, but I get the warning on startup
WARNING[L3,P0] (IOUtil): No HDF5 checkpoint files with basefilename
'checkpoint.chkpt' found in recovery directory .
and the simulation starts from iteration 0 rather than recovering.
It looks like simfactory is not hardlinking the checkpoint files from the
old restart to the new restart. Being able to use checkpoint/recovery is
absolutely necessary for production work.
--
Ticket URL: <https://trac.einsteintoolkit.org/ticket/60>
Einstein Toolkit <http://einsteintoolkit.org>
The Einstein Toolkit
More information about the Commits
mailing list