[Commits] [Einstein Toolkit] #60: Checkpoint and recovery does not work

Einstein Toolkit trac at einsteintoolkit.org
Tue Oct 19 10:02:07 CDT 2010


#60: Checkpoint and recovery does not work
------------------------+---------------------------------------------------
 Reporter:  hinder      |       Owner:  mthomas
     Type:  defect      |      Status:  new    
 Priority:  blocker     |   Milestone:         
Component:  SimFactory  |     Version:         
 Keywords:              |  
------------------------+---------------------------------------------------
 This issue was discussed on the mailing list (8 October 2010 21:55:08
 GMT+02:00) but never made it into a bug report.  I have reproduced the
 problem on Damiana and Queen Bee.  If I submit a simulation and let it run
 and make some checkpoint files, and then submit the same simulation again,
 a second restart is correctly created, but I get the warning on startup

 WARNING[L3,P0] (IOUtil): No HDF5 checkpoint files with basefilename
 'checkpoint.chkpt' found in recovery directory .

 and the simulation starts from iteration 0 rather than recovering.

 It looks like simfactory is not hardlinking the checkpoint files from the
 old restart to the new restart.  Being able to use checkpoint/recovery is
 absolutely necessary for production work.

-- 
Ticket URL: <https://trac.einsteintoolkit.org/ticket/60>
Einstein Toolkit <http://einsteintoolkit.org>
The Einstein Toolkit


More information about the Commits mailing list