[Users] Question about restarting a simulation from checkpoint data
Roland Haas
rhaas at mail.ubc.ca
Thu Jun 4 10:44:31 CDT 2026
Hello all,
This is also addressed in the chat:
https://matrix.to/#/%21mYSgqyShRCoThqmUmY%3Agitter.im/%24GoGyO7PMmaXQPBip5Jjk5T5dycsBi-o6_JlF8kI2zPQ?via=gitter.im&via=matrix.org
where the most recent issue is
--8<--
/home/jorge/simulations/ProcaStar6/SIMFACTORY/exe/cactus_sim -L 3 /home/jorge/simulations/ProcaStar6/output-0000/ProcaStar.par
cactus_sim: /home/jorge/Einstein-Toolkit/Cactus/arrangements/Carpet/CarpetLib/src/th.hh:70: CCTK_REAL8 CarpetLib::th::get_time(int, int, int) const: Assertion `tl >= 0 and tl < timelevels' failed.
Rank 0 with PID 57937 received signal 6
--8<--
Yours,
Roland
> [CAUTION: Non-UBC Email]
>
> Dear Einstein Toolkit users,
>
> I am currently working with the Einstein Toolkit and trying to use
> checkpoint/restart functionality to recover simulations.
>
> I understand the parameters that must be included in the parameter file,
> but I am not sure how I should correctly restart a simulation from existing
> checkpoint data.
>
> So far, I have been launching simulations with commands such as
>
> ./simfactory/bin/sim run ProcaStar -workstation-parfile=par/ProcaStar.par
>
> and the simulation runs correctly and generates checkpoint files. However,
> when I try to restart from the checkpoint data, I am unsure what the
> correct procedure should be. I do not know whether I should use a different
> SimFactory command, modify the submission procedure, or provide additional
> options.
>
> The checkpoint-related parameters in my parameter file are:
>
> CarpetIOHDF5::checkpoint = yes
> IOUtil::checkpoint_every_walltime_hours = 0.1
> IOUtil::checkpoint_dir = "checkpoints_prueba"
> IOUtil::checkpoint_keep = 1
> IOUtil::checkpoint_ID = no
> IOUtil::checkpoint_on_terminate = yes
> IOUtil::recover = "autoprobe"
> IOUtil::recover_dir = "checkpoints_prueba"
> IOUtil::abort_on_io_errors = yes
> CarpetIOHDF5::open_one_input_file_at_a_time = yes
>
> Could someone please clarify the recommended procedure for restarting a
> simulation from existing checkpoint files? In particular:
>
> 1.
>
> Which SimFactory command should be used to restart a simulation?
> 2.
>
> Is it necessary to create a new run or continue the existing one?
> 3.
>
> Are there any additional parameters or directory requirements for
> recovery to work correctly?
>
> My environment is:
> Einstein Toolkit version: ET_2025_05
> Operating system: Linux Ubuntu 22.04.5 LTS
> Cluster/machine: Local
>
> Thank you very much for your help.
>
> Best regards,
> Yahir Mio
--
My email is as private as my paper mail. I therefore support encrypting
and signing email messages. Get my PGP key from http://pgp.mit.edu .
More information about the Users
mailing list