<html><head><meta http-equiv="Content-Type" content="text/html; charset=us-ascii"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><br class=""><div><br class=""><blockquote type="cite" class=""><div class="">On 23 Oct 2018, at 08:57, Benjamin Chardi Marco &lt;<a href="mailto:bchardim@redhat.com" class="">bchardim@redhat.com</a>&gt; wrote:</div><br class="Apple-interchange-newline"><div class=""><div dir="ltr" class=""><div dir="ltr" class="">Dear friends,<div class=""><br class=""></div><div class="">We are trying to use the EinsteinToolKit&nbsp;GW150914.rpar binary balckhole merge simulation&nbsp; as use case to test that our container orchestration product OpenShift can be used for HPC.&nbsp;</div><div class="">Our test environment only has 30 CPUs so we need to execute that simulation in a reasonable time.</div></div></div></div></blockquote><div><br class=""></div><div>Hi,</div><div><br class=""></div><div>30 CPUs is quite a lot; do you really mean 30 CPUs, or 30 cores? &nbsp;What CPU are you using, and how many cores does it have? &nbsp;Also, what is the interconnect between the nodes? &nbsp;Infiniband, omnipath, gigabit ethernet, etc?</div><div><br class=""></div><blockquote type="cite" class=""><div class=""><div dir="ltr" class=""><div dir="ltr" class=""><div class="">Please can you tell us how to modify&nbsp; GW150914.rpar in order to get a less precise simulation executed in a 30 CPUs cluster in a reasonable time (~ few days).&nbsp;</div></div></div></div></blockquote><div><br class=""></div><div><div>You can run a lower resolution by changing the</div><div><br class=""></div><div>&nbsp; --define N 28</div><div><br class=""></div><div>to something else. &nbsp;This must be a multiple of 4, and can probably go as low as 20 without the simulation crashing. &nbsp;[Roland: you mentioned 24 in your email. &nbsp;Did you try 20 and it crashed? &nbsp;I seem to remember 20 working at one point.] &nbsp;This is a measure of the number of grid cells across the black holes, so increasing it gives you more cells, higher resolution, and the run goes more slowly.</div><div><br class=""></div><div>Roland's suggestions are also good, but I would make a couple of changes to what he recommended. &nbsp;</div><div><br class=""></div><div>The original boundary condition in GW150914.rpar sets the time derivative ("rhs") of all fields to 0 on the outer spherical boundary. &nbsp;This fixes the fields to their initial value, so can be considered a Dirichlet (or "scalar") boundary condition: ML_BSSN::rhs_boundary_condition&nbsp;&nbsp; &nbsp;&nbsp;= "scalar". &nbsp;This is in general a bad thing to do (you will get almost perfect reflections of all outgoing waves), but the boundary was placed far enough away that it could not influence the waveform. &nbsp;This is generally very cheap with the spherical outer grid used here, and was done because we had not implemented radiative boundary conditions that worked with the spherical grids in McLachlan at the time.</div><div><br class=""></div><div>The improvement that I believe Roland meant to make was to change the boundary condition to radiative (not Robin), which has now been implemented in the code. &nbsp;This makes the fields obey an advection-type equation on the outer boundary, assuming that all fields are solutions to outgoing radial wave equations. &nbsp;In Roland's parameter file, he set</div><div><br class=""></div><div>NewRad::z_is_radial&nbsp;&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;&nbsp;= yes</div><div><br class=""></div><div>but this is a technical change to trick NewRad into working with the spherical grids that we use here. &nbsp;To change the boundary condition itself, you need to set</div><div><br class=""></div><div>ML_BSSN::rhs_boundary_condition&nbsp;&nbsp; &nbsp;&nbsp;= "NewRad"</div><div><br class=""></div><div>rather than "scalar".</div><div><br class=""></div><div>The other change Roland made was to change final_time to half its current value:</div><div><br class=""></div><div><div>final_time = waveform_length + outermost_detector</div><div>-&gt;</div><div class=""><div class="">final_time = 0.5*(waveform_length + outermost_detector)</div></div><div class=""><br class=""></div><div class="">This doesn't seem correct. &nbsp;It is true that final_time is used to set sphere_outer_radius, but this will not halve the size of the domain. &nbsp;Further, it will halve the runtime of the simulation, so the simulation will stop before the BHs have merged. &nbsp;Instead, I would change sphere_outer_radius as follows:</div><div class=""><br class=""></div><div class=""><div class="">-sphere_outer_radius = int((outermost_detector + final_time)/(i*hr))*i*hr</div><div class="">+sphere_outer_radius = int((1000)/(i*hr))*i*hr</div></div><div class=""><br class=""></div><div class="">This might make the waveform noiser, but with the changed boundary condition, it shouldn't be too bad.</div></div><div><br class=""></div></div><blockquote type="cite" class=""><div class=""><div dir="ltr" class=""><div dir="ltr" class=""><div class="">Now we can run the simulation&nbsp; GW150914.rpar using OpenMPI&nbsp;+ EinsteinToolKit, but it takes so long to be executed (~ weeks).</div></div></div></div></blockquote><div><br class=""></div><div>This sounds liked quite a long time! &nbsp;It sounds too long. &nbsp;On the page describing the simulation,&nbsp;<a href="https://einsteintoolkit.org/gallery/bbh/index.html" class="">https://einsteintoolkit.org/gallery/bbh/index.html</a>, it says that the simulation takes 2.8 days on 128 cores of an&nbsp;Intel(R) Xeon(R) CPU E5-2630 v3 @ 2.40GHz (Haswell). &nbsp;Assuming that you mean you are using 30 cores, and if you are using a similar CPU, then it should take 2.8 * 128/30 = 11.9 days. &nbsp;Is this about what you see? &nbsp;What speed is reported? &nbsp;You can see this in the output file GW150914_*.out:</div><div><br class=""></div><div><div style="margin: 0px; font-stretch: normal; line-height: normal; font-family: Menlo; background-color: rgb(255, 255, 255);" class=""><span style="font-variant-ligatures: no-common-ligatures" class="">----------------------------------------------------------------------------------------------------------</span></div><div style="margin: 0px; font-stretch: normal; line-height: normal; font-family: Menlo; background-color: rgb(255, 255, 255);" class=""><span style="font-variant-ligatures: no-common-ligatures" class="">Iteration&nbsp; &nbsp; &nbsp; Time | *me_per_hour |&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ML_BSSN::phi | *TISTICS::maxrss_mb | *TICS::swap_used_mb</span></div><div style="margin: 0px; font-stretch: normal; line-height: normal; font-family: Menlo; background-color: rgb(255, 255, 255);" class=""><span style="font-variant-ligatures: no-common-ligatures" class="">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; |&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; |&nbsp; &nbsp; &nbsp; minimum&nbsp; &nbsp; &nbsp; maximum | &nbsp; minimum &nbsp; maximum | &nbsp; minimum &nbsp; maximum</span></div><div style="margin: 0px; font-stretch: normal; line-height: normal; font-family: Menlo; background-color: rgb(255, 255, 255);" class=""><span style="font-variant-ligatures: no-common-ligatures" class="">----------------------------------------------------------------------------------------------------------</span></div><div style="margin: 0px; font-stretch: normal; line-height: normal; font-family: Menlo; background-color: rgb(255, 255, 255);" class=""><span style="font-variant-ligatures: no-common-ligatures" class="">&nbsp;&nbsp; 114640 &nbsp; 246.602 | &nbsp; 10.5254966 |&nbsp; &nbsp; 0.0149352&nbsp; &nbsp; 0.9995490 |&nbsp; &nbsp; &nbsp; 3748&nbsp; &nbsp; &nbsp; 5289 | &nbsp; &nbsp; &nbsp; &nbsp; 0 &nbsp; &nbsp; &nbsp; &nbsp; 0</span></div><div style="margin: 0px; font-stretch: normal; line-height: normal; font-family: Menlo; background-color: rgb(255, 255, 255);" class=""><span style="font-variant-ligatures: no-common-ligatures" class="">&nbsp;&nbsp; 114644 &nbsp; 246.611 | &nbsp; 10.5255173 |&nbsp; &nbsp; 0.0144565&nbsp; &nbsp; 0.9995490 |&nbsp; &nbsp; &nbsp; 3748&nbsp; &nbsp; &nbsp; 5289 | &nbsp; &nbsp; &nbsp; &nbsp; 0 &nbsp; &nbsp; &nbsp; &nbsp; 0</span></div><div class=""><span style="font-variant-ligatures: no-common-ligatures" class=""><br class=""></span></div><div class=""><span style="font-variant-ligatures: no-common-ligatures" class="">The third column is the speed of the simulation in coordinate time per hour (it is a truncation of "physical_time_per_hour"). &nbsp;</span></div></div><div><br class=""></div><div>It's possible that the OpenMP or MPI configuration is not correct. &nbsp;Please could you post the standard output file (GW150914_*.out) to&nbsp;<a href="https://pastebin.com" class="">https://pastebin.com</a> so we can take a look at it?</div><div><br class=""></div><blockquote type="cite" class=""><div class=""><div dir="ltr" class=""><div dir="ltr" class=""><div class="">We believe that&nbsp; GW150914.rpar EinsteinToolKit is a great use case to test OpenShift for HPC, and of course we will reference to EinsteinToolKit is our final report as a use case for Openshift in HPC mode.</div></div></div></div></blockquote><div><br class=""></div></div><div class="">Great; it sounds interesting! &nbsp;There are instructions for the papers which should be cited if you use this parameter file and code at the top of the parameter file:</div><div class=""><pre style="word-wrap: break-word; white-space: pre-wrap;" class=""># Copyright Barry Wardell, Ian Hinder, Eloisa Bentivegna

# We ask that if you make use of the parameter file or the example

# data, then please cite

# Simulation of GW150914 binary black hole merger using the

# Einstein Toolkit - <a href="https://doi.org/10.5281/zenodo.155394" class="">https://doi.org/10.5281/zenodo.155394</a>

# as well as the Einstein Toolkit, the Llama multi-block

# infrastructure, the Carpet mesh-refinement driver, the apparent

# horizon finder AHFinderDirect, the TwoPunctures initial data code,

# QuasiLocalMeasures, Cactus, and the McLachlan spacetime evolution

# code, the Kranc code generation package, and the Simulation Factory.

# An appropriate bibtex file, etgw150914.bib, is provided with this

# parameter file.

</pre></div><div class="">and the bibtex file is at&nbsp;<a href="https://einsteintoolkit.org/gallery/bbh/etgw150914.bib" class="">https://einsteintoolkit.org/gallery/bbh/etgw150914.bib</a>.&nbsp;</div><div class=""><br class=""></div><div class=""><div dir="auto" style="caret-color: rgb(0, 0, 0); color: rgb(0, 0, 0); letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none; word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><div dir="auto" style="word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class="">--&nbsp;<br class="">Ian Hinder<br class=""><a href="https://ianhinder.net" class="">https://ianhinder.net</a><br class=""></div></div>

</div>

<br class=""></body></html>