[Users] Meudon_Mag_NS parfile output

Zach Etienne zachetie at gmail.com
Sat Oct 14 13:59:46 CDT 2023


Hi Shamim,

Thank you for providing additional information on the issue. Let's address
your questions one by one:

* Memory Usage
If the error message is about "grid structure consistency," that usually
points to an issue setting up the Carpet grids rather than a lack of
memory. Maybe Carpet is having a problem breaking up the grids beyond 8
cores? I suggest you submit a ticket to
https://bitbucket.org/einsteintoolkit/tickets/ with the full instructions
on reproducing the problem.
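
On your question about checking how much memory to expect: a back-of-the-envelope
estimate can be sketched as below. This is only a rough sketch, not something
Carpet reports. It assumes cubic refinement levels of equal size, double-precision
grid functions, and the same number of time levels for every variable. The function
name and all the numbers in the example call are hypothetical placeholders; take
the real values from your parfile and the Carpet output.

```python
def estimate_grid_memory_gib(points_per_dim, num_levels, num_variables,
                             time_levels=3, ghost_zones=3, bytes_per_value=8):
    """Rough memory estimate (in GiB) for grid functions on a box-in-box grid.

    Assumptions (not taken from any specific parfile):
      - every refinement level is a cube with `points_per_dim` interior points
        per dimension, padded by `ghost_zones` points on each side;
      - every variable is stored as a double (8 bytes) on `time_levels` levels;
      - per-process ghost-zone duplication and scratch storage are ignored,
        so the real footprint will be larger.
    """
    padded = points_per_dim + 2 * ghost_zones
    points_per_level = padded ** 3
    total_values = points_per_level * num_levels * num_variables * time_levels
    return total_values * bytes_per_value / 2**30


# Hypothetical example: 7 levels of 128^3 points and 100 grid functions.
print(f"{estimate_grid_memory_gib(128, 7, 100):.1f} GiB")
```

Comparing that number with the RAM on the machine gives a quick sanity check
before blaming memory for a failure.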

* Invalid MIT-MAGIC-COOKIE-1 Key
You're correct in your assumption that the Invalid MIT-MAGIC-COOKIE-1 key
issue might not be directly causing the problem you're facing with the BNSM
runs, especially since the simulations ran successfully despite the
warning. However, I'd still recommend addressing this issue to eliminate it
as a potential confounding factor.
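
If the warning comes from a stale DISPLAY variable inherited from an SSH session
with X forwarding (an assumption on my part; I have not confirmed that on your
machine), one way to rule it out is to launch the run with DISPLAY removed from
the environment. A minimal Python sketch follows; the helper names and the
command in the usage comment are hypothetical:

```python
import os
import subprocess


def env_without_display(base_env=None):
    """Return a copy of the environment with DISPLAY removed.

    Without DISPLAY, child processes have no X server address to contact,
    so they should not attempt X11 cookie authentication at all.
    """
    env = dict(os.environ if base_env is None else base_env)
    env.pop("DISPLAY", None)
    return env


def run_without_display(command):
    """Run `command` (a list of arguments) with DISPLAY unset."""
    return subprocess.run(command, env=env_without_display(), check=False)


# Hypothetical usage; substitute your own executable and parfile:
# run_without_display(["mpirun", "-np", "8", "./cactus_sim", "bbig.par"])
```

If the message disappears with DISPLAY unset, it was unrelated to the
simulation itself.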

-Zach

*     *     *
Zachariah Etienne
Assoc. Prof. of Physics, U. of Idaho
Adjunct Assoc. Prof. of Physics & Astronomy, West Virginia U.
https://etienneresearch.com
https://blackholesathome.net


On Sat, Oct 14, 2023 at 6:36 AM Shamim Haque 1910511 <shamims at iiserb.ac.in>
wrote:

> Hi Zach,
>
> I retried it on a workstation with a fairly good configuration (80 threads,
> 256 GB RAM), where I have run test BNSM simulations. The problem remains the
> same.
>
> I cannot use more than 8 procs (num-threads 1) for this parfile.
> Otherwise, it says, 'grid structure consistency check failed'. So, I
> guess this simulation is not demanding too much memory. But is there a
> better way to check how much memory is expected?
>
> Secondly, I checked the error files from my old BNSM runs. Those files
> also contain the following lines indicating Invalid MIT-MAGIC-COOKIE-1 key:
>
> + mpirun --use-hwthread-cpus -np 10 /home/astro208/simulations/t1.4_had_1.4M/SIMFACTORY/exe/cactus_sim -L 3 /home/astro208/simulations/t1.4_had_1.4M/output-0000/t1.4_had_1.4M.par
> Invalid MIT-MAGIC-COOKIE-1 key
> --------------------------------------------------------------------------
> WARNING: No preset parameters were found for the device that Open MPI
> detected:
>
>   Local host:            astro
>   Device name:           irdma0
>   Device vendor ID:      0x8086
>   Device vendor part ID: 14289
>
> Default device parameters will be used, which may result in lower
> performance.  You can edit any of the files specified by the
> btl_openib_device_param_files MCA parameter to set values for your
> device.
>
> NOTE: You can turn off this warning by setting the MCA parameter
>       btl_openib_warn_no_device_params_found to 0.
> --------------------------------------------------------------------------
> No OpenFabrics connection schemes reported that they were able to be
> used on a specific port.  As such, the openib BTL (OpenFabrics
> support) will be disabled for this port.
>
>   Local host:           astro
>   Local device:         irdma1
>   Local port:           1
>   CPCs attempted:       udcm
> --------------------------------------------------------------------------
> Open MPI failed an OFI Libfabric library call (fi_endpoint).  This is highly
> unusual; your job may behave unpredictably (and/or abort) after this.
>
>   Local host: astro
>   Location: mtl_ofi_component.c:629
>   Error: Unspecified error (256)
> --------------------------------------------------------------------------
>
> These simulations ran successfully. So the Invalid MIT-MAGIC-COOKIE-1
> warning is probably a separate issue and not the source of this particular
> problem, but I could be totally wrong here.
>
> Please let me know your thoughts on this.
>
> Regards
> Shamim Haque
> Senior Research Fellow (SRF)
> Department of Physics
> IISER Bhopal
>
> On Thu, Oct 12, 2023 at 8:15 PM Zach Etienne <zachetie at gmail.com> wrote:
>
>> Hi Shamim,
>>
>> Invalid MIT-MAGIC-COOKIE-1 key: This is related to X11 forwarding and
>> authentication. MIT-MAGIC-COOKIE-1 is an authentication scheme used by
>> X11, the windowing system on Linux and other Unix-like machines. When you
>> run a program that requires graphical output on a remote machine, X11 uses
>> these "magic cookies" to authenticate the user. If there's a mismatch or
>> the key is invalid, the connection to the display is refused.
>>
>> We believe the segmentation fault is probably due to running a parameter
>> file on a computer that doesn't have enough memory. The error file seemed
>> to indicate running on a laptop.
>>
>> -Zach
>>
>> *     *     *
>> Zachariah Etienne
>> Assoc. Prof. of Physics, U. of Idaho
>> Adjunct Assoc. Prof. of Physics & Astronomy, West Virginia U.
>> https://etienneresearch.com
>> https://blackholesathome.net
>>
>>
>> On Thu, Oct 12, 2023 at 6:34 AM Shamim Haque 1910511 <
>> shamims at iiserb.ac.in> wrote:
>>
>>> Hello all,
>>>
>>> I am trying to use the bbig.par parameter file to extract the metric and
>>> thermodynamic information for an isolated star ID from Lorene. This parfile
>>> is available in the Meudon_Mag_NS/par folder.
>>>
>>> I tried this parfile with the given Lorene ID. It is supposed to exit
>>> after iteration 0 and write the IO output files. However, it does not
>>> produce the requested outputs when the simulation completes. I can see in
>>> the out file that the Lorene information is read correctly, but I am
>>> unable to find the problem.
>>>
>>> I need some help with this. I have attached the parfile, ID, outfile and
>>> error file for reference.
>>>
>>> Regards
>>> Shamim Haque
>>> Senior Research Fellow (SRF)
>>> Department of Physics
>>> IISER Bhopal
>>> _______________________________________________
>>> Users mailing list
>>> Users at einsteintoolkit.org
>>> http://lists.einsteintoolkit.org/mailman/listinfo/users
>>>
>>