[Users] Meudon_Mag_NS parfile output

Shamim Haque 1910511 shamims at iiserb.ac.in
Mon Oct 16 02:30:55 CDT 2023


Hi Zach,

Thank you for the suggestions. I'll raise a ticket for the first issue, and
check out the second one.

Regards
Shamim Haque
Senior Research Fellow (SRF)
Department of Physics
IISER Bhopal

On Sun, Oct 15, 2023 at 12:29 AM Zach Etienne <zachetie at gmail.com> wrote:

> Hi Shamim,
>
> Thank you for providing additional information on the issue. Let's address
> your questions one by one:
>
> * Memory Usage
> If the error message is about "grid structure consistency," that usually
> points to an issue setting up the Carpet grids rather than a lack of
> memory. Maybe Carpet is having a problem breaking up the grids beyond 8
> cores? I suggest you submit a ticket to
> https://bitbucket.org/einsteintoolkit/tickets/ with the full instructions
> on reproducing the problem.
>
> * Invalid MIT-MAGIC-COOKIE-1 Key
> You're correct in your assumption that the Invalid MIT-MAGIC-COOKIE-1 key
> issue might not be directly causing the problem you're facing with the BNSM
> runs, especially since the simulations ran successfully despite the
> warning. However, I'd still recommend addressing this issue to eliminate it
> as a potential confounding factor.
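
One way to rule the cookie warning out, as a minimal sketch: launch the run with the X11 variables removed from the environment, so that nothing in the job tries to contact an X server. This assumes the message is printed by a library attempting to open the display named in DISPLAY, which I have not verified for these runs. The helper names below are hypothetical.

```python
import os
import subprocess


def env_without_x11(env):
    """Return a copy of env with the X11 display variables removed."""
    return {k: v for k, v in env.items() if k not in ("DISPLAY", "XAUTHORITY")}


def run_without_x11(cmd):
    """Run cmd (a list of strings) with DISPLAY and XAUTHORITY unset.

    Assumption: nothing in the job needs graphical output.
    """
    return subprocess.run(cmd, env=env_without_x11(os.environ), check=False)
```

For example, `run_without_x11(["mpirun", "-np", "8", "./cactus_sim", "bbig.par"])`, where the command line is only a placeholder. If the warning disappears and the grid structure error stays, the two issues are independent.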
>
> -Zach
>
> *     *     *
> Zachariah Etienne
> Assoc. Prof. of Physics, U. of Idaho
> Adjunct Assoc. Prof. of Physics & Astronomy, West Virginia U.
> https://etienneresearch.com
> https://blackholesathome.net
>
>
> On Sat, Oct 14, 2023 at 6:36 AM Shamim Haque 1910511 <shamims at iiserb.ac.in>
> wrote:
>
>> Hi Zach,
>>
>> I retried it on a workstation with a fairly good configuration (80 threads,
>> 256 GB RAM), where I ran test BNSM simulations. The problem remains the
>> same.
>>
>> I cannot use more than 8 procs (num-threads 1) for this parfile.
>> Otherwise, it says, 'grid structure consistency check failed'. So, I
>> guess this simulation is not demanding too much memory. But is there a
>> better way to check how much memory is expected?
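
For a rough lower bound on the memory question, a back-of-the-envelope sketch: count the grid points on every refinement level and multiply by the number of grid functions, the number of time levels, and 8 bytes per double. All inputs are assumptions to be read off the parfile and the thorn list; the function name is hypothetical, and ghost zones, buffer zones, and MPI overhead are ignored unless they are already included in the level shapes.

```python
def estimate_gridfunction_bytes(points_per_level, n_gridfunctions,
                                timelevels=3, bytes_per_value=8):
    """Rough lower bound on grid-function storage, in bytes.

    points_per_level: list of (nx, ny, nz) tuples, one per refinement level.
    n_gridfunctions:  assumed number of evolved/stored grid functions.
    timelevels:       assumed time levels kept per grid function.
    """
    total_points = sum(nx * ny * nz for nx, ny, nz in points_per_level)
    return total_points * n_gridfunctions * timelevels * bytes_per_value


# Hypothetical setup: 5 levels of 128^3 points and 200 grid functions.
levels = [(128, 128, 128)] * 5
print(estimate_gridfunction_bytes(levels, 200) / 2**30, "GiB")
```

The numbers in the usage lines are made up and are not taken from bbig.par. Comparing such an estimate with the available RAM only shows whether memory is plausibly the bottleneck.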
>>
>> Secondly, I checked the error files from my old BNSM runs. Those files
>> also contain the following lines indicating Invalid MIT-MAGIC-COOKIE-1 key:
>>
>> + mpirun --use-hwthread-cpus -np 10 /home/astro208/simulations/t1.4_had_1.4M/SIMFACTORY/exe/cactus_sim -L 3 /home/astro208/simulations/t1.4_had_1.4M/output-0000/t1.4_had_1.4M.par
>> Invalid MIT-MAGIC-COOKIE-1 key
>> --------------------------------------------------------------------------
>> WARNING: No preset parameters were found for the device that Open MPI
>> detected:
>>
>>   Local host:            astro
>>   Device name:           irdma0
>>   Device vendor ID:      0x8086
>>   Device vendor part ID: 14289
>>
>> Default device parameters will be used, which may result in lower
>> performance.  You can edit any of the files specified by the
>> btl_openib_device_param_files MCA parameter to set values for your
>> device.
>>
>> NOTE: You can turn off this warning by setting the MCA parameter
>>       btl_openib_warn_no_device_params_found to 0.
>> --------------------------------------------------------------------------
>> No OpenFabrics connection schemes reported that they were able to be
>> used on a specific port.  As such, the openib BTL (OpenFabrics
>> support) will be disabled for this port.
>>
>>   Local host:           astro
>>   Local device:         irdma1
>>   Local port:           1
>>   CPCs attempted:       udcm
>> --------------------------------------------------------------------------
>> Open MPI failed an OFI Libfabric library call (fi_endpoint).  This is highly
>> unusual; your job may behave unpredictably (and/or abort) after this.
>>
>>   Local host: astro
>>   Location: mtl_ofi_component.c:629
>>   Error: Unspecified error (256)
>> --------------------------------------------------------------------------
>>
>> These simulations ran successfully. So the Invalid MIT-MAGIC-COOKIE-1 key
>> message could be a separate issue and may not be the source of this
>> particular problem, but I could be totally wrong here.
>>
>> Please let me know your thoughts on this.
>>
>> Regards
>> Shamim Haque
>> Senior Research Fellow (SRF)
>> Department of Physics
>> IISER Bhopal
>>
>> On Thu, Oct 12, 2023 at 8:15 PM Zach Etienne <zachetie at gmail.com> wrote:
>>
>>> Hi Shamim,
>>>
>>> Invalid MIT-MAGIC-COOKIE-1 key: This is related to X11 forwarding and
>>> authentication. The MIT-MAGIC-COOKIE-1 is an authentication scheme used by
>>> X11, the Linux windowing system. When you're trying to run a program that
>>> requires graphical output on a remote machine, the X11 system uses these
>>> "magic cookies" to authenticate the user. If there's a mismatch or the key
>>> is invalid, you will be denied permission.
>>>
>>> We believe the segmentation fault is probably due to running a parameter
>>> file on a computer that doesn't have enough memory. The error file seemed
>>> to indicate running on a laptop.
>>>
>>> -Zach
>>>
>>> *     *     *
>>> Zachariah Etienne
>>> Assoc. Prof. of Physics, U. of Idaho
>>> Adjunct Assoc. Prof. of Physics & Astronomy, West Virginia U.
>>> https://etienneresearch.com
>>> https://blackholesathome.net
>>>
>>>
>>> On Thu, Oct 12, 2023 at 6:34 AM Shamim Haque 1910511 <
>>> shamims at iiserb.ac.in> wrote:
>>>
>>>> Hello all,
>>>>
>>>> I am trying to use the bbig.par parameter file to extract the metric
>>>> and thermodynamic information for an isolated star ID from Lorene. This
>>>> parfile is available in the Meudon_Mag_NS/par folder.
>>>>
>>>> I tried this par file with the given Lorene ID. It is supposed to exit
>>>> after iter 0 with IO outfiles. However, it does not give the requested
>>>> outputs upon completion of the simulation. I can see the Lorene information
>>>> is read correctly in the out file, but I am unable to identify the problem.
>>>>
>>>> I need some help with this. I have attached the parfile, ID, outfile
>>>> and error file for reference.
>>>>
>>>> Regards
>>>> Shamim Haque
>>>> Senior Research Fellow (SRF)
>>>> Department of Physics
>>>> IISER Bhopal
>>>> _______________________________________________
>>>> Users mailing list
>>>> Users at einsteintoolkit.org
>>>> http://lists.einsteintoolkit.org/mailman/listinfo/users
>>>>
>>>