[Users] Setting up ETK on an AMD Rome cluster

Roland Haas rhaas at illinois.edu
Thu Mar 2 11:53:59 CST 2023


Hello Vaishak,

Sorry for the delay, and thank you for including the various log files.

I have been running on a new AMD based system (NCSA Delta, Milan, not
Rome) during the last week (with Vectors active), though it is a
slightly older ET code (no changes to Vectors though). I also ran on
SDSC Expanse (Rome, Epyc 7742) for the ET testsuite for the 2022_11
release (http://einsteintoolkit.org/testsuite_results/index.php)
without SEGFAULT failures.
 
This unfortunately makes debugging the issue that you are facing harder.

One (possible) issue could be related to using  -march=native in you
compilation flags. Since this instructs GCC to compile for the CPU
architecture it finds itself running on, I would double check that
indeed the login nodes on sonic use the same CPU as the compute nodes.

Yours,
Roland

> Dear All,
> 
> Greetings from India. I am trying to get the ETK working on an AMD Rome
> powered supercomputer at ICTS, India. I am working with gcc (11.1.0,
> 12.2.0) and openmpi. The compilation is successful but every one of the
> tests and runs fails due to seg faults at the vectorization stage. On
> recompiling the toolkit without vectorization, the tests run  (except for
> one test of ML_BSSN which fails due to a relative error ~ 1e-14). I am
> attaching the backtrace (from the gallery BBH run), make log (with
> vectorization) and the optionlist herewith.
> 
> Requesting help!
> 
> 
> With regards
> 



-- 
My email is as private as my paper mail. I therefore support encrypting
and signing email messages. Get my PGP key from http://keys.gnupg.net.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
Url : http://lists.einsteintoolkit.org/pipermail/users/attachments/20230302/8cf10e3c/attachment-0001.bin 


More information about the Users mailing list