[Users] ET test failures on Stampede

Ian Hinder ian.hinder at aei.mpg.de
Thu Oct 30 10:34:35 CDT 2014


On 30 Oct 2014, at 15:03, Erik Schnetter <schnetter at cct.lsu.edu> wrote:

> I've begun to run the automated tests for the ET on our production
> machines. Things look very good almost everywhere, except on Stampede,
> one of the machines that is most important to us. It seems that there
> are many test failures for GRHydro, and these seem to be caused by
> segfaults. Does anybody volunteer to investigate?

I don't know anything about the problems with GRHydro.  

I was having problems a while back with the current simfactory default version of mvapich2, and TACC support suggested I try the mvapich2-x version.  The problem I saw was the MPI reductions would hang.  I have been using that for a few months with no problems.  The required change to the optionlist is:

> 
> < MPI_DIR  = /opt/apps/intel13/mvapich2/1.9
> ---
>> MPI_DIR  = /home1/apps/intel13/mvapich2-x/2.0b
>> MPI_LIB_DIRS = /home1/apps/intel13/mvapich2-x/2.0b/lib64


At least, this was before the changes to the MPI thorn.  It's possible that this is no longer enough.

Should we change to this version in simfactory for the release?

-- 
Ian Hinder
http://numrel.aei.mpg.de/people/hinder



More information about the Users mailing list