[Users] Another restart from checkpoint failure

Frank Loeffler knarf at cct.lsu.edu
Thu Feb 17 07:41:17 CST 2011


Hi,

On Thu, Feb 17, 2011 at 03:24:04PM +0900, Jakob Hansen wrote:
>   #005: H5Zdeflate.c line 133 in H5Z_filter_deflate(): memory allocation
> failed for deflate uncompression
>     major: Resource unavailable
>     minor: No space available for allocation

> Any ideas for possible cause and solution to this?

Looking at these error messages I suggest you first do an h5ls/h5dump to
see if the files are actually ok. If that is so, what I would suspect next
is that reading the files takes more memory than is available, as indicated
by the message above. In this case I suggest to try the workarounds
mentioned in this thread:

http://lists.einsteintoolkit.org/pipermail/users/2011-February/000852.html

If this also doesn't help I would suggest to try again using more
available memory, e.g. using more nodes.

Frank

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 835 bytes
Desc: not available
Url : http://lists.einsteintoolkit.org/pipermail/users/attachments/20110217/e018daf4/attachment.bin 


More information about the Users mailing list