Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

in 4.7.2, parallel I/O not working #1520

Closed
edhartnett opened this issue Nov 12, 2019 · 9 comments
Closed

in 4.7.2, parallel I/O not working #1520

edhartnett opened this issue Nov 12, 2019 · 9 comments

Comments

@edhartnett
Copy link
Contributor

I just noticed that 4.7.2 netcdf_par.h includes ncexternl.h, but this is not installed.

This seems to be fixed in master, but I believe it means that 4.7.2 will not work for parallel I/O, which is a fairly serious problem!

Or am I mistaken?

@ckhroulev
Copy link
Contributor

This is correct (see #1502). NetCDF 4.7.2 cannot be used for parallel I/O, which forced me to downgrade to 4.7.0 since 4.7.1 does not work on macOS.

@edhartnett
Copy link
Contributor Author

OK, bummer. @WardF is the plan to get the next release out soon? Preferably before the AGU?

@edhartnett edhartnett changed the title in 4.7.2, netcdf_par.h includes ncexternl.h, which is not installed in 4.7.2, parallel I/O not working Nov 15, 2019
@edhartnett
Copy link
Contributor Author

@WardF given that parallel I/O is a major feature, and one that NOAA and other HPC users absolutely need, can we soon get a new release where parallel I/O is working?

Thanks

@junwang-noaa
Copy link

May I ask if this parallel I/O is working for compressed netcdf4(HDF5) data? If yes, what is the time line for the release package ready? We are working on the next GFS implementation, it is planned to have the model output history files in compressed netcdf4 format, but currently downstream jobs are suffering the slowness of reading the data. We hope the parallel I/O can help resolving the issue. Thanks.

@edwardhartnett
Copy link
Contributor

Howdy @junwang-noaa parallel I/O can read but not write compressed data. That will be the state of the next release.

However, the HDF5 team has created a filter that allows compression when writing with parallel I/O and they have promised to help me get this going soon after the new year.

But I would like to take a look at one of your history files to see if I can understand what is going on. Can you share one with me?

@junwang-noaa
Copy link

I have two sample data on EMC ftp site at:

https://ftp.emc.ncep.noaa.gov/EIB/sample/gfsv16_compressed_netcdf/gfs.t00z.atmf024.nc
https://ftp.emc.ncep.noaa.gov/EIB/sample/gfsv16_compressed_netcdf/gfs.t00z.sfcf024.nc

The atmf file is compressed and the sfcf is not. The sequential read of the two files takes about 5 minuts, while previous parallel read of the plain binary files takes 20s.

@edwardhartnett
Copy link
Contributor

Thanks I will take a look...

@edwardhartnett
Copy link
Contributor

Let's continue this discussion in #1543

@edhartnett
Copy link
Contributor Author

The parallel I/O build issue has been fixed by the 4.7.3 release, so I will close this issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants