-
Notifications
You must be signed in to change notification settings - Fork 52
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support Biological Observation Matrix (BIOM) files #3715
Conversation
Interesting, what is going on here? |
The local HDF5 libs are likely outdated in the testing container, so they need to be rebuilt. I've honestly never bothered checking how it was build. After installing $ h5cc -showconfig
SUMMARY OF THE HDF5 CONFIGURATION
=================================
General Information:
-------------------
HDF5 Version: 1.10.4
Configured on: Mon, 13 Apr 2020 12:15:08 +0000
Configured by: Debian
Host system: x86_64-pc-linux-gnu
Uname information: Debian
Byte sex: little-endian
Installation point: /usr
Flavor name: serial
[...] This sounds like a problem for Monday Chris. |
Monday Chris. Checkout: https://github.com/HDFGroup/hdf5/releases |
Or, Monday Chris, you might consider pinning the python wrappers on the version of the Ubunu binaries. |
Ok. Can replicate locally. So that is a start. The testing container reports this version of hdf5: |
[WIP]
The HDF5 lib is currently at Different versions of the
|
Guh, auto rebase seems to have messed some things up. Will clean up tomorrow. |
c40b1a9
to
5b30f3c
Compare
Nudged the poetry.lock to force a rebuild of |
Can prob utilize the virtual envs to very quickly address: |
Part of the pitch:
This PR introduces support for BIOM files. It validates them by attempting to parse them via the
biom-format
library.An explicit check is made that all uploaded BIOM are in hd5f format. This is the binary compressed version and I'd argue it's the only one we should support.
Once this is merged:- #3713BIOM file validation is hooked to a larger worker queue.
Isolated virtualenv
After some (painful) debugging I've refactored the way the BIOM parser is called. This alleviates the problem with incorrect versioned HDF5 libraries being pre-loaded.
The base container now also generates a 'biom' virtual environment under
/opt/virtualenvs
. Apart from fixing the import bug, it has the benefit of isolating the parser libraries from the web app.