Allow overriding raw data files with processed derived data files in load function #42

Closed
glopesdev opened this issue Oct 13, 2021 · 1 comment · Fixed by #50

Comments

@glopesdev
Contributor

glopesdev commented Oct 13, 2021

We would like to allow for curation and cleaning of low-level data while keeping the original raw data immutable. Ideally, downstream analysis scripts would not need any major changes to consume the cleaned-up data.

Ideally, the aeon file system would support this transparently: the load method would accept a list of dataset paths that are used for file resolution in priority order (paths in the list are searched in FIFO order).
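
The priority-ordered resolution described above could be sketched roughly as follows. This is only an illustration, not the aeon API: `resolve_files` is a hypothetical helper, and it assumes that "FIFO order" means the first path in the list that contains a given file wins, with files matched by their path relative to each root.

```python
from pathlib import Path


def resolve_files(roots, pattern):
    """Return one file per relative path, preferring earlier roots.

    Hypothetical helper: `roots` is a priority-ordered list of dataset
    paths and `pattern` is a glob pattern such as "*.csv".
    """
    resolved = {}
    for root in roots:
        root = Path(root)
        for path in sorted(root.rglob(pattern)):
            key = path.relative_to(root)
            # setdefault keeps the first match, so earlier roots
            # take priority over later ones (assumed convention).
            resolved.setdefault(key, path)
    # Return the chosen files in a stable, sorted order.
    return [resolved[key] for key in sorted(resolved)]
```

Under this assumed convention, passing a processed-data root ahead of the raw-data root would return cleaned files where they exist and fall back to the raw files otherwise, so the raw data is never modified.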

We need to consider whether and how to make this process traceable, either by adding an attribute identifying the source of each dataset chunk, or by some other means of logging.
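
One possible shape for this traceability, combining both options, is sketched below. It is an assumption-laden illustration rather than a proposed design: `load_with_source`, the `"source"` attribute name, the logger name, and the use of CSV files are all hypothetical stand-ins.

```python
import csv
import logging

# Hypothetical logger name, used only for this sketch.
logger = logging.getLogger("load_sketch")


def load_with_source(chosen):
    """Load rows from resolved files and tag each row with its origin.

    Hypothetical helper: `chosen` is an iterable of (root, path) pairs,
    where `root` is the dataset path the file was resolved from.
    """
    rows = []
    for root, path in chosen:
        # Option 1: log which dataset path supplied each file.
        logger.info("loading %s from %s", path, root)
        with open(path, newline="") as f:
            for row in csv.DictReader(f):
                # Option 2: attach the source as an attribute of the data.
                row["source"] = str(root)
                rows.append(row)
    return rows
```

Tagging the data itself keeps the provenance available to downstream analysis, while logging leaves the loaded records unchanged; which trade-off fits better is the open question raised above.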

@glopesdev glopesdev added the feature New planned feature label Oct 13, 2021
@glopesdev glopesdev self-assigned this Oct 13, 2021
glopesdev added a commit that referenced this issue Oct 13, 2021
glopesdev added a commit that referenced this issue Nov 4, 2021
@jkbhagatio
Member

PR merged, closing

iamamutt pushed a commit that referenced this issue Feb 2, 2022
* Allow load to resolve priority list of paths

Fixes #42

* Add support for parsing multianimal position data

Fixes #55

* add plot for experiment-level average time spent at different locations of the arena

* update docker stuff

* Merge remote-tracking branch 'origin/datajoint_pipeline'

* python package changes for docker install

* docker updates

* fix context path

* fix dockerfile path

* docker ghcr.io PAT instructions for private repo

* use datajoint_pipeline for actions branch

* pull latest conda/mamba image

* Docker updates (#7)

* new branch avoid build

* changes to docker dev yml

* user group settings

* minor updates

* update docker image

* docker container paths permissions fixes

* minor rename

* refactor docker for aeon_db worker

* incorporate thinhs changes

* get env for linux amd64

* image updates

* add script for sudo

* process root dirs separately then concat

* some keys not in json file

* docker env updates

* update sciviz deployment

* confirm API works, revert back the temporary changes in `chunkdata`

* gh actions github context

* change cache type

* remove old cache

* devcontainer updates

* dockerfile updates

* docker updates

* docker updates

* Docker updates (#61)

Updates to docker and trigger gh actions

* test action

* env instructions

* fix repo owner

* add locked envs

* Docker updates

Updates to docker and trigger gh actions

* docker updates

* docker updates

* rm tmp dir gh actions

* rm tmp dir gh actions

* add back api fix

* gh action

* none path

* docker updates

* Docker ingestion tested (#64)

* docker updates

* try without cache

* debug buildx

* ingestion updates

Co-authored-by: glopesdev <glopesdev@users.noreply.github.com>
Co-authored-by: Thinh Nguyen <thinh@vathes.com>
Co-authored-by: Jai Bhagat <jkbhagatio@gmail.com>