Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WIP Dataset Visualization Script #688

Merged
merged 32 commits into from
Jul 23, 2022
Merged

Conversation

ruisi-su
Copy link
Collaborator

@ruisi-su ruisi-su commented Jun 6, 2022

this is the PR for running the stream vis app locally. vis_app.py is for the streamlit app, and vis_data_card.py is to generate the data cards. The latter file needs to be updated to generate cards for all datasets.

@ruisi-su
Copy link
Collaborator Author

@jason-fries @hakunanatasha @galtay I have generated the initial batch of pdfs for:

  1. datasets that are not broken
  2. datasets that are not local
  3. datasets that are covered by the static metadata .json file

Note that these are very preliminary (e.g., axes names spanning through the chart). Please take a look at them and flag any changes we might need to make. Thanks!

@galtay galtay merged commit 38c10a8 into bigscience-workshop:master Jul 23, 2022
davidkartchner pushed a commit to davidkartchner/biomedical that referenced this pull request Jul 26, 2022
* edit

* unchange

* stats

* added init for ptm

* added proc meta script

* add single

* add vis code

* added vis changes

* remove proc file

* add vis code

* add paper script

* edit scripts

* edit scripts

* add readme

* remove wip code

* add ngram back in

* black and isort vis code

* move

* added pdfs

* added pdfs that are not local and not broken

* added agg pdf
phlobo pushed a commit that referenced this pull request Oct 24, 2024
* first half of dataloader

* updates to plant-phenotype script

* Complete plant-phenotype dataset loader

* unit test fixes

* fix formatting

* Updates to readme to include tutotrials, updated datasets, and misc content tweaks (#730)

Small changes only

* typo (#732)

* WIP Dataset Visualization Script  (#688)

* edit

* unchange

* stats

* added init for ptm

* added proc meta script

* add single

* add vis code

* added vis changes

* remove proc file

* add vis code

* add paper script

* edit scripts

* edit scripts

* add readme

* remove wip code

* add ngram back in

* black and isort vis code

* move

* added pdfs

* added pdfs that are not local and not broken

* added agg pdf

* update streamlit instructions (#734)

update streamlit instructions

* refactor: Refactor PPR implementation to HF hub schema

---------

Co-authored-by: Jason Alan Fries <jfries@stanford.edu>
Co-authored-by: barthfab <88676348+barthfab@users.noreply.github.com>
Co-authored-by: Rosaline Su <rosalinesu@gmail.com>
Co-authored-by: Gabriel Altay <gabriel.altay@gmail.com>
Co-authored-by: Mario Sänger <saengema@informatik.hu-berlin.de>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants