Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Things to do before neurips #46

Open
6 tasks
galv opened this issue Sep 8, 2021 · 1 comment
Open
6 tasks

Things to do before neurips #46

galv opened this issue Sep 8, 2021 · 1 comment

Comments

@galv
Copy link
Collaborator

galv commented Sep 8, 2021

  • Create two separate datasets to distribute, one CC-BY, one CC-BY-SA.
  • Rerun yamnet on the entire dataset. This means we need to make it more performant See yamnet WIP #40
  • Send data to be hand-transcribed.
    • Optionally, do audio-based deduplication first.
  • Add text deata deduplication to the data creation pipeline.
  • Train kaldi and/or nemo models on the dataset. Provide fixes to the dataset, based on this work.
    Adding more as time goes on...
@galv
Copy link
Collaborator Author

galv commented Sep 13, 2021

Poster + 3 minute talk due: Oct 18th

Camera-ready paper due: November 6th

Neurips (dataset release): Early December

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant