Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Split our datasets #26

Merged
merged 2 commits into from
Nov 26, 2023

Conversation

Grutschus
Copy link
Owner

closes #22

Added a split_data script that can either be called from the command line or from another module.
Added a notebook to show how we split the dataset and to show some information about the data.

Decided to stratify the "category" column so that we have similar proportions of ADL in all datasets. It is possible that we now have the same scenario from different angles in our dataset... don't know if that's a problem. Let's discuss!

@Grutschus Grutschus added this to the Development freeze milestone Nov 22, 2023
@Grutschus Grutschus linked an issue Nov 22, 2023 that may be closed by this pull request
@emiresenov emiresenov merged commit f4de9a4 into main Nov 26, 2023
1 check passed
@Grutschus Grutschus deleted the 22-separate-our-annotation-file-into-train-val-test branch November 26, 2023 19:07
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Separate our annotation file into train / val / test
2 participants