tag, branch and/or release code for reproducibility #108
Labels
documentation: Improvements or additions to documentation
next step: This issue should be addressed in Summer 2022
reproducibility: Improve transparency of what was done
People who want to replicate our paper can check out the code at a commit found by inspecting the commit history, but they risk picking the wrong commit.
We should tag, branch and/or release code to make it easy for visitors to pick the right code for reproducibility.
A branch would make it possible to keep updating the README (and to make late additions of code used in the experiments) even after the main branch diverges, e.g. when the main branch changes the steps and/or tools used to carry out the experiment. This branch could be named "bert-base-irish-cased-v1", matching the model name in the Hugging Face model repository.
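As a rough sketch of what this could look like in practice, the script below builds the git commands that pin one commit as both a branch and an annotated tag. The branch name is the one proposed above; the tag name `paper-code-v1`, the remote name `origin` and the commit placeholder are assumptions for illustration only. The tag deliberately uses a different name from the branch, since git treats a ref name shared by a branch and a tag as ambiguous.

```python
import subprocess


def release_commands(commit, branch, tag, remote="origin"):
    """Return git commands that pin `commit` as a branch and an annotated tag."""
    return [
        # Create the long-lived branch at the commit used for the experiments.
        ["git", "branch", branch, commit],
        # Create an annotated tag that marks the same commit permanently.
        ["git", "tag", "-a", tag, commit, "-m", f"Code used for {branch}"],
        # Publish both refs to the remote.
        ["git", "push", remote, branch, tag],
    ]


def run(commands, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for cmd in commands:
        print(" ".join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # "<commit-sha>" is a placeholder: replace it with the real commit.
    # "paper-code-v1" is a hypothetical tag name.
    run(release_commands("<commit-sha>", "bert-base-irish-cased-v1", "paper-code-v1"))
```

With `dry_run=True` the script only prints the commands, so they can be reviewed before anything is created or pushed. A GitHub release could then be created from the tag through the web interface.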
We also need to document the commit or version of wiki-bert-pipeline and opusfilter that we used. (The idea of using a fork in your own GitHub account only works as long as you remember to never hit the "fetch upstream" button or to make any other changes to your fork.)
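One possible way to capture these versions, assuming both tools were used from local git clones, is to read the checked-out commit of each clone and write the result into the README. The helper names and the directory layout in the usage note are hypothetical; `git -C <dir> rev-parse HEAD` is the standard command for printing the current commit of a repository.

```python
import subprocess


def head_commit(repo_dir):
    """Return the full hash of the commit checked out in `repo_dir`."""
    result = subprocess.run(
        ["git", "-C", str(repo_dir), "rev-parse", "HEAD"],
        check=True,
        capture_output=True,
        text=True,
    )
    return result.stdout.strip()


def format_versions(commits):
    """Format a {tool name: commit hash} mapping as one list line per tool."""
    return "\n".join(f"- {name}: {sha}" for name, sha in sorted(commits.items()))
```

For example, `format_versions({name: head_commit(path) for name, path in clones.items()})`, where `clones` maps "wiki-bert-pipeline" and "opusfilter" to wherever they were cloned, yields text that can be pasted into the README. If opusfilter was installed from a package rather than a clone, its installed version number would need to be recorded instead.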