
Pretraining accuracy for retromae v1 #25

Open
wwx13 opened this issue Feb 4, 2024 · 2 comments

Comments

wwx13 commented Feb 4, 2024

Great job!
Hello, I was wondering if you could share the training MLM accuracy of the encoder and decoder. I'm training my own RetroMAE model now.

staoxiao (Owner) commented Feb 4, 2024

Hi, thanks for your interest in our work!
Actually, we didn't measure the MLM accuracy of RetroMAE on any data. We treat the retrieval performance after fine-tuning as the measure of the pre-trained model's quality.

soledad921 commented
Hi, I am also quite interested in this work. Could you tell me how much the loss decreased while you were training the model on Wikipedia? I am trying to pretrain a BERT encoder with RetroMAE v1 on my local dataset (a document pool extracted from Wikipedia), but I have no idea whether my model is fully trained.
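
For anyone else monitoring RetroMAE-style pretraining, here is a minimal sketch of how one might log token-level MLM accuracy alongside the loss. It assumes a standard PyTorch masked-LM setup where positions that were not masked hold `-100` in `labels`; the `enc_logits`/`dec_logits` names in the usage comment are illustrative, not from the RetroMAE codebase.

```python
import torch

def masked_lm_accuracy(logits: torch.Tensor, labels: torch.Tensor,
                       ignore_index: int = -100) -> torch.Tensor:
    """Token-level accuracy computed over masked positions only.

    logits: (batch, seq_len, vocab_size) output of the MLM head.
    labels: (batch, seq_len); unmasked positions hold ignore_index.
    """
    preds = logits.argmax(dim=-1)   # most likely token at each position
    mask = labels != ignore_index   # restrict to the masked positions
    if mask.sum() == 0:
        return torch.tensor(0.0)
    return (preds[mask] == labels[mask]).float().mean()

# Hypothetical usage inside a training loop: log the encoder and
# decoder MLM streams separately, since they are trained with
# different masking ratios.
# enc_acc = masked_lm_accuracy(enc_logits, enc_labels)
# dec_acc = masked_lm_accuracy(dec_logits, dec_labels)
```

Watching these accuracy curves per step, together with the loss, is usually a more reliable "is it trained yet" signal than any single loss value, since the absolute loss depends on the masking ratio and vocabulary size.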
