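In the pt (pretraining) stage, I noticed that the code processes pretraining data (for example, books) with group by length. I also looked at the samples in medical_book_zh.json, and the samples produced this way seem fairly low quality. Won't this affect incremental pretraining?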
Yes, it does have an effect. You can skip the grouping.
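To make the trade-off concrete, here is a minimal sketch of the two options. It assumes the grouping in question is the common concatenate-then-chunk pattern used in causal language model pretraining scripts; the thread does not show the project's actual code, so the function names `group_texts` and `keep_separate`, the `block_size` parameter, and the toy token IDs are all illustrative assumptions.

```python
from typing import Dict, List

Batch = Dict[str, List[List[int]]]


def group_texts(examples: Batch, block_size: int) -> Batch:
    """Concatenate all token sequences, then cut into fixed-size blocks.

    Blocks can start or end mid-document and can mix unrelated documents;
    the remainder that does not fill a full block is dropped.
    """
    concatenated = {k: [tok for seq in v for tok in seq] for k, v in examples.items()}
    total_length = len(next(iter(concatenated.values())))
    total_length = (total_length // block_size) * block_size
    return {
        k: [toks[i:i + block_size] for i in range(0, total_length, block_size)]
        for k, toks in concatenated.items()
    }


def keep_separate(examples: Batch, block_size: int) -> Batch:
    """No grouping: truncate each document on its own to at most block_size tokens."""
    return {k: [seq[:block_size] for seq in v] for k, v in examples.items()}


# Hypothetical token IDs standing in for three tokenized documents.
docs = {"input_ids": [[1, 2, 3], [4, 5, 6, 7, 8], [9, 10]]}

grouped = group_texts(docs, block_size=4)
separate = keep_separate(docs, block_size=4)

print(grouped["input_ids"])   # [[1, 2, 3, 4], [5, 6, 7, 8]]
print(separate["input_ids"])  # [[1, 2, 3], [4, 5, 6, 7], [9, 10]]
```

In this toy run, grouping yields full blocks but the first block mixes two documents and the last document is dropped, while skipping the grouping keeps document boundaries at the cost of shorter, uneven samples that need padding.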