Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

【Hackathon 7th No 43】完善TokenizerFast功能支持 #998

Merged
merged 2 commits into from
Nov 4, 2024

Conversation

yinfan98
Copy link
Contributor

@yinfan98 yinfan98 commented Nov 3, 2024

PaddleNLP 完善 TokenizerFast支持

Copy link

paddle-bot bot commented Nov 3, 2024

你的PR提交成功,感谢你对开源项目的贡献!
请检查PR提交格式和内容是否完备,具体请参考示例模版
Your PR has been submitted. Thanks for your contribution!
Please check its format and content. For this, you can refer to Template and Demo.

@luotao1
Copy link
Collaborator

luotao1 commented Nov 4, 2024

@KB-Ding @DrownFish19 可以review了

@DrownFish19
Copy link
Contributor

此处需要注意,

  • 护航同学也在更新TokenizerFast相关代码,但是两项工作为辅助并行关系,黑客松主要面向单个模型的tokenizerfast,重点在验证精度,护航主要面向TokenizerFast基类模型,重点在提升基础能力。
  • BertTokenizerFast已有PR,可以直接验证,无需添加。

PaddlePaddle/PaddleNLP#9215
PaddlePaddle/PaddleNLP#9353

Copy link
Contributor

@DrownFish19 DrownFish19 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@luotao1 luotao1 merged commit 5aa7c84 into PaddlePaddle:master Nov 4, 2024
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants