-
Notifications
You must be signed in to change notification settings - Fork 23
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
feat: FromPretrained to load tokenizer directly from HF (#27)
* add LoadTokenizerFromHuggingFace function to load tokenizer directly from huggingface, update README.md * using channels as unbuffered channel, update channel names and minimize some approaches * fix: rename new func to FromPretrained, improve example * fix: clean up downloadFile * fix: concurrency issues in case of an error * fix: make optional params optional * fix: cache path has to be model specific * add unit tests for `FromPretrained` * migrate to table driven tests, unify/simplify test cases * fix: clean up nits --------- Co-authored-by: Resul Berkay Ersoy <resul.ersoy@trendyol.com> Co-authored-by: Daulet Zhanguzin <daulet@zhanguzin.kz>
- Loading branch information
1 parent
0d469f8
commit 9c972d9
Showing
4 changed files
with
369 additions
and
4 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.