Skip to content

Commit

Permalink
Handling more cases with gersam
Browse files Browse the repository at this point in the history
  • Loading branch information
Yomguithereal committed Mar 20, 2020
1 parent b9ba907 commit 3993939
Show file tree
Hide file tree
Showing 2 changed files with 29 additions and 3 deletions.
22 changes: 19 additions & 3 deletions src/tokenizers/words/gersam.js

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

10 changes: 10 additions & 0 deletions test/tokenizers/words/gersam.js
Original file line number Diff line number Diff line change
Expand Up @@ -53,6 +53,16 @@ describe('gersam', function() {
lang: 'it',
text: 'O.N.U. La vie.est foutue.',
tokens: ['O.N.U.', 'La', 'vie', '.', 'est', 'foutue', '.']
},
{
lang: 'en',
text: 'Mrs. Langley is back from the market.',
tokens: ['Mrs.', 'Langley', 'is', 'back', 'from', 'the', 'market', '.']
},
{
lang: 'fr',
text: 'Les É.U. sont nuls.',
tokens: ['Les', 'É.U.', 'sont', 'nuls', '.']
}
];

Expand Down

0 comments on commit 3993939

Please sign in to comment.