move tokenizer API to separate crate #1767

Merged: PSeitz merged 1 commit into main from tokenizer_api on Jan 9, 2023
PSeitz (Contributor) commented Jan 8, 2023

closes #1766

Finding compatible tantivy tokenizers is currently a frustrating experience,
since they need to be updated for each tantivy version. That's unnecessary,
since the tokenizer API is rather stable anyway.
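
To illustrate the idea, here is a minimal sketch of what a small, separately versioned tokenizer API could look like and how a third-party tokenizer would build against it. Everything in it is an assumption made for illustration: `Token`, `TokenStream`, `Tokenizer`, `WhitespaceTokenizer`, and `collect_texts` are hypothetical stand-ins defined locally, not the definitions that this PR moves into the new crate.

```rust
// Hypothetical stand-ins for what a small tokenizer API crate could export.
// Illustrative only; not copied from tantivy.

/// A single token produced from the input text.
#[derive(Debug, Clone, Default, PartialEq)]
pub struct Token {
    /// Byte offset of the token's first byte in the input.
    pub offset_from: usize,
    /// Byte offset one past the token's last byte.
    pub offset_to: usize,
    /// Position of the token in the stream, counted in tokens.
    pub position: usize,
    /// The token text.
    pub text: String,
}

/// A cursor over tokens: call `advance`, then read `token`.
pub trait TokenStream {
    fn advance(&mut self) -> bool;
    fn token(&self) -> &Token;
}

/// Turns text into a token stream.
pub trait Tokenizer {
    fn token_stream<'a>(&self, text: &'a str) -> Box<dyn TokenStream + 'a>;
}

// What a third-party tokenizer crate would contain. It only needs the
// three items above, not the rest of the search engine.

pub struct WhitespaceTokenizer;

struct WhitespaceTokenStream<'a> {
    text: &'a str,
    cursor: usize, // byte offset where the next scan starts
    next_position: usize,
    token: Token,
}

impl<'a> TokenStream for WhitespaceTokenStream<'a> {
    fn advance(&mut self) -> bool {
        let rest = &self.text[self.cursor..];
        // Skip leading whitespace; stop if nothing else is left.
        let start = match rest.find(|c: char| !c.is_whitespace()) {
            Some(i) => self.cursor + i,
            None => return false,
        };
        // The token ends at the next whitespace or at the end of input.
        let end = self.text[start..]
            .find(char::is_whitespace)
            .map(|i| start + i)
            .unwrap_or(self.text.len());
        self.token = Token {
            offset_from: start,
            offset_to: end,
            position: self.next_position,
            text: self.text[start..end].to_string(),
        };
        self.next_position += 1;
        self.cursor = end;
        true
    }

    fn token(&self) -> &Token {
        &self.token
    }
}

impl Tokenizer for WhitespaceTokenizer {
    fn token_stream<'a>(&self, text: &'a str) -> Box<dyn TokenStream + 'a> {
        Box::new(WhitespaceTokenStream {
            text,
            cursor: 0,
            next_position: 0,
            token: Token::default(),
        })
    }
}

/// Drains a tokenizer's stream and returns the token texts.
pub fn collect_texts(tokenizer: &dyn Tokenizer, text: &str) -> Vec<String> {
    let mut stream = tokenizer.token_stream(text);
    let mut out = Vec::new();
    while stream.advance() {
        out.push(stream.token().text.clone());
    }
    out
}
```

In this sketch the tokenizer implementation depends only on the three API items, so a release of the main crate that leaves them unchanged would not force a new release of the tokenizer.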

PSeitz force-pushed the tokenizer_api branch 3 times, most recently from c34a9e3 to 8ab60c1, on January 8, 2023 10:19
PSeitz requested a review from fulmicoton on January 8, 2023 10:19
codecov-commenter commented Jan 8, 2023

Codecov Report

Merging #1767 (d7525a8) into main (4f9efe6) will increase coverage by 0.00%.
The diff coverage is 82.85%.

@@           Coverage Diff           @@
##             main    #1767   +/-   ##
=======================================
  Coverage   94.07%   94.07%           
=======================================
  Files         279      280    +1     
  Lines       52284    52285    +1     
=======================================
+ Hits        49185    49189    +4     
+ Misses       3099     3096    -3     

| Impacted Files | Coverage Δ | |
|---|---|---|
| src/tokenizer/mod.rs | 95.61% <ø> (ø) | |
| src/tokenizer/ngram_tokenizer.rs | 98.31% <ø> (ø) | |
| src/tokenizer/tokenizer.rs | 100.00% <ø> (+11.76%) | ⬆️ |
| tokenizer-api/src/lib.rs | 82.85% <82.85%> (ø) | |
| bitpacker/src/bitpacker.rs | 98.98% <0.00%> (-1.02%) | ⬇️ |
| src/schema/schema.rs | 98.91% <0.00%> (+0.13%) | ⬆️ |
| src/fastfield/multivalued/mod.rs | 98.45% <0.00%> (+1.15%) | ⬆️ |


PSeitz merged commit 514d23a into main on Jan 9, 2023
PSeitz deleted the tokenizer_api branch on January 9, 2023 05:37
This was referenced Jan 13, 2023
Hodkinson pushed a commit to Hodkinson/tantivy that referenced this pull request Jan 30, 2023
Development

Successfully merging this pull request may close these issues.

Move Tokenizer API to own crate