Fix typo and simplify wording in OnlineCountVectorizer docstring #1802

Merged
merged 1 commit into from
Feb 10, 2024
4 changes: 2 additions & 2 deletions bertopic/vectorizers/_online_cv.py
@@ -27,8 +27,8 @@ class OnlineCountVectorizer(CountVectorizer):
 the previous bag-of-words should be decreased. For example,
 a value of `.1` will decrease the frequencies in the bag-of-words
 matrix with 10% at each iteration.
-delete_min_df: Delete words eat each iteration from its vocabulary
-that do not exceed a minimum frequency.
+delete_min_df: Delete words at each iteration from its vocabulary
+that are below a minimum frequency.
 This will keep the resulting bag-of-words matrix small
 such that it does not explode in size with increasing
 vocabulary. If `decay` is None then this equals `min_df`.
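To make the interaction between `decay` and `delete_min_df` concrete, here is a minimal pure-Python sketch of the behavior the docstring describes. It is not BERTopic's implementation: the helper `update_counts`, the whitespace tokenization, and the choice to treat "below a minimum frequency" as a strict comparison are all assumptions made for illustration.

```python
from collections import Counter


def update_counts(counts, new_docs, decay=None, delete_min_df=None):
    """Hypothetical helper: apply one online update to a word-frequency table."""
    # Scale down the previous frequencies; decay=0.1 would keep 90% of each count.
    if decay is not None:
        counts = {word: freq * (1 - decay) for word, freq in counts.items()}
    else:
        counts = dict(counts)

    # Add the frequencies from the new batch (naive whitespace tokenization).
    for doc in new_docs:
        for word, n in Counter(doc.lower().split()).items():
            counts[word] = counts.get(word, 0.0) + n

    # Drop words whose decayed frequency is below the threshold
    # (assumption: words exactly at the threshold are kept).
    if delete_min_df is not None:
        counts = {w: f for w, f in counts.items() if f >= delete_min_df}
    return counts


# Three iterations with a deliberately large decay so pruning shows up quickly.
counts = {}
for batch in (["apple banana apple"], ["banana cherry"], ["cherry"]):
    counts = update_counts(counts, batch, decay=0.5, delete_min_df=1)
```

With these inputs, `apple` and `banana` stop appearing in new batches, their decayed counts fall under the threshold, and they are pruned, which is how the vocabulary stays bounded as new words keep arriving.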