-
Notifications
You must be signed in to change notification settings - Fork 764
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
BerTopic Model - Visualization ignores 0th index #667
Comments
Thank you for the extensive description! This is indeed a known issue and has to do with how the topics are accessed. It will be fixed in the next release as there will be some changes to the internal structure and the way topics are accessed. For now, it should work by running the following: |
Thanks a lot for the quick response @MaartenGr , I appreciate your effort. After configuring the mentioned attribute |
* Online/incremental topic modeling with .partial_fit * Expose c-TF-IDF model for customization with bertopic.vectorizers.ClassTfidfTransformer * Expose attributes for easier access to internal data * Major changes to the Algorithm page of the documentation, which now contains three overviews of the algorithm * Added an example of combining BERTopic with KeyBERT * Added many tests with the intention of making development a bit more stable * Fix #632, #648, #673, #682, #667, #664
The BerTopic model resulted the below Topics:
As you can see from the above, the model is finetuned to generate lesser outliers '-1' which has the count of 3 and it appears in the last.
While visualizing the Topics per class,
topic_model.visualize_topics_per_class(topics_per_class)
the below interactive visual is generated, and however it ignored the
0th
index, to be precise theTopic 0
. The Global Topic Representations are displayed from1, 2, 3, 4, 5, 6, -1
The text was updated successfully, but these errors were encountered: