Replies: 1 comment
When you use |
I'm trying to build BERTopic models on a set of ~130k tweets, preprocessed in the same way as the DTM example in the docs. For some reason, topic 0 has over 200 representative docs, topic 1 has 6, and every other topic has just 3 (which seems to be the expected/correct number). This also makes me wonder whether the selected keywords for topics 0 and 1, as well as the docs assigned to those topics, can be trusted.
My code is below. I'm running BERTopic v0.13.0 in a Jupyter notebook. Any ideas would be appreciated, thanks!
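For anyone wanting to check the same thing on their own model, here is a minimal sketch (not the code from this post) that lists topics whose number of representative docs differs from the expected 3. It assumes the input has the shape returned by `topic_model.get_representative_docs()`, i.e. a dict mapping topic id to a list of documents. The helper name `count_unexpected_rep_docs` and the sample data are hypothetical, made up purely for illustration.

```python
from typing import Dict, List


def count_unexpected_rep_docs(
    rep_docs: Dict[int, List[str]], expected: int = 3
) -> Dict[int, int]:
    """Return {topic_id: n_docs} for topics whose count differs from `expected`."""
    return {
        topic: len(docs)
        for topic, docs in rep_docs.items()
        if len(docs) != expected
    }


# Hypothetical stand-in for topic_model.get_representative_docs();
# the counts only mimic the pattern described in the question.
sample_rep_docs = {
    0: ["tweet"] * 200,
    1: ["tweet"] * 6,
    2: ["tweet a", "tweet b", "tweet c"],
}

unexpected = count_unexpected_rep_docs(sample_rep_docs)
for topic, n_docs in sorted(unexpected.items()):
    print(f"topic {topic}: {n_docs} representative docs")
```

With a fitted model, the call would presumably be `count_unexpected_rep_docs(topic_model.get_representative_docs())`. An empty result means every topic has the expected number of representative docs.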