Replies: 1 comment 2 replies
-
It's very difficult to say anything without knowing a bit more about your code. Could you share your full code for creating the topic model? Most important are how you initialized BERTopic and any processing you did afterwards. Also, which version of BERTopic are you using?
That is to be expected as this method uses a different technique of calculating the probabilities, hence the If you are missing particular topic, then it might help to tweak the parameters of |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Dear developer and anyone who see this
After topic_model.fit_transform, i have assign my 10000 documents to 10 topics,
but i find some of them are not very reasonable,
for example
document1 "Advances in intracranial electroencephalography (iEEG) and neurophysiology have enabled the study of previously inaccessible brain regions with high fidelity temporal and spatial resolution...." should be assigned to Topic5 EEG and BCI.
However, The document1 has been assigned to Topic 6 adhd and cognition.
I think i have to tune some parameters to correct this fault. Here are my solutions
1. Maybe the UMAP component is too low to tell the difference between Topic3 and 6, I need to set component to >20 (not helpful)
2. I want to check contribution of each words by topic_model.visualize_approximate_distribution , but as shown in picture below,the result is different from probs.
How can i make the assignment right?
Beta Was this translation helpful? Give feedback.
All reactions