How to calculate perplexity value? #2258
Unanswered
powerhorse1986
asked this question in
Q&A
Replies: 1 comment 1 reply
-
Does the reviewer also have a suggestion of how to calculate the perplexity? BERTopic is not a probabilistic model and as such does not (directly) calculate word probabilities / document-topic probabilities the same way that something like LDA does. That said, you can approximate the word probabilities by using the c-TF-IDF matrix for each topic and push them between 0 and 1. Then, you can get the document-topic probabilities by setting So I think it would be possible to do so and get the perplexity, I'm just not sure how meaningful it would be. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi Maarten,
How are you? So sorry to bother you again. Recently our group finished a project by using BERTopic and submitted a manuscript. One of the reviewers insisted that we must calculate the perplexity score. I totally have no idea how to get it done. Would you please give some hints? Thank you. And, happy new year in advance.
Li
Beta Was this translation helpful? Give feedback.
All reactions