
[Topic Models] Understand various shades of “Other/None of the Above” #1876

Open
jucor opened this issue Jan 21, 2025 · 4 comments
Labels
feature-request For new feature suggestions

Comments

@jucor
Contributor

jucor commented Jan 21, 2025

Most topic classifications, including those produced by humans, have an “Other” class, which humans understand as “None of the above”. Some topic models explicitly model one or several “noise” classes.
When we use a topic model library, we will want to ensure that the “Other” class matches our human users' expectations. In particular, some libraries use “Other” as a catch-all that includes both “none of the above” and “algorithm could not determine”. While this can seem a subtle nuance, there is a real difference:

  • “None of the above” is a property of the comment: the classifier looked at all the above and said “this does not belong to any of them”. For an automated classifier, that can mean “I’m sure that it does not belong”.
  • “Algorithm could not determine” is a property of the algorithm. It could, for example, mean that a mixture algorithm failed to converge, that the algorithm cannot handle some of the text in the comment, or that there was an unexpected error.

That level of detail then allows us to better understand what our failure cases are, what we cover, and what might be missing from those topic classifications. It becomes even more important when the “Other” category is large.
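To make the distinction concrete, here is a minimal sketch (not the actual API of any topic-model library; all names are illustrative) of a classification result type that keeps the two shades of “Other” separate, using a toy keyword matcher as the classifier:

```python
# Sketch: keep "none of the above" (a fact about the comment) distinct from
# "algorithm could not determine" (a failure of the algorithm).
from dataclasses import dataclass
from enum import Enum, auto
from typing import Optional

class Outcome(Enum):
    TOPIC = auto()               # matched one of the known topics
    NONE_OF_THE_ABOVE = auto()   # examined and fits none of the topics
    UNDETERMINED = auto()        # the algorithm could not produce an answer

@dataclass
class ClassificationResult:
    outcome: Outcome
    topic: Optional[str] = None  # set only when outcome is TOPIC
    error: Optional[str] = None  # set only when outcome is UNDETERMINED

def classify(comment: str, topics: list[str]) -> ClassificationResult:
    """Toy keyword classifier with explicit, separate failure modes."""
    if not comment.strip():
        # The algorithm cannot handle empty or deleted content: UNDETERMINED.
        return ClassificationResult(Outcome.UNDETERMINED, error="empty comment")
    for topic in topics:
        if topic.lower() in comment.lower():
            return ClassificationResult(Outcome.TOPIC, topic=topic)
    # The comment was examined and matched nothing: NONE_OF_THE_ABOVE.
    return ClassificationResult(Outcome.NONE_OF_THE_ABOVE)
```

A real model would replace the keyword loop, but the point is the return type: downstream reporting can then count the two shades separately instead of lumping both into one “Other” bucket.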

@cianbrassilg

cianbrassilg commented Jan 22, 2025

Great points here @jucor - this is an area we've been looking to improve recently also. We've also been considering that:

  • If the user is providing the categories, then a larger other/none category may make sense, depending on the categories provided, if many comments don't logically fit.
  • If the model is generating the categories, then we should likely expect significantly fewer 'other'-categorized comments.

Curious if in your testing you've come across typical comment types that fit the "algorithm could not determine" type category you mentioned?

@jucor
Contributor Author

jucor commented Jan 22, 2025

Hi @cianbrassilg !
[Note: shall we move the discussion specific to Jigsaw's sensemaking-tools into the issue opened in the repository specific to that product/library ? here: Jigsaw-Code/sensemaking-tools#10 ]

Great points here @jucor - this is an area we've been looking to improve recently also.

Terrific :)

We've also been considering that:

  • If the user is providing the categories, then a larger other/none category may make sense, depending on the categories provided, if many comments don't logically fit.

That makes sense to me! Might be worth then detecting it and issuing a warning at the end, sort of "You might want to re-run with automatic discovery of categories".

  • If the model is generating the categories, then we should likely expect significantly fewer 'other'-categorized comments.

Agreed.

Curious if in your testing you've come across typical comment types that fit the "algorithm could not determine" type category you mentioned?

Yes, for example when I ran the BG2018-short "2018 BG with vote tallies (filtered) - comments-with-votes-small" example spreadsheet provided by @metasoarous: more than 70% of the comments ended up as "algorithm could not determine". I suspect (but did not verify) that this is because the spreadsheet had a lot of comments that were not filtered out but whose content had been deleted.

I also remember @DZNarayanan mentioning that "Other" is often pretty big, and someone on your team mentioning that it's often the biggest category. So as we investigate why, I think ruling out "algorithm could not determine" would be the first thing to check (and since doing that automatically is just a code change, it would be easier than doing it manually).

@DZNarayanan
Collaborator

Hi @cianbrassilg,

It would be helpful if you could modify the code to mark which statements in the "Other" category fall under "algorithm could not determine" and which fall under "none of the above". That will make it easier to find patterns and then figure out how to reduce the size of the "Other" category.

Thanks.
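[Editor's sketch of the marking suggested above, with hypothetical label strings; any per-comment tagging scheme the library actually uses would work the same way. It splits an "Other" bucket into its two shades and flags it when it is large, echoing the warning idea discussed earlier in the thread:]

```python
# Sketch: summarize per-comment labels, keeping the two shades of "Other"
# separate, and warn when their combined share is large.
from collections import Counter

NONE_OF_THE_ABOVE = "none_of_the_above"           # the comment fits no topic
UNDETERMINED = "algorithm_could_not_determine"    # the algorithm failed

def summarize_other(labels: list[str], warn_threshold: float = 0.3) -> dict:
    """Count each shade of 'Other' and flag the bucket when it is large."""
    counts = Counter(labels)
    total = len(labels) or 1  # avoid division by zero on an empty run
    other_share = (counts[NONE_OF_THE_ABOVE] + counts[UNDETERMINED]) / total
    return {
        "none_of_the_above": counts[NONE_OF_THE_ABOVE],
        "undetermined": counts[UNDETERMINED],
        "other_share": other_share,
        # A large "Other" might suggest re-running with automatic
        # discovery of categories, as proposed earlier in the thread.
        "warn": other_share > warn_threshold,
    }
```

With the shades separated like this, a large "undetermined" count points at an algorithmic problem to fix, while a large "none_of_the_above" count points at missing categories.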

@cianbrassilg

cianbrassilg commented Jan 22, 2025

shall we move the discussion specific to Jigsaw's sensemaking-tools into the issue opened in the repository specific to that product/library ?
@jucor Yes, sounds good! We can continue there 👍

mark which statements in the "Other" category fall under "algorithm could not determine” and which ones "none of the above."
@DZNarayanan Good suggestion, will discuss with the team here also
