Unexpected Clustering Result with Given Parameters #643

Open

yata0 opened this issue Jun 26, 2024 · 0 comments

yata0 commented Jun 26, 2024

I've encountered unexpected behavior when clustering my text corpus. The corpus contains the following texts:

text1
text2
text3

With the distance metric I'm using, dist(text1, text2) < dist(text1, text3). However, the clustering algorithm grouped text1 with text3 and labeled text2 as an isolated point (noise). I expected text1 to be clustered with text2, since that is the closer pair.

Parameters Used:

min_cluster_size = 2
min_samples = 2
distance: Euclidean distance
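
For anyone trying to reproduce this, here is a minimal sketch of the distance check described above. It assumes each text has already been embedded as a numeric vector. The vectors below are hypothetical placeholders, not the actual corpus, and the helper names (`euclidean`, `pairwise_distances`) are only for illustration.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def pairwise_distances(vectors):
    """Return {(name_a, name_b): distance} for every unordered pair of names."""
    names = sorted(vectors)
    return {
        (a, b): euclidean(vectors[a], vectors[b])
        for i, a in enumerate(names)
        for b in names[i + 1:]
    }

# Hypothetical placeholder embeddings (assumption: NOT the real data).
embeddings = {
    "text1": [0.0, 0.0],
    "text2": [1.0, 0.0],
    "text3": [0.0, 3.0],
}

distances = pairwise_distances(embeddings)

# The condition stated in this report: text2 is closer to text1 than text3 is.
closer_pair_is_text1_text2 = (
    distances[("text1", "text2")] < distances[("text1", "text3")]
)
```

Running the same check on the real vectors, and including the full pairwise table (in particular dist(text2, text3)) alongside the cluster labels, should make it easier to tell whether the grouping follows from the data or from the parameters.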
