-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Different DBCV value from Matlab's implementation when calculating it on a dataset having all 0s cluster labels #9
Comments
Hi, Davide. Could you please elaborate on the expected semantics of the label 0? Is it supposed to represent all noise or a legitimate cluster? Please keep in mind that the matlab implementation considers "0" the label noise, thus the result being 0.0 makes sense, whereas my implementation considers - by default - "0" a legitimate cluster and "-1" representing the noise labels, thus the result being essentially 1.0 also makes sense. Am I missing something? Best regards, |
Hi Felipe Thanks for your reply. My file has the following meaning: We are aware of the Matlab implementation's differences, and therefore we added +1 to our data when dealing with Matlab code. We changed the clusters this way: So, our Matlab code does not interpret our 0 labels as noise, but as elements of the first cluster, as it should. Thanks -- Davide |
Understood. I'll investigate this asap. Thanks for the report. |
Thanks Felipe. In this dataset file (DB1_with_774_clusters.csv), that has the same structure of the DB1_with_307_clusters.csv file mentioned earlier, we get:
Perhaps you want to study it a bit as well. Muito obrigado! :-) |
Hi Felipe
Thanks again for your availability to fix the issues in your software package during December.
I am writing to you today because I found another case where your
dbcv()
function generates an outcome which is different from Matlab's implementation one.I have applied your
dbcv()
function to this DB1_with_307_clusters.csvdataset file, where the last column on the right contains the cluster labels, which are all zeros.
I used this piece of code:
The result is 0.999 and is clearly wrong. A collaborator of mine applied the function of the original DBCV Matlab implementation and obtained 0 as result.
Can you please investigate this problem?
Thanks
-- Davide
The text was updated successfully, but these errors were encountered: