Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Weights for data points #10

Closed
SirHodlington opened this issue Apr 21, 2021 · 2 comments
Closed

Weights for data points #10

SirHodlington opened this issue Apr 21, 2021 · 2 comments

Comments

@SirHodlington
Copy link

Is it possible to use weights for data points?
I currently add duplicates to create "weights" for my data points. Seems very inefficient. Is there a better way to do it?

@doxakis
Copy link
Owner

doxakis commented Apr 22, 2021

Hi,

I'm not sure if it will work, but one way you could try is by creating a new distance function.

You can see many examples here:
https://github.com/doxakis/HdbscanSharp/tree/master/HdbscanSharp/Distance

There are in fact many distance functions I could have implemented.
This issue list some that are implemented in one python library: #6

Thanks

@SirHodlington
Copy link
Author

SirHodlington commented Apr 22, 2021

EDIT: Solved. It can be done by calculating the distance matrix. Just one entry per identical data point and for the distances the value is distance/(weight of datapoint) instead of distance
Issue can be closed

@doxakis doxakis closed this as completed Apr 22, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants