Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add coreset k-medoid algorithms #4

Closed
jianshu93 opened this issue Dec 11, 2023 · 1 comment
Closed

Add coreset k-medoid algorithms #4

jianshu93 opened this issue Dec 11, 2023 · 1 comment
Labels
enhancement New feature or request needs funding Issues that would need funding to be completed

Comments

@jianshu93
Copy link

Dear rust-kmedoids team,

It seems several recently publications on coreset can be applied in the k-medoid case for approximate clustering. Check the newly developed coreset crate here: https://github.com/jean-pierreBoth/coreset

It seems only metric space is supported by coreset.

Thank you,

Jianshu

@kno10 kno10 added enhancement New feature or request help wanted Extra attention is needed labels Dec 11, 2023
@kno10
Copy link
Owner

kno10 commented Dec 11, 2023

In my experience, coreset approaches are mostly of theoretical interest because of the constant factors.
For many practical applications, you can either afford to just run the full optimization, or you can just sample uniformly and use the sample solution.
BanditPAM is another recent proposal (#2, NeurIPS 2020, NeurIPS 2023) where in my experience the theoretical results to do not transfer to practical benefits.
For both you can find implementations online - try them out, and let me know if they work well for you.

Both methods are currently out of scope for this package because they need distance functions, and this package currently operates only on precomputed distance matrices, and we do not include a single distance function so far.
Hence CLARA #5 should be added first, which is the simplest method that uses distance computations instead of distance matrixes, then CLARANS #6 next.

@kno10 kno10 changed the title coreset for even faster k-medoid--discussion? Add coreset k-medoid algorithms Dec 11, 2023
@kno10 kno10 added the needs funding Issues that would need funding to be completed label Dec 11, 2023
@kno10 kno10 removed the help wanted Extra attention is needed label Dec 11, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request needs funding Issues that would need funding to be completed
Projects
None yet
Development

No branches or pull requests

2 participants