-
Notifications
You must be signed in to change notification settings - Fork 56
Performance benchmarks #41
Comments
Hi @mikepalei, Thanks for interest in KNN plugin. We are in the process of publishing performance benchmarks [#42 ]. Stay tuned :) Couple of suggestions to improve performance
Lucene runs through each segment sequentially to answer search query on a shard. You can reduce to 1 segment, so you just have 1 graph.
If you are just trying out vector search(all you need is the nearest doc ids for the query vector), you can improve the performance by asking Elasticsearch not to read the stored fields.
Also let us know which ES version you are referring to. We recently fixed multiple leaks and improved performance in opendistro-1.4. We are yet to backport changes to other release. |
Thanks @vamshin ! I shall certainly try that and report the results. |
Hello, Is there a way to control ef_search in the query? I am trying to bench K-NN Plugin: Thanks, |
Sorry we cannot control ef_search in the query. But definitely seem like something we should support. Created issue #116 |
Thanks @vamshin ! I watched it. |
Also on the side note, couple of suggestions for bench mark,
You might find this link useful for indexing/search tuning https://medium.com/@kumon/how-to-realize-similarity-search-with-elasticsearch-3dd5641b9adb |
@vamshin Thanks you for suggestions. I will check it out. |
Duplicate of #42 |
Hi guys,
thanks for publishing this wonderful plugins. Do you by any chance have some performance benchmarks?
I used an EC2 instance (16 CPUs 64GB RAM) and indexed 100K documents with vectors of size 4K. It takes 70-80 ms to execute a search query. Is there a way to boost it yet further?
Many thanks,
Mike
The text was updated successfully, but these errors were encountered: