Several optimizations #737

masylum · 2024-06-17T09:56:49Z

I've been profiling orama, since it's taking 4s to ingest around 4k documents on my project and I would like to lower this down. I've noticed several things: It's ingesting a large amount of empty strings which is useless CPU time and also it's recalculating ids redundantly. This commit tries to address this two issues.

Additionally, I've noticed that due to the async APIs, the code is spending most of it's time waiting for "run microtasks". I have no idea if it would be possible to compile those away, because right now it makes the default implementation much worse (performance-wise) in order for people to be able to provide their storage solution.

Lastly, I've also noticed that providing the ID of the document, makes the ID be stored as part of the document properties. I thought it would only be to replace the default orama ID. I will fix this myself in userland by using getDocumentProperties, but perhaps is good to either change this default or document it.

I've been profiling orama, since it's taking 4s to ingest around 4k documents on my project and I would like to lower this down. I've noticed several things: It's ingesting a large amount of empty strings which is useless CPU time and also it's recalculating ids redundantly. This commit tries to address this two issues. Additionally, I've noticed that due to the async APIs, the code is spending most of it's time waiting for "run microtasks". I have no idea if it would be possible to compile those away, because right now it makes the default implementation much worse (performance-wise) in order for people to be able to provide their storage solution. Lastly, I've also noticed that providing the ID of the document, makes the ID be stored as part of the document properties. I thought it would only be to replace the default orama ID. I will fix this myself in userland by using `getDocumentProperties`, but perhaps is good to either change this default or document it.

vercel · 2024-06-17T09:56:53Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
orama-docs	❌ Failed (Inspect)			Jun 17, 2024 9:58am

micheleriva · 2024-06-28T10:18:58Z

Hi there! I fear you'll need to regenerate the test snapshots. The PR looks good then!

Thank you so much

masylum · 2024-07-24T20:39:27Z

how do I do that?

vercel bot had a problem deploying to Preview June 17, 2024 09:58 Failure

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Several optimizations #737

Several optimizations #737

masylum commented Jun 17, 2024

vercel bot commented Jun 17, 2024 •

edited

Loading

micheleriva commented Jun 28, 2024

masylum commented Jul 24, 2024

Several optimizations #737

Are you sure you want to change the base?

Several optimizations #737

Conversation

masylum commented Jun 17, 2024

vercel bot commented Jun 17, 2024 • edited Loading

micheleriva commented Jun 28, 2024

masylum commented Jul 24, 2024

vercel bot commented Jun 17, 2024 •

edited

Loading