
High memory consumption #3427

Open
arouene opened this issue Dec 11, 2024 · 6 comments

arouene commented Dec 11, 2024

Hello,

We have a Danswer host installed with Docker. The host has 64GB of RAM, but Vespa keeps getting killed by the OOM killer, and while it restarts, the backend can no longer reach Vespa.

Memory:

# free -h
               total        used        free      shared  buff/cache   available
Mem:            62Gi        42Gi       2.6Gi       148Mi        18Gi        20Gi
Swap:             0B          0B          0B

Vespa getting killed for OOM:

[Tue Dec 10 17:20:04 2024] oom_reaper: reaped process 163083 (vespa-proton-bi), now anon-rss:0kB, file-rss:208kB, shmem-rss:0kB

The backend cannot reach Vespa anymore:

ERROR:    12/11/2024 09:29:58 AM       handle_regular_answer.py  269: [Channel ID: D07EXAZES4U] Unable to process message - did not successfully answer in 5 attempts
Traceback (most recent call last):
  File "/app/danswer/document_index/vespa/chunk_retrieval.py", line 303, in query_vespa
    response.raise_for_status()
  File "/usr/local/lib/python3.11/site-packages/httpx/_models.py", line 761, in raise_for_status
    raise HTTPStatusError(message, request=request, response=self)
httpx.HTTPStatusError: Server error '503 Service Unavailable' for url 'http://index:8081/search/'

If I restart the backend container, it starts working again.

Is it normal for Danswer to require more than 64GB of memory?

rkuo-danswer (Contributor) commented:

Hi Aroune, it depends entirely on the number and size of the documents you are indexing. Could you provide us with some more context?

arouene (Author) commented Jan 7, 2025

Hello, thanks for your interest in this boring OOM problem!

Here is a screenshot of our connectors:
[screenshot: list of configured connectors]

The web connectors point at internal websites: blogs, forums, and front-page-style sites.

Is that helpful?

rkuo-danswer (Contributor) commented:

That is definitely a lot of docs; it may be necessary to add RAM to support that. You can get a better feel for this by inspecting Vespa's memory usage via its metrics endpoint (see the sketch after the links below).

https://docs.vespa.ai/en/operations/metrics.html
https://stackoverflow.com/questions/68014005/which-all-metric-to-trace-to-determine-if-the-resource-needs-to-be-added
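
A minimal sketch of pulling memory-related metrics from Vespa, assuming the Danswer "index" container serves Vespa's standard /state/v1/metrics endpoint on the same port the backend already queries (8081, per the traceback above); the URL, port, and metric-name filter here are assumptions to adapt to your deployment:

# check_vespa_memory.py -- hypothetical helper, not part of Danswer.
# Assumes the Vespa "index" container exposes /state/v1/metrics on port 8081.
import httpx

VESPA_METRICS_URL = "http://index:8081/state/v1/metrics"  # assumed host/port

def print_memory_metrics() -> None:
    resp = httpx.get(VESPA_METRICS_URL, timeout=10)
    resp.raise_for_status()
    payload = resp.json()
    for metric in payload.get("metrics", {}).get("values", []):
        name = metric.get("name", "")
        # Keep only memory-related gauges, e.g. content.proton.resource_usage.memory
        if "memory" in name:
            print(name, metric.get("values"))

if __name__ == "__main__":
    print_memory_metrics()

Running it from the backend container is probably easiest, since that container already resolves the "index" hostname and ships httpx (as the traceback above shows).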

arouene (Author) commented Jan 10, 2025

OK, so as I understand it, this seems pretty legit.
Thanks for the docs, I will take a peek!

JonnyPower commented:
@rkuo-danswer we are experiencing a similar issue. Is it expected that the index container's memory usage grows proportionally to the number of indexed documents?

That seems like a poor design decision if that's the case; it acts more like a memory leak. Isn't the Vespa database meant to avoid loading every document into RAM?

rkuo-danswer (Contributor) commented:

Not really ... in fact, keeping this data in memory is a key part of being able to perform similarity searches across documents quickly. There are probably some significant optimizations we can apply here, but generally speaking this is expected behavior.
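
To get an intuition for why memory scales with corpus size: the embedding of every indexed chunk typically sits in RAM for the HNSW index that powers similarity search. A rough back-of-the-envelope estimate, in which the chunk count, embedding dimension, and overhead multiplier are all assumptions to replace with your own numbers:

# Rough estimate of resident memory needed for in-memory vector search.
# Every constant below is an assumption -- plug in your own values.
NUM_CHUNKS = 10_000_000   # assumed total indexed chunks (not documents)
EMBEDDING_DIM = 768       # assumed dimension of a base-size embedding model
BYTES_PER_FLOAT = 4       # float32 vectors
OVERHEAD = 1.5            # rough multiplier for HNSW graph + attribute data

vector_bytes = NUM_CHUNKS * EMBEDDING_DIM * BYTES_PER_FLOAT
print(f"~{vector_bytes * OVERHEAD / 1024**3:.1f} GiB for vectors alone")
# -> ~42.9 GiB with these assumed numbers, before query and feed buffers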
