-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
llama-server <embedding> exited with status code -1 #3056
Comments
Hi - could you also share the command being used to start docker container? |
Sure, its the one in the getting started guide just with a different file path.
|
Thanks - could you execute the quoted command in container (e.g docker exec -it ...) and share its output?
|
|
Could you share the contents of your The error logging could use some improvement, though |
That may be it, lack of AVX2, Which version was the last that didn't require it?
|
I believe we need AVX2 support for all versions following the migration to llama.cpp from ctranslate2 (0.5+) |
It should be possible to fallback to AVX, its in llama.cpp |
The last time I checked, ppl still needed to compile llama.cpp individually for AVX/AVX2. Given that llama.cpp itself still distributes different versions for AVX, I guess that's still the case? |
Looks like they have seperated out NOAVX, AVX and AVX2. would this be possible to impliment as branches? I won't lie its outside my knowledge depth. I just remember running something else on this server before that used llama.cpp but with just AVX. |
This PR should do the trick: #3057 - you might want to try running Docker compilation from the branch to build the image. |
So that would be cloning the repo then making the Docker image from the provided docker/Dockerfile.cuda with the PR change done locally? |
Describe the bug
llama-server exited with status code -1
Information about your version
Unable to get version as it will not start. Docker image used:
Information about your GPU
Additional context
I removed my old Tabby setup and pulled the new container and set a new data folder however it fails with the error:
The text was updated successfully, but these errors were encountered: