llama-server as CLI
#11219
@ericcurtin I'm creating a dedicated discussion so that the original thread doesn't go off-topic.

FYI, my recent refactor #10691 is aimed at (somewhat) doing this. The idea is that, in the future, we can expose llama-server as an internal library. A downstream program (like the llama-client you mentioned) could then make calls into llama-server directly, without ever touching the HTTP stack.

Not sure if this is what you're looking for; feel free to discuss more.
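To make the direction concrete, here is a minimal sketch of what such a downstream program might look like. None of these symbols exist in llama.cpp today: `llama_server_ctx`, `llama_server_init`, `llama_server_completion`, and `llama_server_free` are hypothetical names for the future library API described above, with stub bodies included so the skeleton compiles on its own.

```cpp
// Hypothetical sketch: llama.cpp exposes no such library API today.
// It only illustrates the idea from #10691 of a downstream CLI
// (e.g. a "llama-client") driving llama-server in-process, with no HTTP.
#include <iostream>
#include <string>

// --- Imagined future "llama-server as a library" interface ---------------
struct llama_server_ctx {          // opaque handle: model, slots, KV cache...
    std::string model_path;
};

// Load the model and build the same state llama-server sets up on startup.
static llama_server_ctx *llama_server_init(const std::string &model_path) {
    return new llama_server_ctx{model_path};  // stub: real code loads the model
}

// Run one completion request through the server's internal handler,
// bypassing the HTTP layer entirely.
static std::string llama_server_completion(llama_server_ctx *ctx,
                                           const std::string &prompt) {
    // stub: real code would tokenize, decode, and sample here
    return "[completion for: " + prompt + " via " + ctx->model_path + "]";
}

static void llama_server_free(llama_server_ctx *ctx) { delete ctx; }
// --------------------------------------------------------------------------

int main(int argc, char **argv) {
    if (argc < 3) {
        std::cerr << "usage: " << argv[0] << " <model.gguf> <prompt>\n";
        return 1;
    }
    llama_server_ctx *ctx = llama_server_init(argv[1]);
    std::cout << llama_server_completion(ctx, argv[2]) << "\n";
    llama_server_free(ctx);
    return 0;
}
```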
Reply:

I think something that can make calls without touching HTTP is useful; you get that extra performance. But it would still be nice to keep an HTTP-based, OpenAI-compatible option as well, since the OpenAI API is more popular and more of a de facto standard.
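For comparison, the HTTP path already works today: llama-server exposes an OpenAI-compatible `POST /v1/chat/completions` endpoint. A minimal libcurl client might look like the following sketch, assuming a server is already running locally on the default port 8080 (e.g. `llama-server -m model.gguf`).

```cpp
// Minimal libcurl client for llama-server's OpenAI-compatible endpoint.
// Assumes llama-server is already running on the default port 8080.
#include <curl/curl.h>
#include <iostream>
#include <string>

// libcurl write callback: append the response body into a std::string.
static size_t on_body(char *data, size_t size, size_t nmemb, void *userp) {
    static_cast<std::string *>(userp)->append(data, size * nmemb);
    return size * nmemb;
}

int main() {
    curl_global_init(CURL_GLOBAL_DEFAULT);
    CURL *curl = curl_easy_init();
    if (!curl) return 1;

    // Standard OpenAI-style chat completion request body.
    const std::string body =
        R"({"messages":[{"role":"user","content":"Say hello in one sentence."}]})";

    std::string response;
    struct curl_slist *headers = nullptr;
    headers = curl_slist_append(headers, "Content-Type: application/json");

    curl_easy_setopt(curl, CURLOPT_URL, "http://127.0.0.1:8080/v1/chat/completions");
    curl_easy_setopt(curl, CURLOPT_HTTPHEADER, headers);
    curl_easy_setopt(curl, CURLOPT_POSTFIELDS, body.c_str());
    curl_easy_setopt(curl, CURLOPT_WRITEFUNCTION, on_body);
    curl_easy_setopt(curl, CURLOPT_WRITEDATA, &response);

    CURLcode res = curl_easy_perform(curl);
    if (res != CURLE_OK) {
        std::cerr << "request failed: " << curl_easy_strerror(res) << "\n";
    } else {
        std::cout << response << "\n";  // raw OpenAI-style JSON response
    }

    curl_slist_free_all(headers);
    curl_easy_cleanup(curl);
    curl_global_cleanup();
    return res == CURLE_OK ? 0 : 1;
}
```

The in-process sketch above avoids exactly this JSON-over-HTTP round trip, which is where the extra performance mentioned in the reply would come from.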