Skip to content

Distributed inference - 16 GPU limit #11218

Answered by slaren
justinjja asked this question in Q&A
Discussion options

You must be logged in to vote

You should be able to use any number of devices by increasing the value of GGML_SCHED_MAX_BACKENDS.

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@justinjja
Comment options

Answer selected by justinjja
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants