
llama : add llama_model_load_from_splits #11255

Open: wants to merge 2 commits into master
Conversation

ngxson (Collaborator) commented on Jan 15, 2025

Some downstream programs may want to use non-conventional file names; for example, ollama uses SHA256 hashes as file names. This makes adding support for multi-split GGUF models tricky.

This PR adds a new API, llama_model_load_from_splits, that allows the user to manually specify a list of GGUF files:

    // Load the model from multiple splits (support custom naming scheme)
    // The paths must be in the correct order
    LLAMA_API struct llama_model * llama_model_load_from_splits(
                             const char ** paths,
                                 size_t    n_paths,
              struct llama_model_params    params);
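
For illustration, a minimal caller could look like the following sketch. The file names are placeholders (e.g. ollama-style SHA256 blob names), and llama_model_default_params() / llama_model_free() are the existing llama.h helpers:

    #include "llama.h"

    int main(void) {
        // placeholder file names - any naming scheme works,
        // as long as the splits are listed in the correct order
        const char * paths[] = {
            "sha256-aaaa", // split 1 of 2
            "sha256-bbbb", // split 2 of 2
        };

        struct llama_model_params params = llama_model_default_params();
        struct llama_model * model = llama_model_load_from_splits(paths, 2, params);
        if (model == NULL) {
            return 1; // loading failed
        }

        // ... use the model ...

        llama_model_free(model);
        return 0;
    }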

ngxson requested a review from ggerganov on January 15, 2025
Review thread on src/llama.cpp, comment on lines 169 to 171 (resolved):
    // return a list of splits for a given path
    // for example, given "<name>-00002-of-00004.gguf", returns list of all 4 splits
    std::vector<std::string> llama_get_list_splits(const std::string & path, const int n_split);
Owner:

This can be a static function in the source file only; no need to add it to the header.

There is also an existing llama_split_ prefix which seems suitable for this function: llama_split_get_list()
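
For context, here is an illustrative standalone sketch of what such a static helper could look like, built on the existing public helpers llama_split_prefix() and llama_split_path() from llama.h; this is not necessarily the PR's actual implementation:

    #include "llama.h"

    #include <string>
    #include <vector>

    // illustrative sketch: reconstruct the paths of all splits from a single
    // split path such as "<name>-00002-of-00004.gguf"
    static std::vector<std::string> llama_split_get_list(const std::string & path, const int n_split) {
        std::vector<std::string> paths;
        char prefix[1024]     = {0};
        char split_path[1024] = {0};

        // the split index of `path` is unknown, so probe each index until
        // llama_split_prefix() accepts the name and extracts "<name>"
        int prefix_len = 0;
        for (int idx = 0; idx < n_split && prefix_len == 0; idx++) {
            prefix_len = llama_split_prefix(prefix, sizeof(prefix), path.c_str(), idx, n_split);
        }
        if (prefix_len == 0) {
            return paths; // path does not follow the "-%05d-of-%05d.gguf" scheme
        }

        // rebuild the full path of every split in order
        for (int idx = 0; idx < n_split; idx++) {
            llama_split_path(split_path, sizeof(split_path), prefix, idx, n_split);
            paths.push_back(split_path);
        }
        return paths;
    }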

ngxson (Collaborator, Author):

Ah yeah, I wanted to use this in llama.cpp but decided not to in the end, and forgot to delete it from the header file.

It should be fixed with 49822ba
