Pull requests: ggerganov/llama.cpp
Add support for QRWKV6 hybrid models & slight optimization for RWKV6
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#11001
opened Dec 28, 2024 by
MollySophia
vulkan: experimental coalesced read to shared memory before dequantization
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#10999
opened Dec 27, 2024 by
netrunnereve
•
Draft
server : fix token duplication when streaming with stop strings
examples
server
#10997
opened Dec 27, 2024 by
z80maniac
server: added more docs for response_fields field
examples
server
#10995
opened Dec 27, 2024 by
isaac-mcfadyen
common, examples, ggml : fix MSYS2 GCC compiler errors and warnings when building with LLAMA_CURL=ON and GGML_OPENCL=ON
examples
ggml
changes relating to the ggml tensor library for machine learning
#10992
opened Dec 27, 2024 by
peter277
draft: vulkan: optimize mul_mat for small values of N
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#10991
opened Dec 26, 2024 by
jeffbolznv
Vulkan: Destroy Vulkan instance on exit
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#10989
opened Dec 26, 2024 by
0cc4m
vulkan: Use push constant offset to handle misaligned descriptors
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#10987
opened Dec 26, 2024 by
jeffbolznv
ggml : fix undefined reference to std::filesystem(#10978)
ggml
changes relating to the ggml tensor library for machine learning
#10979
opened Dec 26, 2024 by
Clauszy
server : add OAI compat for /v1/completions
breaking change
Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility.
examples
python
python script changes
server
#10974
opened Dec 25, 2024 by
ngxson
2 tasks done
Removed unnecessary iteration of batch n_tokens on sequence embedding…
examples
#10972
opened Dec 25, 2024 by
Emreerdog
Cosine similarity is undefined when any vector is zero.
#10968
opened Dec 24, 2024 by
AndyM3
vulkan: im2col and matmul optimizations for stable diffusion
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#10942
opened Dec 22, 2024 by
jeffbolznv
Allow user to compile with any cuda version using github actions
devops
improvements to build systems and github actions
#10928
opened Dec 21, 2024 by
jianlins
llamafile_sgemm API - INT8 implementation
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#10912
opened Dec 20, 2024 by
amritahs-ibm
llama : add support for Cohere2ForCausalLM
python
python script changes
#10900
opened Dec 19, 2024 by
dranger003
ASCII/Romanization for OuteTTS Multilingual Processing
demo
Demonstrate some concept or idea, not intended to be merged
examples
#10894
opened Dec 19, 2024 by
edwko
SYCL: Fixes for building SYCL backend for AMD GPUs
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#10851
opened Dec 16, 2024 by
lhl
Fix compilation on Pop!_OS 22.04 LTS CUDA
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#10835
opened Dec 15, 2024 by
mika314
add ggml_backend_sched_dump_dot
ggml
changes relating to the ggml tensor library for machine learning
#10825
opened Dec 14, 2024 by
foldl
Bamba architecture
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
testing
Everything test related
#10810
opened Dec 12, 2024 by
gabe-l-hart
•
Draft
3 tasks