Skip to content

Actions: ggerganov/llama.cpp

Server

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
8,908 workflow runs
8,908 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

model: Add support for PhiMoE arch
Server #9409: Pull request #11003 synchronize by phymbert
December 29, 2024 13:57 7m 43s phymbert/model/phi35-moe
December 29, 2024 13:57 7m 43s
vulkan: im2col and matmul optimizations for stable diffusion (#10942)
Server #9408: Commit a813bad pushed by 0cc4m
December 29, 2024 09:16 6m 16s master
December 29, 2024 09:16 6m 16s
vulkan: Use push constant offset to handle misaligned descriptors (#1…
Server #9407: Commit fdd2188 pushed by 0cc4m
December 29, 2024 08:35 5m 55s master
December 29, 2024 08:35 5m 55s
vulkan: optimize mul_mat for small values of N
Server #9406: Pull request #10991 synchronize by jeffbolznv
December 28, 2024 21:59 6m 3s jeffbolznv:small_batch_opt
December 28, 2024 21:59 6m 3s
model: Add support for PhiMoE arch
Server #9405: Pull request #11003 synchronize by phymbert
December 28, 2024 18:38 13m 20s phymbert/model/phi35-moe
December 28, 2024 18:38 13m 20s
model: Add support for PhiMoE arch
Server #9404: Pull request #11003 synchronize by phymbert
December 28, 2024 18:38 33s phymbert/model/phi35-moe
December 28, 2024 18:38 33s
server : allow using LoRA adapters per-request
Server #9403: Pull request #10994 synchronize by ngxson
December 28, 2024 15:17 22m 32s ngxson:xsn/lora_per_request
December 28, 2024 15:17 22m 32s
server: added more docs for response_fields field (#10995)
Server #9402: Commit f865ea1 pushed by ngxson
December 28, 2024 15:09 14m 55s master
December 28, 2024 15:09 14m 55s
server : fix token duplication when streaming with stop strings (#10997)
Server #9401: Commit 16cdce7 pushed by ngxson
December 28, 2024 15:08 5m 27s master
December 28, 2024 15:08 5m 27s
model: Add support for PhiMoE arch
Server #9400: Pull request #11003 synchronize by phymbert
December 28, 2024 14:50 5m 55s phymbert/model/phi35-moe
December 28, 2024 14:50 5m 55s
model: Add support for PhiMoE arch
Server #9399: Pull request #11003 opened by phymbert
December 28, 2024 14:46 4m 5s phymbert/model/phi35-moe
December 28, 2024 14:46 4m 5s
vulkan: im2col and matmul optimizations for stable diffusion
Server #9392: Pull request #10942 synchronize by jeffbolznv
December 27, 2024 22:50 5m 21s jeffbolznv:im2col
December 27, 2024 22:50 5m 21s
server : allow using LoRA adapters per-request
Server #9390: Pull request #10994 synchronize by ngxson
December 27, 2024 19:22 4m 42s ngxson:xsn/lora_per_request
December 27, 2024 19:22 4m 42s
server : fix token duplication when streaming with stop strings
Server #9389: Pull request #10997 opened by z80maniac
December 27, 2024 19:05 8m 13s z80maniac:dup-fix
December 27, 2024 19:05 8m 13s
server : allow using LoRA adapters per-request
Server #9388: Pull request #10994 synchronize by ngxson
December 27, 2024 17:35 8m 26s ngxson:xsn/lora_per_request
December 27, 2024 17:35 8m 26s
server : allow using LoRA adapters per-request
Server #9386: Pull request #10994 opened by ngxson
December 27, 2024 15:12 6m 28s ngxson:xsn/lora_per_request
December 27, 2024 15:12 6m 28s