-
Notifications
You must be signed in to change notification settings - Fork 14.2k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
vulkan: Support UPSCALE w/antialias
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#18327
opened Dec 23, 2025 by
jeffbolznv
Loading…
Support Youtu-VL Model
examples
python
python script changes
#18315
opened Dec 23, 2025 by
f291400
Loading…
Add metal count equal op
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
#18314
opened Dec 23, 2025 by
gatbontonpc
Loading…
utils: beging using log.h in tokenize.cpp
examples
#18307
opened Dec 22, 2025 by
syedshazli
Loading…
vulkan: handle rope with large number of rows
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#18306
opened Dec 22, 2025 by
jeffbolznv
Loading…
server: modify return_progress to also report 0% processing state
examples
python
python script changes
server
#18305
opened Dec 22, 2025 by
ngxson
Loading…
vulkan: fix command buffer corruption in ggml_backend_vk_event_wait
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#18302
opened Dec 22, 2025 by
jeffbolznv
Loading…
Webui/prompt processing progress
examples
server
#18300
opened Dec 22, 2025 by
ServeurpersoCom
Loading…
vulkan: extend topk_moe to handle sigmoid w/exp_probs_b for nemotron
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#18295
opened Dec 22, 2025 by
jeffbolznv
Loading…
eval-callback : add support for saving logits
examples
#18281
opened Dec 22, 2025 by
danbev
Loading…
Vulkan: Tune Flash Attention for MoE on AMD GPUs
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#18280
opened Dec 22, 2025 by
0cc4m
Loading…
Prevent crash if TTFT >300sec, boosted to 90 days
examples
server
#18279
opened Dec 22, 2025 by
wbtek
Loading…
tools : use common_log_pause to fix fit-params output race
examples
#18276
opened Dec 22, 2025 by
Aadeshveer
Loading…
KYLIN: fix compile error for cuda backend
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#18275
opened Dec 22, 2025 by
lizhenneng
Loading…
docs: Fix typos in SYCL documentation
documentation
Improvements or additions to documentation
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#18269
opened Dec 21, 2025 by
yoka
Loading…
llama: fix magic number of 999 for GPU layers
#18266
opened Dec 21, 2025 by
JohannesGaessler
Loading…
Add Gemma3n multimodal support with MobileNetV5 vision encoder
examples
model
Model specific
python
python script changes
#18256
opened Dec 21, 2025 by
simrnsingh
Loading…
ggml-cpu: parallelize tensor repacking with OpenMP
ggml
changes relating to the ggml tensor library for machine learning
#18239
opened Dec 21, 2025 by
pestopoppa
Loading…
cli: buffering info log, only show if model load failed
examples
#18236
opened Dec 20, 2025 by
ngxson
Loading…
webui: Fix the header backdrop blur
examples
server
#18230
opened Dec 20, 2025 by
ImadSaddik
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.