ggml-org / llama.cpp Public

Notifications You must be signed in to change notification settings
Fork 14.2k
Star 91.9k

Code
Issues 363
Pull requests 625
Discussions
Actions
Projects 10
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Wiki
Security
Insights

Pull requests: ggml-org/llama.cpp

Labels 88 Milestones 0

New pull request New

625 Open 8,244 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

model: support MiMo-V2-Flash model

Model specific

python

python script changes

#18328 opened Dec 23, 2025 by ngxson • Draft

vulkan: Support UPSCALE w/antialias ggml

changes relating to the ggml tensor library for machine learning

testing

Everything test related

Vulkan

Issues specific to the Vulkan backend

#18327 opened Dec 23, 2025 by jeffbolznv

Loading…

[WIP] model: add Trillion model model

Model specific

python

python script changes

#18325 opened Dec 23, 2025 by HelloKS • Draft

3 tasks

server: (preset) add unsafe-allow-api-override examples server

#18322 opened Dec 23, 2025 by ngxson • Draft

Support Youtu-VL Model examples python

python script changes

#18315 opened Dec 23, 2025 by f291400

Loading…

Add metal count equal op Apple Metal

https://en.wikipedia.org/wiki/Metal_(API)

documentation

Improvements or additions to documentation

ggml

changes relating to the ggml tensor library for machine learning

#18314 opened Dec 23, 2025 by gatbontonpc

Loading…

utils: beging using log.h in tokenize.cpp examples

#18307 opened Dec 22, 2025 by syedshazli

Loading…

vulkan: handle rope with large number of rows ggml

changes relating to the ggml tensor library for machine learning

testing

Everything test related

Vulkan

Issues specific to the Vulkan backend

#18306 opened Dec 22, 2025 by jeffbolznv

Loading…

server: modify return_progress to also report 0% processing state examples python

python script changes

server

#18305 opened Dec 22, 2025 by ngxson

Loading…

vulkan: fix command buffer corruption in ggml_backend_vk_event_wait ggml

changes relating to the ggml tensor library for machine learning

Vulkan

Issues specific to the Vulkan backend

#18302 opened Dec 22, 2025 by jeffbolznv

Loading…

Webui/prompt processing progress examples server

#18300 opened Dec 22, 2025 by ServeurpersoCom

Loading…

vulkan: extend topk_moe to handle sigmoid w/exp_probs_b for nemotron ggml

changes relating to the ggml tensor library for machine learning

testing

Everything test related

Vulkan

Issues specific to the Vulkan backend

#18295 opened Dec 22, 2025 by jeffbolznv

Loading…

server: implement --shutdown-timeout examples server

#18292 opened Dec 22, 2025 by ngxson

Loading…

eval-callback : add support for saving logits examples

#18281 opened Dec 22, 2025 by danbev

Loading…

Vulkan: Tune Flash Attention for MoE on AMD GPUs ggml

changes relating to the ggml tensor library for machine learning

Vulkan

Issues specific to the Vulkan backend

#18280 opened Dec 22, 2025 by 0cc4m

Loading…

Prevent crash if TTFT >300sec, boosted to 90 days examples server

#18279 opened Dec 22, 2025 by wbtek

Loading…

tools : use common_log_pause to fix fit-params output race examples

#18276 opened Dec 22, 2025 by Aadeshveer

Loading…

KYLIN: fix compile error for cuda backend ggml

changes relating to the ggml tensor library for machine learning

Nvidia GPU

Issues specific to Nvidia GPUs

#18275 opened Dec 22, 2025 by lizhenneng

Loading…

docs: Fix typos in SYCL documentation documentation

Improvements or additions to documentation

SYCL

https://en.wikipedia.org/wiki/SYCL - GPU programming language

#18269 opened Dec 21, 2025 by yoka

Loading…

add LLAMA_ARG_OVERRIDE_TENSOR env var for -ot arg

#18267 opened Dec 21, 2025 by ddh0

Loading…

llama: fix magic number of 999 for GPU layers

#18266 opened Dec 21, 2025 by JohannesGaessler

Loading…

Add Gemma3n multimodal support with MobileNetV5 vision encoder examples model

Model specific

python

python script changes

#18256 opened Dec 21, 2025 by simrnsingh

Loading…

ggml-cpu: parallelize tensor repacking with OpenMP ggml

changes relating to the ggml tensor library for machine learning

#18239 opened Dec 21, 2025 by pestopoppa

Loading…

cli: buffering info log, only show if model load failed examples

#18236 opened Dec 20, 2025 by ngxson

Loading…

webui: Fix the header backdrop blur examples server

#18230 opened Dec 20, 2025 by ImadSaddik

Loading…

Previous 1 2 3 4 5 … 24 25 Next

Previous Next

ProTip! Follow long discussions with comments:>50.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!