[pull] master from ggml-org:master by pull[bot] · Pull Request #1083 · LongLeCE/llama.cpp

pull · 2026-04-16T20:42:02Z

See Commits and Changes for more details.

Created by pull[bot] (v2.0.0-alpha.4)

Can you help keep this open source service alive? 💖 Please sponsor : )

* support nvfp4 tensors for Gemma4 * add wo_s to build_attn * add wo_s to build_attn * fix glm4

…ers (#21245) * model : refactor QKV into common build_qkv and create_tensor_qkv helpers * model : extend build_qkv to bert/mpt/dbrx/olmo/lfm2/nemotron-h/granite-hybrid/gemma3n-iswa/t5-dec and fix wqkv_s

…#21980) * server: tests: fetch random media marker via /apply-template (#21962 fix) * server: allow pinning media marker via LLAMA_MEDIA_MARKER env var get_media_marker() checks LLAMA_MEDIA_MARKER at first call and uses it as-is if set, falling back to the random marker otherwise. Tests no longer need to fetch the marker dynamically via /apply-template: the fixture sets LLAMA_MEDIA_MARKER=<__media__> so the hardcoded prompts work as before. Address review feedback from ngxson * server: make get_media_marker() thread-safe via magic statics Use a C++11 static local with a lambda initializer instead of a global static with an empty-check. The runtime guarantees initialization exactly once without explicit locking. Address review feedback from ggerganov * nits * nits

* model: using single llm_build per arch * fix merge * nits

CISC and others added 5 commits April 16, 2026 16:51

model : support NVFP4 tensors for Gemma4 (#21971)

f772f6e

* support nvfp4 tensors for Gemma4 * add wo_s to build_attn * add wo_s to build_attn * fix glm4

model : refactor QKV into common build_qkv and create_tensor_qkv help…

9db77a0

…ers (#21245) * model : refactor QKV into common build_qkv and create_tensor_qkv helpers * model : extend build_qkv to bert/mpt/dbrx/olmo/lfm2/nemotron-h/granite-hybrid/gemma3n-iswa/t5-dec and fix wqkv_s

opencl: add q5_K gemm and gemv kernels for Adreno (#21595)

e45dbde

model: using single llm_build per arch (#21970)

4fbdabd

* model: using single llm_build per arch * fix merge * nits

pull Bot locked and limited conversation to collaborators Apr 16, 2026

pull Bot added the ⤵️ pull label Apr 16, 2026

pull Bot merged commit 4fbdabd into LongLeCE:master Apr 16, 2026

github-actions Bot added examples python ggml OpenCL server model labels Apr 16, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[pull] master from ggml-org:master#1083

[pull] master from ggml-org:master#1083
pull[bot] merged 5 commits intoLongLeCE:masterfrom
ggml-org:master

pull Bot commented Apr 16, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

pull Bot commented Apr 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

pull Bot commented Apr 16, 2026 •

edited

Loading