[pull] master from ggml-org:master #94

Merged: pull[bot] merged 3 commits into LongLeCE:master from ggml-org:master on Apr 15, 2025

pull[bot] commented Apr 15, 2025

See Commits and Changes for more details.


Created by pull[bot] (v2.0.0-alpha.1)

Can you help keep this open source service alive? 💖 Please sponsor : )

Srihari-mcw and others added 3 commits April 15, 2025 09:22
* Add AVX512 implementation of GEMM - q4kx8

* Update changes to remove unnecessary whitespaces
* Merged using squash to remove all noise commit messages

* Force flash attention off for `LLM_ARCH_DEEPSEEK2` - embedding too large

* Removed 3 conts (2x RoPE and 1x RMS-norm)

* Changed to use `<cmath>` instead of `<math.h>`

* Reverted removal of the 3 conts

* Used `reshape` in `llm_graph_context::build_attn_mha()`

* Use `k_pe = ggml_reshape`

* Removed the 3 conts again

* Removed the 3D views of `wk_b` and `wv_b`, and just save as 3D in GGUF

* Removed MQA optimisation from `build_attn_mha()` as no gains now

* Simplified `is_mla` branch in `llm_build_deepseek2()`

* Removed `build_attn_mla` and added `nullptr` to all `build_attn` calls

* Fixed call to `build_attn` in `llm_build_t5_enc`
* SYCL: Add ROPE vision kernel

* Add comment about rope mode
pull[bot] added the ⤵️ pull label Apr 15, 2025
pull[bot] merged commit 5106764 into LongLeCE:master Apr 15, 2025