ml-explore / mlx-swift-lm Public

Notifications You must be signed in to change notification settings
Fork 168
Star 432

Code
Issues 40
Pull requests 28
Discussions
Actions
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Security and quality
Insights

Pull requests: ml-explore/mlx-swift-lm

Labels 14 Milestones 0

New pull request New

28 Open 127 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

add TurboQuant KV cache compression

#232 opened Apr 22, 2026 by TheTom • Draft

add kvScheme parameter for extensible KV cache compression

#230 opened Apr 21, 2026 by TheTom

Loading…

4 tasks done

fix segsum dtype promotion -- 2x memory waste on hybrid SSM models

#229 opened Apr 21, 2026 by TheTom

Loading…

4 tasks done

fix Gemma 4 MoE router -- softmax order + fuse norm dispatches

#228 opened Apr 21, 2026 by TheTom

Loading…

4 tasks done

add FusedGateUpSwitchGLU -- single fused gate_up_proj for MoE models

#227 opened Apr 21, 2026 by TheTom

Loading…

4 tasks done

fix: disable CIContext intermediate caching to prevent IOSurface exha…

#226 opened Apr 21, 2026 by Satchitananda • Draft

4 tasks

pipeline prefill chunks with asyncEval -- 10x on GDN models

#225 opened Apr 20, 2026 by TheTom

Loading…

2 of 4 tasks

fix gated delta state precision -- fp32 state to match Python

#224 opened Apr 20, 2026 by TheTom

Loading…

4 tasks done

Fix EmbeddingGemma init-order crash + dense head hidden size

#223 opened Apr 20, 2026 by 0xweb3r

Loading…

2 of 3 tasks

Fix Qwen2.5-VL: 7 bugs + preprocessing for Python mlx-vlm parity

#222 opened Apr 19, 2026 by NivDvir

Loading…

Gemma 3/4 tool calling support

#215 opened Apr 15, 2026 by BRVWL

Loading…

Fix sporadic JSON tool-call detection and add parser hardening for mixed text/tool outputs

#205 opened Apr 11, 2026 by aleroot Contributor

Loading…

4 tasks done

Add Gemma 3n E4B audio encoder (Conformer) support

#194 opened Apr 7, 2026 by vahsaechao

Loading…

3 of 4 tasks

feat: expose speculative decoding in ChatSession (#181)

#193 opened Apr 7, 2026 by VDurocher

Loading…

Add Gemma 4 audio tower support (ASR via Conformer encoder)

#192 opened Apr 7, 2026 by antmanler

Loading…

Adopt GemmaFunctionParser to accomodate Gemma4 tool calls. swift-format

Swift format failure in CI

#183 opened Apr 4, 2026 by viktike

Loading…

2 of 4 tasks

Fix inaccuracies in (and possibly remove) "skills"

#175 opened Apr 1, 2026 by DePasqualeOrg Contributor

Loading…

Forward tools and additionalContext in GlmOcr and SmolVLM2 processors

#174 opened Mar 31, 2026 by alankessler Contributor

Loading…

Handle stringified JSON tool call arguments swift-format

Swift format failure in CI

#172 opened Mar 30, 2026 by kuosuko

Loading…

fix: flatten prompt in TokenRing.loadPrompt to handle 2D inputs

#170 opened Mar 29, 2026 by spokvulcan Contributor

Loading…

2 tasks done

Pass tools schema to ToolCallProcessor for type-aware parsing

#167 opened Mar 28, 2026 by alankessler Contributor

Loading…

feat: add ParoQuant (pairwise rotation quantization) support

#164 opened Mar 27, 2026 by spokvulcan Contributor

Loading…

4 tasks done

Add TurboQuant KV cache backend swift-format

Swift format failure in CI

#160 opened Mar 25, 2026 by timonharz

Loading…

Fix Qwen35 VLM crash on text-only inference

#149 opened Mar 15, 2026 by dirvine

Loading…

Add parser for GPT-OSS Harmony tool call format

#146 opened Mar 13, 2026 by aleroot Contributor

Loading…

3 of 4 tasks

Previous 1 2 Next

Previous Next

ProTip! Type g p on any issue or pull request to go back to the pull request listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!