Pull requests: ml-explore/mlx-lm
Skip quantizing Gemma 4 per_layer_model_projection for Swift compatibility
#1209
opened Apr 27, 2026 by
kr1s0404
3 tasks
feat(server): add OpenAI Responses API endpoint (/v1/responses)
#1207
opened Apr 27, 2026 by
cassiolpaixao90
5 tasks
fix(gemma4): drop KV-shared layer projections in sanitize
#1205
opened Apr 26, 2026 by
Fox13
minimax: validate head_dim against checkpoint, drop unused shared_intermediate_size
#1204
opened Apr 26, 2026 by
adurham
Contributor
Add TurboQuantKVCache: 3-bit/4-bit KV cache compression for generation
#1202
opened Apr 26, 2026 by
dedalien
Add DeepSeek-V4 (Flash) model support
#1201
opened Apr 26, 2026 by
akashgoswami
Draft
3 of 6 tasks
fix: prevent double-shift of norm weights for converted VLM checkpoints
#1198
opened Apr 25, 2026 by
Thump604
feat: add thinking budget with early-stopping prompt injection
#1196
opened Apr 25, 2026 by
Thump604
feat: add DeepSeek-V4 (Pro/Flash) model support
#1189
opened Apr 24, 2026 by
machiabeli
5 of 7 tasks
Include context_length in /v1/models response (#1183)
#1184
opened Apr 23, 2026 by
seikixtc
Auto-discover tool-call markers from tokenizer config fields
#1163
opened Apr 18, 2026 by
michaelstingl
6 tasks done
feat(nemotron_h): add Multi-Token Prediction (MTP) module
#1161
opened Apr 16, 2026 by
Thump604
Add TurboQuantKVCache: data-oblivious 2-4 bit KV cache compression
#1144
opened Apr 12, 2026 by
Smilefounder
Draft
3 tasks done
fix(gemma4): return [] instead of raising on empty tool-call match
#1142
opened Apr 10, 2026 by
gofastercloud