ml-explore / mlx-lm Public

Pull requests: ml-explore/mlx-lm

119 Open 630 Closed

Pull requests list

Refactor server to use MLXServerConfig

#1213 opened Apr 27, 2026 by giskarda

Add Hy3 preview

#1211 opened Apr 27, 2026 by kernelpool Contributor

Skip quantizing Gemma 4 per_layer_model_projection for Swift compatibility

#1209 opened Apr 27, 2026 by kr1s0404

3 tasks

feat(server): add OpenAI Responses API endpoint (/v1/responses)

#1207 opened Apr 27, 2026 by cassiolpaixao90

5 tasks

fix(gemma4): drop KV-shared layer projections in sanitize

#1205 opened Apr 26, 2026 by Fox13

minimax: validate head_dim against checkpoint, drop unused shared_intermediate_size

#1204 opened Apr 26, 2026 by adurham Contributor

Add TurboQuantKVCache: 3-bit/4-bit KV cache compression for generation

#1202 opened Apr 26, 2026 by dedalien

Add DeepSeek-V4 (Flash) model support

#1201 opened Apr 26, 2026 by akashgoswami • Draft

3 of 6 tasks

Add dense qwen3_5 support for learned quantization

#1200 opened Apr 25, 2026 by iamwavecut

Add EngGPT MoE model support

#1199 opened Apr 25, 2026 by robertobissanti

fix: prevent double-shift of norm weights for converted VLM checkpoints

#1198 opened Apr 25, 2026 by Thump604

feat: add thinking budget with early-stopping prompt injection

#1196 opened Apr 25, 2026 by Thump604

Implement DSV4

#1195 opened Apr 25, 2026 by rltakashige Contributor

Add DeepSeek-v4 (Flash/Pro)

#1192 opened Apr 24, 2026 by Blaizzy Contributor

feat: add DeepSeek-V4 (Pro/Flash) model support

#1189 opened Apr 24, 2026 by machiabeli

5 of 7 tasks

Include context_length in /v1/models response (#1183)

#1184 opened Apr 23, 2026 by seikixtc

Lc/fix xtc special tokens server

#1176 opened Apr 21, 2026 by micuentadecasa Contributor

Auto-discover tool-call markers from tokenizer config fields

#1163 opened Apr 18, 2026 by michaelstingl

6 tasks done

feat(nemotron_h): add Multi-Token Prediction (MTP) module

#1161 opened Apr 16, 2026 by Thump604

Add reasoning → tool state machine transition

#1160 opened Apr 16, 2026 by christiangenco

feature: dynamic quantized model support

#1155 opened Apr 15, 2026 by dsrenesanse • Draft

Feat/mamba mlx kernels

#1153 opened Apr 15, 2026 by Gal-bloch

9 of 11 tasks

feat: Add KL Divergence command

#1146 opened Apr 13, 2026 by spicyneuron Contributor

Add TurboQuantKVCache: data-oblivious 2-4 bit KV cache compression

#1144 opened Apr 12, 2026 by Smilefounder • Draft

3 tasks done

fix(gemma4): return [] instead of raising on empty tool-call match

#1142 opened Apr 10, 2026 by gofastercloud
