Pull requests: jundot/omlx
Pull requests list
Adds 5 new intelligence benchmarks with bundled JSONL data and admin …
#837
opened Apr 17, 2026 by
SheeJiaWei
fix: clean errors for non-LLM engines and missing STT processors (#507, #800)
#826
opened Apr 17, 2026 by
ethannortharc
Contributor
5 tasks done
2
fix(admin): handle symlinked models in delete_hf_model endpoint (v2)
#824
opened Apr 17, 2026 by
Bahtya
fix: honor response_format in /v1/audio/speech (#753)
#823
opened Apr 17, 2026 by
ethannortharc
Contributor
4 tasks done
fix: SpecPrefill RoPE AttributeError on retry (#766) + Qwen3 MoE VLM misdetection (#812)
#822
opened Apr 17, 2026 by
Chuhan1112
3 tasks
Adds JANG quantized model support to oMLX's batched engine.
#820
opened Apr 17, 2026 by
yohann-bearzi
Contributor
fix(dflash): route image requests to VLM fallback; fix(tts): honour response_format
#818
opened Apr 17, 2026 by
Chuhan1112
7 tasks
feat: preserve thinking across turns for Qwen 3.6+ incl. external clients
#814
opened Apr 16, 2026 by
latent-variable
Contributor
feat: DDTree tree-based speculative decoding (rides DFlash)
#805
opened Apr 16, 2026 by
sooth
8 tasks
refactor(integration): use [profiles.omlx] instead of top-level model override for codex integration
#776
opened Apr 15, 2026 by
yguilai
feat: virtual model profiles via .virtual.yaml files
#773
opened Apr 15, 2026 by
TipKnuckle
Contributor
feat(download): add concurrent model downloads and per-download worker settings
#758
opened Apr 14, 2026 by
uncle9x9
5 tasks done
feat: add single_model_mode to force unload before load
#730
opened Apr 12, 2026 by
jroth1111
[Performance] add hot cache only mode and optimize memory usage
#701
opened Apr 10, 2026 by
RepublicOfKorokke
4 of 14 tasks
feat(admin): add hotswappable engine package management
#679
opened Apr 9, 2026 by
0xClandestine
Add support for ParoQuant and custom quantization method loading
#209
opened Mar 13, 2026 by
liang2kl