Qwen3.6
Collection
4 items • Updated
"FP16" is M1/M2 Apple Silicon only optimization that leads to a very noticeable prompt processing boost. For details and benchmarks see jundot/omlx/issues/604.
Please use deepsweet/Qwen3.6-35B-A3B-MLX-oQ8 if you have M3+ Apple Silicon.
This model was converted to MLX format from Qwen/Qwen3.6-35B-A3B using [oMLX v0.36.0].
Settings:
8-bit
Base model
Qwen/Qwen3.6-35B-A3B