# Qwen3.6-35B-A3B - Q6_K GGUF Quantization

This repository contains the Q6_K GGUF format of the Qwen3.6-35B-A3B model.

These files were quantized by Abiray using llama.cpp to make the model accessible for consumer hardware and CPU-heavy environments.
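As a rough sanity check on download size, you can estimate the file size from the parameter count. This is a sketch under the assumption that Q6_K stores about 6.5625 bits per weight (the nominal figure used in llama.cpp); real files run slightly larger because of metadata and mixed tensor types.

```python
# Rough GGUF file-size estimate for a Q6_K quantization.
# Assumption: Q6_K ~= 6.5625 bits per weight; actual files are a bit
# larger due to metadata and a few tensors kept at higher precision.
def q6k_size_gb(n_params: float, bits_per_weight: float = 6.5625) -> float:
    """Return the approximate file size in gigabytes."""
    return n_params * bits_per_weight / 8 / 1e9

print(round(q6k_size_gb(35e9), 1))  # roughly 28.7 GB for 35B params
```

If the file you downloaded is wildly off from this ballpark, the download is likely truncated or split.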

## 📦 Other Available Formats

I have processed this model into several other quantization formats, which you can find in my other repositories.

## 💻 How to run with llama.cpp

You can run this model locally using llama-cli from the llama.cpp project.

```shell
# Example command (adjust threads and context size to your machine)
./llama-cli -m Qwen3.6-35B-A3B-Q6_K.gguf -p "Your prompt here" -n 512 -t 8 -c 4096
```
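llama.cpp also ships `llama-server`, which exposes an OpenAI-compatible HTTP API. Below is a minimal sketch of building a chat-completion request body for it; the server command, port, and endpoint path shown in the comments are assumptions based on llama-server's defaults, so adjust them to your setup.

```python
import json

# Minimal sketch for talking to llama-server's OpenAI-compatible API.
# Assumed setup (adjust to your machine):
#   ./llama-server -m Qwen3.6-35B-A3B-Q6_K.gguf -c 4096 --port 8080
# which serves http://localhost:8080/v1/chat/completions.
def build_chat_request(prompt: str, max_tokens: int = 512) -> str:
    """Build the JSON body for a single-turn chat completion request."""
    return json.dumps({
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
    })

body = build_chat_request("Your prompt here")
# Send it with any HTTP client, e.g.:
#   curl http://localhost:8080/v1/chat/completions \
#        -H "Content-Type: application/json" -d @- <<< "$body"
```

The server approach is convenient when you want to keep the model loaded between prompts instead of paying the load time on every `llama-cli` invocation.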
Model details: 35B parameters, `qwen35moe` architecture, 6-bit (Q6_K) quantization.