Qwen3.5-35B-A3B Moderation โ€” Sparse (BF16)

Merged LoRA fine-tune of Qwen/Qwen3.5-35B-A3B for chat content moderation (sparse output format).

  • Base model: Qwen/Qwen3.5-35B-A3B
  • Format: BF16
  • Task: 5-category chat moderation (underage, bestiality, selfHarm, sexualViolenceGore, realTerrorism)
  • Output: Sparse JSON โ€” {} for safe, {"underage": "evidence"} for flagged
  • Serving: vLLM with --tensor-parallel-size 1 on 1xH200 or --tensor-parallel-size 2 on 2xH100, requires CUDA 12.6+
Downloads last month
377
Safetensors
Model size
36B params
Tensor type
BF16
ยท
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for od-dev/qwen3.5-35b-a3b-mod-sparse-merged

Finetuned
(84)
this model