Qwen3.5-35B-A3B Moderation โ Sparse (BF16)
Merged LoRA fine-tune of Qwen/Qwen3.5-35B-A3B for chat content moderation (sparse output format).
- Base model: Qwen/Qwen3.5-35B-A3B
- Format: BF16
- Task: 5-category chat moderation (underage, bestiality, selfHarm, sexualViolenceGore, realTerrorism)
- Output: Sparse JSON โ
{}for safe,{"underage": "evidence"}for flagged - Serving: vLLM with
--tensor-parallel-size 1on 1xH200 or--tensor-parallel-size 2on 2xH100, requires CUDA 12.6+
- Downloads last month
- 377