πŸ’Ž Gemma 3 12B IT Abliterated β€” ComfyUI Text Encoder

This is a ComfyUI-compatible, single-file repack of mlabonne/gemma-3-12b-it-abliterated, built as a drop-in replacement for the stock comfy_gemma_3_12B_it.safetensors text encoder used by the LTXV Audio Text Encoder Loader and other Gemma 3 12B ComfyUI nodes.

πŸ”§ What's changed vs. the upstream abliterated model

The upstream model is an HF transformers checkpoint split across 5 safetensors shards with Gemma3ForConditionalGeneration's key layout. ComfyUI expects a single file with a slightly different layout and an embedded SentencePiece tokenizer. This repack does the following:

  1. Merged all 5 shards into one .safetensors file (~23GB, bf16).
  2. Remapped keys to match ComfyUI's expected layout:
    • language_model.model.* β†’ model.*
    • vision_tower.vision_model.* β†’ vision_model.*
    • multi_modal_projector.* β†’ (unchanged)
  3. Embedded the SentencePiece tokenizer as a spiece_model uint8 byte tensor (~4.5MB), matching the official ComfyUI Gemma 3 12B text encoder layout. Without it, ComfyUI's SPieceTokenizer raises an "invalid tokenizer" error on load.
  4. Dropped nothing β€” all 1066 tensors (1065 weights + 1 tokenizer) match the stock encoder's structure exactly (zero missing, zero extra).
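The key remapping in step 2 can be sketched as a simple prefix rewrite. This is an illustrative helper, not ComfyUI or repack code; the function name is hypothetical.

```python
# Hypothetical sketch of the prefix remapping applied during the repack.
# Keys outside the two remapped prefixes (e.g. multi_modal_projector.*)
# pass through unchanged.
REMAPS = (
    ("language_model.model.", "model."),
    ("vision_tower.vision_model.", "vision_model."),
)

def remap_key(key: str) -> str:
    for old, new in REMAPS:
        if key.startswith(old):
            return new + key[len(old):]
    return key
```

Applied over all tensors in the five shards, this produces the single-file layout ComfyUI expects.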

πŸ“¦ Why

Uncensored video generation workflows (activism footage, protest imagery, political speech) need a text encoder that doesn't refuse legitimate subject matter. The stock ComfyUI Gemma 3 12B encoder refuses many prompts that are harmless in context. This repack gives you the abliterated model's acceptance rate with zero changes to your ComfyUI nodes or workflow graph: just swap the file.

πŸš€ Usage

Drop the file into ComfyUI/models/text_encoders/ and select it in the LTXV Audio Text Encoder Loader node (or any node that accepts a Gemma 3 12B text encoder).
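A minimal sketch of the file placement, assuming a relative ComfyUI install directory and the repack's filename (both are assumptions; adjust to your setup):

```python
# Hypothetical paths: adjust "ComfyUI" and the source filename to your setup.
import shutil
from pathlib import Path

dst = Path("ComfyUI/models/text_encoders")
dst.mkdir(parents=True, exist_ok=True)  # create the folder if missing

src = Path("comfy_gemma_3_12B_it_abliterated.safetensors")
if src.exists():
    shutil.copy2(src, dst)  # drop the encoder where ComfyUI scans for it
```

After restarting ComfyUI (or refreshing the node's file list), the new file appears in the loader's dropdown.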

Recommended generation parameters (from upstream): temperature=1.0, top_k=64, top_p=0.95.


Original Upstream Model Card

πŸ’Ž Gemma 3 12B IT Abliterated


Gemma 3 1B Abliterated β€’ Gemma 3 4B Abliterated β€’ Gemma 3 27B Abliterated

This is an uncensored version of google/gemma-3-12b-it created with a new abliteration technique. See this article to learn more about abliteration.

I was playing with model weights and noticed that Gemma 3 was much more resilient to abliteration than other models like Qwen 2.5. I experimented with a few recipes to remove refusals while preserving most of the model capabilities.

Note that this is fairly experimental, so it might not turn out as well as expected. I saw some garbled text from time to time (e.g., "It' my" instead of "It's my").

I recommend using these generation parameters: temperature=1.0, top_k=64, top_p=0.95.

βœ‚οΈ Layerwise abliteration


In the original technique, a refusal direction is computed by comparing the residual streams between target (harmful) and baseline (harmless) samples.

Here, the model was abliterated by computing a refusal direction from hidden states (inspired by Sumandora's repo) for most layers (layers 3 to 45) independently. This is combined with a refusal weight of 0.6, which scales the importance of the refusal direction in each layer.

This created a very high acceptance rate (>90%) and still produced coherent outputs.
