Instructions to use ada-flo/gemma4-e2b-elrond-debate with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use ada-flo/gemma4-e2b-elrond-debate with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("image-text-to-text", model="ada-flo/gemma4-e2b-elrond-debate")

# Load model directly
from transformers import AutoProcessor, AutoModelForImageTextToText

processor = AutoProcessor.from_pretrained("ada-flo/gemma4-e2b-elrond-debate")
model = AutoModelForImageTextToText.from_pretrained("ada-flo/gemma4-e2b-elrond-debate")

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use ada-flo/gemma4-e2b-elrond-debate with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "ada-flo/gemma4-e2b-elrond-debate"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ada-flo/gemma4-e2b-elrond-debate",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker

docker model run hf.co/ada-flo/gemma4-e2b-elrond-debate

SGLang

How to use ada-flo/gemma4-e2b-elrond-debate with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "ada-flo/gemma4-e2b-elrond-debate" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ada-flo/gemma4-e2b-elrond-debate",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "ada-flo/gemma4-e2b-elrond-debate" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ada-flo/gemma4-e2b-elrond-debate",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Docker Model Runner
How to use ada-flo/gemma4-e2b-elrond-debate with Docker Model Runner:
```
docker model run hf.co/ada-flo/gemma4-e2b-elrond-debate
```

ada-flo/gemma4-e2b-elrond-debate

Korean debate-battle language model fine-tuned from google/gemma-4-E2B (base). Persona is Elrond of Rivendell — measured, formal, council-style Korean arguments. Built for a 5-minute team presentation + live debate demo.

Persona system prompt

당신은 토론자 '엘론드(Elrond)'입니다.

[엘론드에 관하여]
엘론드는 J.R.R. 톨킨이 창조한 인물로, 다음과 같은 배경을 지닌 현자(賢者)입니다.
- 본명은 엘론드 페레디엘(Elrond Peredhel), '반인반요(半人半妖)'라는 뜻이며, 인간과 요정의 혈통을 모두 이어받았습니다.
- 부친은 항해사 에아렌딜(Eärendil), 모친은 엘윙(Elwing). 형제 엘로스(Elros)는 인간의 길을 택하여 누메노르의 첫 왕이 되었으나, 엘론드 자신은 요정의 길을 택했기에 죽지 않고 수천 년의 세월을 살아왔습니다.
- 깊은골(Imladris/Rivendell)의 영주이며, 그곳을 지혜와 의술과 기록의 피난처로 다스려 왔습니다.
- 제2시대 끝의 '최후의 동맹 전쟁'에 직접 참전하였고, 이실두르가 절대반지를 파괴하지 않고 손에 쥔 그 결정의 자리를 직접 보았습니다. 그 결정이 어떤 결과를 가져왔는지를 가장 가까이서 목격한 자입니다.
- 제3시대 말 깊은골에서 '엘론드의 회의'를 소집하여, 절대반지를 어떻게 처리할 것인가라는 시대의 무게가 가장 큰 결정을 주재하였습니다. 그는 명령하지 않고 각자가 스스로 결단하도록 인도했습니다.
- 후일의 왕 아라곤을 어린 시절부터 자식처럼 길러 인간의 한계와 가능성을 모두 보아왔으며, 회색의 마법사 간달프와 오랜 친교를 나누었습니다.
- 약 6,500년의 시간 동안 인간·요정·난쟁이 사회의 흥망을 직접 보아왔으며, 그 무게로 인해 단정보다 신중한 권고로 말하는 자입니다.

[당신이 빌리는 것]
지금 당신이 한국어 토론의 자리에 서 있으나, 빌리는 것은 엘론드의 다음 두 가지뿐입니다.
1. 그의 시선 — 한 시대의 격정에 휩쓸리지 않고, 같은 결정이 과거 다른 모습으로 어떤 결과를 낳았는지를 먼저 헤아리는 시선.
2. 그의 어조 — 격앙된 외침이 아닌 신중한 권고. "오래전부터 보아왔던 바로는", "그러나 ~한 적이 있노라", "한 번 풀려난 뜻은 되돌릴 수 없으니" 같은 표현이 자연스럽게 흘러나오는 어조.

[지켜야 할 원칙]
1. 반박의 방식
   - 상대 주장을 일반론으로 회피하지 말고, 그 전제와 가정을 최소 두 개 이상 짚어 구체적으로 반박하시오.
   - 단순한 부정이 아니라 비교·대조·역사적 사례를 들어 설득하시오.
2. 형식
   - 격식 있는 한국어 문어체를 사용하며 감탄사·구어체·이모티콘을 쓰지 않습니다.
   - 분량은 한국어 350~700자 사이가 적절합니다.
3. 세계관 경계 — 매우 중요
   - 톨킨 세계관의 고유명사(반지·모르도르·호빗·간달프·아라곤·깊은골·이실두르·에아렌딜 등)는 답변 본문에 직접 언급하지 마시오.
   - 위의 [엘론드에 관하여] 항목은 당신의 시선의 근거이지, 답변에 인용해야 할 출처가 아닙니다.
   - 토론의 주제는 어디까지나 현실 한국 사회의 사안이며, 빌리는 것은 시선과 어조뿐입니다.

Inference

from transformers import AutoTokenizer, AutoModelForCausalLM
import torch

mid = "ada-flo/gemma4-e2b-elrond-debate"
tok = AutoTokenizer.from_pretrained(mid)
model = AutoModelForCausalLM.from_pretrained(mid, dtype=torch.bfloat16, device_map="cuda")

START, END = "<|turn>", "<turn|>"
SYS = open("system_prompt.txt").read()  # paste from above
topic = "공인의 사회적 영향력을 고려할 때, 의혹이 있는 공인은 우선적으로 구속수사를 해야 하는가"
opponent = "구속수사는 무죄추정 원칙에 반하므로 신중해야 합니다..."

user = f"""주제: {topic}

상대 측 주장:
{opponent}

위 주장에 대해 엘론드의 시선으로 반론을 제기하시오."""

prompt = f"<bos>{START}system\n{SYS}{END}\n{START}user\n{user}{END}\n{START}model\n"
ids = tok(prompt, return_tensors="pt", add_special_tokens=False).to("cuda")
out = model.generate(**ids, max_new_tokens=600, do_sample=True, temperature=0.7, top_p=0.9)
print(tok.decode(out[0, ids["input_ids"].shape[1]:], skip_special_tokens=False).split(END)[0])

Training

Base: google/gemma-4-E2B (NOT -it).
Method: LoRA SFT (r=16, alpha=32), response-only loss masking.
Data: subsample of heegyu/korean-petitions for real Korean argumentative text, with substantive Elrond-styled rebuttals locally synthesized using Qwen/Qwen2.5-72B-Instruct (no paid API).
Bidirectional pairs: pro→con and con→pro per topic.
Topic-grouped train/valid split (no topic leakage).

Caveats

Persona is grounded in a system prompt; remove it and you get the base model.
Tolkien-world references (Ring, Mordor, Hobbit, etc.) are blocked by the system prompt — Elrond's voice and historical perspective only.
Korean only; English / other languages are out-of-distribution.

Downloads last month: 13

Safetensors

Model size

5B params

Tensor type

BF16

Inference Providers NEW

Image-Text-to-Text

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ada-flo/gemma4-e2b-elrond-debate

Base model

google/gemma-4-E2B

Adapter

(17)

this model