jiwon9703/Gemma4-26B-A4B-Korean-Opus-4.6-Distilled

Gemma4-26B-A4B ๊ธฐ๋ฐ˜ ํ•œ๊ตญ์–ด Reasoning SFT ๋ชจ๋ธ. Claude Opus 4.6 distilled ํ•œ๊ตญ์–ด reasoning ๋ฐ์ดํ„ฐ 12K๋กœ ํ•™์Šต. LR 5e-5, alpha=2ร—r.

๋ชจ๋ธ ์ •๋ณด

ํ•ญ๋ชฉ ๋‚ด์šฉ
Base Model unsloth/gemma-4-26B-A4B-it
ํ•™์Šต ๋ฐฉ๋ฒ• LoRA SFT (Unsloth + TRL)
ํ”„๋ ˆ์ž„์›Œํฌ transformers, peft
๋ผ์ด์„ผ์Šค Apache 2.0

ํ•™์Šต ๋ฐ์ดํ„ฐ

์‚ฌ์šฉ๋ฒ•

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("jiwon9703/Gemma4-26B-A4B-Korean-SFT-v7")
tokenizer = AutoTokenizer.from_pretrained("jiwon9703/Gemma4-26B-A4B-Korean-SFT-v7")

vLLM ์„œ๋น™

vllm serve jiwon9703/Gemma4-26B-A4B-Korean-SFT-v7 --max-model-len 8192 --reasoning-parser gemma4
Downloads last month
42
Safetensors
Model size
26B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for jiwon9703/Gemma4-26B-A4B-Korean-Opus-4.6-Distilled

Finetuned
(9)
this model

Dataset used to train jiwon9703/Gemma4-26B-A4B-Korean-Opus-4.6-Distilled