jiwon9703/Gemma4-26B-A4B-Korean-Opus-4.6-Distilled
Gemma4-26B-A4B ๊ธฐ๋ฐ ํ๊ตญ์ด Reasoning SFT ๋ชจ๋ธ. Claude Opus 4.6 distilled ํ๊ตญ์ด reasoning ๋ฐ์ดํฐ 12K๋ก ํ์ต. LR 5e-5, alpha=2รr.
๋ชจ๋ธ ์ ๋ณด
| ํญ๋ชฉ | ๋ด์ฉ |
|---|---|
| Base Model | unsloth/gemma-4-26B-A4B-it |
| ํ์ต ๋ฐฉ๋ฒ | LoRA SFT (Unsloth + TRL) |
| ํ๋ ์์ํฌ | transformers, peft |
| ๋ผ์ด์ผ์ค | Apache 2.0 |
ํ์ต ๋ฐ์ดํฐ
์ฌ์ฉ๋ฒ
from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained("jiwon9703/Gemma4-26B-A4B-Korean-SFT-v7")
tokenizer = AutoTokenizer.from_pretrained("jiwon9703/Gemma4-26B-A4B-Korean-SFT-v7")
vLLM ์๋น
vllm serve jiwon9703/Gemma4-26B-A4B-Korean-SFT-v7 --max-model-len 8192 --reasoning-parser gemma4
- Downloads last month
- 42