nthakur/mirage-bench-sft-teacher-llama-3
Viewer • Updated • 52.6k • 25 • 4
How to use nthakur/Meta-Llama-3-8B-Instruct-miracl-raft-sft-v2.0 with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B-Instruct")
model = PeftModel.from_pretrained(base_model, "nthakur/Meta-Llama-3-8B-Instruct-miracl-raft-sft-v2.0")This model is a fine-tuned version of meta-llama/Meta-Llama-3-8B-Instruct on the nthakur/miracl-raft-sft-instruct-v0.2 dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 1.5961 | 0.1316 | 200 | 1.4755 |
| 1.6583 | 0.2633 | 400 | 1.4443 |
| 1.5272 | 0.3949 | 600 | 1.4324 |
| 1.5215 | 0.5266 | 800 | 1.4255 |
| 1.4857 | 0.6582 | 1000 | 1.4218 |
| 1.5324 | 0.7899 | 1200 | 1.4199 |
| 1.5235 | 0.9215 | 1400 | 1.4193 |
Base model
meta-llama/Meta-Llama-3-8B-Instruct