Uploaded finetuned model
- Developed by: NamrataThakur
- License: apache-2.0
- Finetuned from model : unsloth/meta-llama-3.1-8b-unsloth-bnb-4bit
This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.
| Base Model | Fine-Tuning | Train Dataset | Validation Loss | Evaluation Dataset | Mean Answer Relevancy Score | Mean Answer Correctness Score |
|---|---|---|---|---|---|---|
| Llama3.1-8bn | Supervised Fine-Tuning | GSM8K | 1.12 | SmallThoughts | 0.736 | 0.437 |
- Downloads last month
- 11