Update README.md
README.md CHANGED
@@ -1,17 +1,31 @@
 ---
 base_model: meta-llama/Llama-3.2-1B-Instruct
 library_name: peft
+datasets:
+- mlabonne/orpo-dpo-mix-40k
 ---
 
-# Model Card for Model ID
-
-## Model Details
-
-### Model Description
-
+This model is a fine-tuned version of [meta-llama/Llama-3.2-1B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct), optimized with the ORPO (Odds Ratio Preference Optimization) trainer. Fine-tuning was performed on a subset of the [mlabonne/orpo-dpo-mix-40k](https://huggingface.co/datasets/mlabonne/orpo-dpo-mix-40k) dataset, with only 100 samples selected to enable rapid training with ORPO's efficient, reference-model-free approach.
+
+**Fine-tuning Method:** ORPO
+**Dataset:** mlabonne/orpo-dpo-mix-40k
+
+**Evaluation**
+
+The model was evaluated on the following benchmark, with these results:
+
+| Tasks     | Version | Filter | n-shot | Metric   |   |  Value |   | Stderr |
+|-----------|--------:|--------|-------:|----------|---|-------:|---|-------:|
+| hellaswag |       1 | none   |      0 | acc      | ↑ | 0.4772 | ± | 0.0050 |
+|           |         | none   |      0 | acc_norm | ↑ | 0.6366 | ± | 0.0048 |
+
 
 <!-- Provide a longer summary of what this model is. -->
 
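The training script itself is not part of this commit, but a run matching the description above could look roughly like the following sketch, using `trl`'s `ORPOTrainer` with a `peft` LoRA adapter. The LoRA settings, hyperparameters, and output path are illustrative assumptions, not the author's recorded configuration:

```python
# Illustrative reconstruction only: the actual training script and
# hyperparameters are not included in this repository.
from datasets import load_dataset
from peft import LoraConfig
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import ORPOConfig, ORPOTrainer

base = "meta-llama/Llama-3.2-1B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base)

# Only 100 preference pairs, as stated in the card. Recent trl versions
# accept the dataset's conversational chosen/rejected format directly.
train_ds = load_dataset("mlabonne/orpo-dpo-mix-40k", split="train").select(range(100))

# Assumed LoRA settings; the card does not list the adapter config.
peft_config = LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05, task_type="CAUSAL_LM")

args = ORPOConfig(
    output_dir="llama-3.2-1b-instruct-orpo",  # hypothetical path
    per_device_train_batch_size=2,
    num_train_epochs=1,
    learning_rate=8e-6,
    beta=0.1,  # weight of the odds-ratio term in the ORPO loss
    max_length=1024,
    max_prompt_length=512,
)

trainer = ORPOTrainer(
    model=model,
    args=args,
    train_dataset=train_ds,
    processing_class=tokenizer,  # `tokenizer=` in older trl releases
    peft_config=peft_config,
)
trainer.train()
trainer.save_model()
```

Because ORPO folds preference optimization into a single stage and needs no separate reference model, even a 100-sample run like this completes quickly, which is consistent with the card's "rapid training" claim.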
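The results table matches the output format of EleutherAI's lm-evaluation-harness, so the numbers could presumably be reproduced along these lines; the exact invocation is not given in the card, and the adapter path below is a placeholder:

```python
# Sketch of reproducing the table with lm-evaluation-harness
# (pip install lm-eval); the author's exact invocation is not given.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",
    # Placeholder: point `peft` at this repo's adapter weights.
    model_args="pretrained=meta-llama/Llama-3.2-1B-Instruct,peft=<adapter-repo-or-path>",
    tasks=["hellaswag"],
    num_fewshot=0,
)
print(results["results"]["hellaswag"])  # acc / acc_norm with stderr, as above
```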
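Since the card declares `library_name: peft`, inference presumably means loading the adapter on top of the base model. A minimal usage sketch, with the adapter repo id left as a placeholder:

```python
# Minimal usage sketch, assuming this repo hosts a LoRA adapter for the
# base model; the repo id below is a placeholder.
from peft import AutoPeftModelForCausalLM
from transformers import AutoTokenizer

adapter_id = "<this-repo-id>"  # placeholder
model = AutoPeftModelForCausalLM.from_pretrained(adapter_id)
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-3.2-1B-Instruct")

messages = [{"role": "user", "content": "Summarize ORPO in one sentence."}]
input_ids = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt")
output = model.generate(input_ids, max_new_tokens=64)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```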