This model is a base model trained on a mix of educational data. It demonstrates reasonable storytelling and factual knowledge for its size, but may hallucinate and is not yet fine-tuned for instruction following.
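A minimal inference sketch, assuming the checkpoint is published as a standard Hugging Face causal language model (the repo id below is a placeholder, not the model's actual id):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder repo id — replace with the model's actual Hugging Face id.
model_id = "your-username/your-model"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# As a base model, it continues the prompt rather than following instructions.
prompt = "Once upon a time"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(
    **inputs,
    max_new_tokens=50,
    do_sample=True,     # sampling settings here are illustrative
    temperature=0.8,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Note that loading the model downloads the weights on first use, and outputs vary between runs when sampling is enabled.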
## Historical Context
This model (151M parameters) reached a scale comparable to OpenAI's original GPT-1 (117M parameters) in under 3 hours on a single consumer GPU, showcasing how dramatically training efficiency has improved in recent years.