SmolLM 🤏 SmolLM models, datasets and demos HuggingFaceTB/SmolLM3-3B Text Generation • 3B • Updated Sep 10, 2025 • 1.07M • 930 HuggingFaceTB/SmolLM2-1.7B-Instruct Text Generation • Updated Apr 21, 2025 • 158k • 728 HuggingFaceTB/SmolVLM-Instruct Image-Text-to-Text • 2B • Updated Apr 8, 2025 • 29.8k • 583 HuggingFaceTB/SmolLM2-360M-Instruct Text Generation • Updated Sep 22, 2025 • 437k • 185
📚 Filtering the web with LLMs HuggingFaceFW/fineweb-edu Viewer • Updated Jul 11, 2025 • 3.5B • 355k • 1.02k HuggingFaceFW/fineweb-edu-classifier Text Classification • 0.1B • Updated Nov 17, 2024 • 8k • • 210 HuggingFaceFW/ablation-model-fineweb-edu Text Generation • 2B • Updated Jun 11, 2024 • 452 • 21 math-ai/AutoMathText Viewer • Updated Jul 16, 2025 • 7.89M • 8.33k • 185
✨ Code Generation Code generation models and datassets! bigcode/starcoder2-15b Text Generation • Updated Jun 5, 2024 • 7.5k • 668 bigcode/the-stack Viewer • Updated Apr 13, 2023 • 546M • 11.9k • 979 bigcode/starcoder2-3b Text Generation • 3B • Updated Mar 4, 2024 • 90.6k • 217 bigcode/starcoder Text Generation • 16B • Updated Oct 8, 2024 • 10.8k • 2.94k
Instruct datasets QuixiAI/SystemChat-2.0 Viewer • Updated Jun 15, 2025 • 141k • 263 • 74 arcee-ai/infini-instruct-top-500k Viewer • Updated Jun 30, 2024 • 500k • 27 • 6 arcee-ai/The-Tome Viewer • Updated Aug 15, 2024 • 1.75M • 262 • 105 teknium/OpenHermes-2.5 Viewer • Updated Apr 15, 2024 • 1M • 24.3k • 809
🌌 Synthetic textbooks Synthetically generated textbooks HuggingFaceTB/cosmopedia Viewer • Updated Aug 12, 2024 • 31.1M • 14.2k • 683 Locutusque/UltraTextbooks Viewer • Updated Feb 2, 2024 • 5.52M • 343 • 198 microsoft/phi-2 Text Generation • 3B • Updated Dec 8, 2025 • 1.19M • 3.45k HuggingFaceTB/cosmo-1b Text Generation • 2B • Updated Jul 8, 2024 • 219 • 134
SmolLM 🤏 SmolLM models, datasets and demos HuggingFaceTB/SmolLM3-3B Text Generation • 3B • Updated Sep 10, 2025 • 1.07M • 930 HuggingFaceTB/SmolLM2-1.7B-Instruct Text Generation • Updated Apr 21, 2025 • 158k • 728 HuggingFaceTB/SmolVLM-Instruct Image-Text-to-Text • 2B • Updated Apr 8, 2025 • 29.8k • 583 HuggingFaceTB/SmolLM2-360M-Instruct Text Generation • Updated Sep 22, 2025 • 437k • 185
Instruct datasets QuixiAI/SystemChat-2.0 Viewer • Updated Jun 15, 2025 • 141k • 263 • 74 arcee-ai/infini-instruct-top-500k Viewer • Updated Jun 30, 2024 • 500k • 27 • 6 arcee-ai/The-Tome Viewer • Updated Aug 15, 2024 • 1.75M • 262 • 105 teknium/OpenHermes-2.5 Viewer • Updated Apr 15, 2024 • 1M • 24.3k • 809
📚 Filtering the web with LLMs HuggingFaceFW/fineweb-edu Viewer • Updated Jul 11, 2025 • 3.5B • 355k • 1.02k HuggingFaceFW/fineweb-edu-classifier Text Classification • 0.1B • Updated Nov 17, 2024 • 8k • • 210 HuggingFaceFW/ablation-model-fineweb-edu Text Generation • 2B • Updated Jun 11, 2024 • 452 • 21 math-ai/AutoMathText Viewer • Updated Jul 16, 2025 • 7.89M • 8.33k • 185
🌌 Synthetic textbooks Synthetically generated textbooks HuggingFaceTB/cosmopedia Viewer • Updated Aug 12, 2024 • 31.1M • 14.2k • 683 Locutusque/UltraTextbooks Viewer • Updated Feb 2, 2024 • 5.52M • 343 • 198 microsoft/phi-2 Text Generation • 3B • Updated Dec 8, 2025 • 1.19M • 3.45k HuggingFaceTB/cosmo-1b Text Generation • 2B • Updated Jul 8, 2024 • 219 • 134
✨ Code Generation Code generation models and datassets! bigcode/starcoder2-15b Text Generation • Updated Jun 5, 2024 • 7.5k • 668 bigcode/the-stack Viewer • Updated Apr 13, 2023 • 546M • 11.9k • 979 bigcode/starcoder2-3b Text Generation • 3B • Updated Mar 4, 2024 • 90.6k • 217 bigcode/starcoder Text Generation • 16B • Updated Oct 8, 2024 • 10.8k • 2.94k