Sample Efficiency Evaluation

Small causal language models trained for the evaluation of sample efficiency.

J4bb4wukis/mamba2_432m_wikipedia_en_shuffeld 0.4B • Updated Feb 10, 2025
J4bb4wukis/gpt2_355m_wikipedia_en_shuffeld 0.4B • Updated Feb 10, 2025
J4bb4wukis/gpt2_209m_wikipedia_en_shuffeld 0.2B • Updated Jan 31, 2025
J4bb4wukis/xlstm_247m_wikipedia_en_shuffeld 0.2B • Updated Jan 31, 2025