This model aims to test the conversion between Megatron-LM and transformers. It is a small GPT-2-like model that has been used to debug the script. Use it only for integration tests
Downloads last month
23,889
Safetensors
Model size
16.2M params
Tensor type
BF16
ยท
Model tree for bigscience/bigscience-small-testing
docker model run hf.co/bigscience/bigscience-small-testing