Hugging Face
proxectonos's Collections
Domain Specific Corpora
CorpusNÓS: A massive Galician corpus for training LLM
Text Datasets for Fine-tuning and Instruction tuning
Text Datasets for Evaluation
MT
Text Models
TTS Models
ASR Models
Instruction Pretrained Experiments
MT Models (former)
ASR Datasets
TTS Datasets
CorpusNÓS: A massive Galician corpus for training LLM
Updated 4 days ago
CorpusNÓS is the largest collection of Galician-language data for training LLMs.
proxectonos/corpusnos • Updated 4 days ago • 10.8M • 63