Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
proxectonos
's Collections
Domain Specific Corpora
CorpusNÓS: A massive Galician corpus for training LLM
Text Datasets for Fine-tuning and Instruction tuning
Text Datasets for Evaluation
MT
Text Models
TTS Models
ASR Models
Instruction Pretrained Experiments
MT Models (former)
ASR Datasets
TTS Datasets
Domain Specific Corpora
updated
10 days ago
Collection of corpora prepared from specific domains mainly in Galician language.
Upvote
-
proxectonos/corpus_dominio_legal_administrativo
Preview
•
Updated
10 days ago
•
50
proxectonos/corpus_dominio_periodistico
Viewer
•
Updated
9 days ago
•
280k
•
41
proxectonos/corpus_dominio_cientifico
Preview
•
Updated
9 days ago
•
52
proxectonos/corpus_dominio_museistico_patrimonio
Viewer
•
Updated
7 days ago
•
14.5k
•
64
Upvote
-
Share collection
View history
Collection guide
Browse collections