Domain Specific Corpora

Updated 10 days ago

A collection of domain-specific corpora, mainly in the Galician language.

  • proxectonos/corpus_dominio_legal_administrativo

    Preview • Updated 10 days ago • 50

  • proxectonos/corpus_dominio_periodistico

    Viewer • Updated 9 days ago • 280k • 41

  • proxectonos/corpus_dominio_cientifico

    Preview • Updated 9 days ago • 52

  • proxectonos/corpus_dominio_museistico_patrimonio

    Viewer • Updated 7 days ago • 14.5k • 64