| --- |
| license: apache-2.0 |
| base_model: facebook/wav2vec2-xls-r-300m |
| datasets: |
| - openslr/openslr |
| - mozilla-foundation/common_voice_17_0 |
| - GTM-UVigo/FalAI |
| - google/fleurs |
| - proxectonos/Nos_Parlaspeech-GL |
| language: |
| - gl |
| metrics: |
| - wer |
| - cer |
| tags: |
| - audio |
| - automatic-speech-recognition |
| - gl |
| model-index: |
| - name: Wav2Vec2-XLS-R-300M-GL |
| results: |
| - task: |
| name: Automatic Speech Recognition |
| type: automatic-speech-recognition |
| dataset: |
| name: Common Voice 17.0 |
| type: mozilla-foundation/common_voice_17_0 |
| args: gl |
| metrics: |
| - name: WER |
| type: wer |
| value: 12.04 |
| - name: CER |
| type: cer |
| value: 3.82 |
| - task: |
| name: Automatic Speech Recognition |
| type: automatic-speech-recognition |
| dataset: |
| name: OpenSLR |
| type: openslr |
| args: gl |
| metrics: |
| - name: WER |
| type: wer |
| value: 7.85 |
| - name: CER |
| type: cer |
| value: 1.66 |
| - task: |
| name: Automatic Speech Recognition |
| type: automatic-speech-recognition |
| dataset: |
| name: FalAI |
| type: falai |
| args: validated |
| metrics: |
| - name: WER |
| type: wer |
| value: 4.39 |
| - name: CER |
| type: cer |
| value: 1.17 |
| - task: |
| name: Automatic Speech Recognition |
| type: automatic-speech-recognition |
| dataset: |
| name: FLEURS |
| type: fleurs |
| args: gl_es |
| metrics: |
| - name: WER |
| type: wer |
| value: 15.83 |
| - name: CER |
| type: cer |
| value: 5.08 |
| - task: |
| name: Automatic Speech Recognition |
| type: automatic-speech-recognition |
| dataset: |
| name: Nos_Parlaspeech-GL |
| type: nos_parlaspeech-GL |
| args: clean |
| metrics: |
| - name: WER |
| type: wer |
| value: 8.92 |
| - name: CER |
| type: cer |
| value: 2.65 |
| --- |
| ## Wav2Vec2-XLS-R-300M-GL |
|
|
| This model is a finetuned version of [Facebook's Wav2Vec2 XLS-R 300M](https://huggingface.co/facebook/wav2vec2-xls-r-300m) for Galician on the datasets [Common Voice Corpus 17.0](https://datacollective.mozillafoundation.org/datasets/cmj8u3p5r009tnxxb88cqm12d), [Open SLR77](https://www.openslr.org/77), [FalAI](https://huggingface.co/datasets/GTM-UVigo/FalAI), [Fleurs](https://huggingface.co/datasets/google/fleurs) and [Nos_ParlaSpeech-GL](https://huggingface.co/datasets/proxectonos/Nos_Parlaspeech-GL). |
|
|
| ### Test |
| This model has been tested in the test splits of the Galician OpenSLR dataset, the Galician Common Voice 17.0 dataset, the FalAI dataset, the Galician FLEURS dataset and Nos_Parlaspeech-GL. |
| The results are shown in the following tables: |
| |
| |Corpus|WER|CER|RTF| |
| |:-----|:--|:--|:--| |
| |Common Voice 17.0|7.85|1.66|0.0085| |
| |Open SLR77|12.04|3.82|0.0087| |
| |FalAI|4.39|1.17|0.0260| |
| |FLEURS|15.83|5.08|0.0091| |
| |Nos_Parlaspeech-GL|8.92|2.65|0.0114| |
|
|
| ### Citation information |
|
|
| If you use this model, please cite as follows: |
|
|
| Moscoso Sánchez, Antonio; Magariños, Carmen; Castedo, Carla. 2025. Nos_ASR-wav2vec2-xls-r-300m-gl. URL: https://huggingface.co/proxectonos/Nos_ASR-wav2vec2-xls-r-300m-gl |