This model is openai/whisper-large-v3 trained for European Portuguese (EP) automatic speech recognition (ASR) on around 425 hours of data.
It comes from the CAMÕES work.
```python
import whisperx

device = "cuda"  # or "cpu"
compute_type = "float16"  # float16 targets GPU; on CPU, "int8" or "float32" is typically needed

# Load the European Portuguese model through WhisperX
model = whisperx.load_model(
    "inesc-id/WhisperLv3-EP-X",
    device=device,
    compute_type=compute_type,
    language="pt",
    task="transcribe",
)
```
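
Once the model is loaded, transcribing a file could look roughly like the sketch below. It is not part of the original card: the helper names (`pick_compute_type`, `segments_to_text`, `transcribe_file`), the `int8` CPU fallback, and the batch size are illustrative assumptions. It relies on `whisperx.load_audio` and the loaded model's `transcribe` method, which returns a dictionary with a `segments` list.

```python
from typing import Any, Dict


def pick_compute_type(device: str) -> str:
    """Pick a compute type for the device (assumption: int8 as the CPU fallback)."""
    return "float16" if device.startswith("cuda") else "int8"


def segments_to_text(result: Dict[str, Any]) -> str:
    """Join the text of every segment in a WhisperX result into one string."""
    segments = result.get("segments", [])
    return " ".join(seg["text"].strip() for seg in segments if seg.get("text"))


def transcribe_file(audio_path: str, device: str = "cuda", batch_size: int = 16) -> str:
    """Transcribe one audio file with the European Portuguese model."""
    # Imported lazily so the pure-Python helpers above work without whisperx installed.
    import whisperx

    model = whisperx.load_model(
        "inesc-id/WhisperLv3-EP-X",
        device=device,
        compute_type=pick_compute_type(device),
        language="pt",
        task="transcribe",
    )
    audio = whisperx.load_audio(audio_path)  # decodes the file into a waveform array
    result = model.transcribe(audio, batch_size=batch_size)
    return segments_to_text(result)
```

Calling `transcribe_file("sample.wav", device="cpu")` would return the transcript as a single string, where `sample.wav` is a placeholder path rather than a file shipped with the model. Lowering `batch_size` is a common way to reduce memory use.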
## Citation
**BibTeX:**
```bibtex
@inproceedings{camoes,
  title={{CAMÕES: A Comprehensive Automatic Speech Recognition Benchmark for European Portuguese}},
  author={Carlos Carvalho and Francisco Teixeira and Catarina Botelho and Anna Pompili and Rubén Solera-Ureña and Sérgio Paulo and Mariana Julião and Thomas Rolland and John Mendonça and Diogo Pereira and Isabel Trancoso and Alberto Abad},
  booktitle={Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)},
  year={2025}
}
```