Faster-Whisper CTranslate2 - Egyptian Arabic Whisper Small

This repository hosts a CTranslate2 conversion of the Hugging Face model MAdel121/whisper-small-egyptian-arabic for use with faster-whisper.

Attribution

Original model: MAdel121/whisper-small-egyptian-arabic
Base model: openai/whisper-small
Dataset: MAdel121/arabic-egy-cleaned
License: MIT (same as the original model)
Fine-tuning framework: SpeechBrain (per original model card)

Usage (faster-whisper)

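from faster_whisper import WhisperModel

# GPU inference with FP16 weights.
# The first argument is a local directory holding the converted model
# (the --output_dir from the conversion step); the Hub repo ID
# "moeshawky/faster-whisper-small-egyptian-arabic" can be passed instead.
model = WhisperModel(
    "faster-whisper-small-egyptian-arabic",
    device="cuda",
    compute_type="float16",
)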

CPU usage:

from faster_whisper import WhisperModel

# CPU inference; int8 quantizes the weights at load time to reduce memory use.
model = WhisperModel(
    "faster-whisper-small-egyptian-arabic",
    device="cpu",
    compute_type="int8",
)
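
The snippets above only load the model. A minimal sketch of a full transcription call might look like the following. The helper names (`pick_runtime`, `format_timestamp`, `transcribe_file`), the `beam_size=5` setting, and the `sample.wav` file name are illustrative assumptions, not part of this repository; `language="ar"` is passed on the assumption that the input is Arabic speech.

```python
from typing import List, Tuple


def pick_runtime(use_gpu: bool) -> Tuple[str, str]:
    """Return the (device, compute_type) pair used in the snippets above."""
    return ("cuda", "float16") if use_gpu else ("cpu", "int8")


def format_timestamp(seconds: float) -> str:
    """Render a time offset in seconds as MM:SS.mmm."""
    total_ms = int(round(seconds * 1000))
    minutes, rem_ms = divmod(total_ms, 60_000)
    secs, ms = divmod(rem_ms, 1000)
    return f"{minutes:02d}:{secs:02d}.{ms:03d}"


def transcribe_file(
    audio_path: str,
    model_path: str = "faster-whisper-small-egyptian-arabic",
    use_gpu: bool = False,
) -> List[str]:
    """Transcribe one audio file and return one formatted line per segment."""
    # Imported lazily so the helpers above work without faster-whisper installed.
    from faster_whisper import WhisperModel

    device, compute_type = pick_runtime(use_gpu)
    model = WhisperModel(model_path, device=device, compute_type=compute_type)

    # Fixing the language skips auto-detection; beam_size=5 is an illustrative choice.
    segments, _info = model.transcribe(audio_path, language="ar", beam_size=5)

    # `segments` is a generator, so decoding happens while it is consumed here.
    return [
        f"[{format_timestamp(s.start)} -> {format_timestamp(s.end)}] {s.text.strip()}"
        for s in segments
    ]


# Example call (hypothetical file name):
# for line in transcribe_file("sample.wav"):
#     print(line)
```

Because `transcribe` returns a lazy generator, the list comprehension is what triggers the actual decoding.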

Conversion

Converted with:

ct2-transformers-converter \
  --model whisper-small-egyptian-arabic \
  --output_dir faster-whisper-small-egyptian-arabic \
  --quantization float16 \
  --copy_files tokenizer.json preprocessor_config.json
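
After conversion, it can be useful to confirm that the output directory contains the files faster-whisper will look for. The sketch below checks for the two files named in `--copy_files` plus `model.bin`, which is assumed here to be the weights file written by the converter. The helper name `missing_files` is hypothetical.

```python
from pathlib import Path
from typing import List, Sequence

# tokenizer.json and preprocessor_config.json come from the --copy_files flag above.
# model.bin is an assumption about the converter's output layout.
EXPECTED_FILES = ("model.bin", "tokenizer.json", "preprocessor_config.json")


def missing_files(model_dir: str, expected: Sequence[str] = EXPECTED_FILES) -> List[str]:
    """Return the expected file names that are not present in model_dir."""
    root = Path(model_dir)
    return [name for name in expected if not (root / name).is_file()]


# Example call:
# print(missing_files("faster-whisper-small-egyptian-arabic"))
```

An empty list means every expected file was found; any names returned point to files that would need to be copied in or regenerated.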

Citations

Please cite the original Whisper paper, the dataset, and SpeechBrain:

@article{radford2023robust,
  title={Robust Speech Recognition via Large-Scale Weak Supervision},
  author={Radford, Alec and Kim, Jong Wook and Xu, Tao and Brockman, Greg and McLeavey, Christine and Sutskever, Ilya},
  journal={arXiv preprint arXiv:2212.04356},
  year={2023}
}

@misc{adel_mohamed_2024_12860997,
  author       = {Adel Mohamed},
  title        = {MAdel121/arabic-egy-cleaned},
  month        = jun,
  year         = 2024,
  publisher    = {Zenodo},
  doi          = {10.5281/zenodo.12860997},
  url          = {https://doi.org/10.5281/zenodo.12860997}
}

@misc{speechbrain,
  title={{SpeechBrain}: A General-Purpose Speech Toolkit},
  author={Ravanelli, Mirco and Parcollet, Titouan and Plantinga, Peter and Rouhe, Aku and Cornell, Samuele and Lugosch, Loren and Subakan, Cem and Dawalatabad, Nauman and Heba, Abdelwahab and Zhong, Jianyuan and Chou, Ju-Chieh and Yeh, Sung-Lin and Fu, Szu-Wei and Liao, Chien-Feng and Rastorgueva, Elena and Grondin, Francois and Aris, William and Na, Hwidong and Gao, Yan and De Mori, Renato and Bengio, Yoshua},
  year={2021},
  eprint={2106.04624},
  archivePrefix={arXiv},
  primaryClass={eess.AS}
}
