Breeze-ASR-26 Model for CTranslate2
This repository contains the MediaTek-Research/Breeze-ASR-26 model converted to the CTranslate2 format.
The model can be used with CTranslate2 or CTranslate2-based projects such as faster-whisper.
About Breeze-ASR-26
Breeze-ASR-26 (BreezeASR-Taigi) is a Taiwanese Hokkien (Taigi / 台語) automatic speech recognition model fine-tuned from openai/whisper-large-v2. It was trained on approximately 10,000 hours of synthetic Taiwanese Hokkien speech data and outputs Mandarin Chinese character transcriptions.
For more details about the original model, please refer to its model card.
Example
from faster_whisper import WhisperModel
model = WhisperModel("path/to/faster-whisper-Breeze-ASR-26")
segments, info = model.transcribe("audio.wav")
for segment in segments:
print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text))
Conversion Details
The original model was converted with the following command:
ct2-transformers-converter \
--model /path/to/Breeze-ASR-26 \
--output_dir faster-whisper-Breeze-ASR-26 \
--copy_files preprocessor_config.json \
--quantization float16
Note: The model weights are saved in FP16 format. You can change the type when loading the model using the compute_type option in CTranslate2.
Benchmark Results
The original Breeze-ASR-26 model achieves an average Character Error Rate (CER) of 30.13% on the Taigi ASR Benchmark, outperforming several baseline systems:
| System | Average CER (%) |
|---|---|
| BreezeASR-Taigi (Ours) | 30.13 ✅ |
| Taiwanese Input Method | 30.70 |
| Yating (雅婷逐字稿) | 32.11 |
| Gemini 3 Flash | 32.52 |
| Breeze ASR 25 | 49.99 |
More Information
For more information about the original model, please refer to its model card
- Downloads last month
- 50
Model tree for SoybeanMilk/faster-whisper-Breeze-ASR-26
Base model
openai/whisper-large-v2