Breeze-ASR-26 Model for CTranslate2

This repository contains the MediaTek-Research/Breeze-ASR-26 model converted to the CTranslate2 format.

The model can be used with CTranslate2 or CTranslate2-based projects such as faster-whisper.

About Breeze-ASR-26

Breeze-ASR-26 (BreezeASR-Taigi) is a Taiwanese Hokkien (Taigi / 台語) automatic speech recognition model fine-tuned from openai/whisper-large-v2. It was trained on approximately 10,000 hours of synthetic Taiwanese Hokkien speech data and outputs Mandarin Chinese character transcriptions.

For more details about the original model, please refer to its model card.

Example

from faster_whisper import WhisperModel

model = WhisperModel("path/to/faster-whisper-Breeze-ASR-26")

segments, info = model.transcribe("audio.wav")
for segment in segments:
    print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text))

Conversion Details

The original model was converted with the following command:

ct2-transformers-converter \
  --model /path/to/Breeze-ASR-26 \
  --output_dir faster-whisper-Breeze-ASR-26 \
  --copy_files preprocessor_config.json \
  --quantization float16

Note: The model weights are saved in FP16 format. You can change the type when loading the model using the compute_type option in CTranslate2.

Benchmark Results

The original Breeze-ASR-26 model achieves an average Character Error Rate (CER) of 30.13% on the Taigi ASR Benchmark, outperforming several baseline systems:

System Average CER (%)
BreezeASR-Taigi (Ours) 30.13
Taiwanese Input Method 30.70
Yating (雅婷逐字稿) 32.11
Gemini 3 Flash 32.52
Breeze ASR 25 49.99

More Information

For more information about the original model, please refer to its model card

Downloads last month
50
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for SoybeanMilk/faster-whisper-Breeze-ASR-26

Finetuned
(4)
this model