YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

Evaluation Results

  • Latest checkpoint uploaded: best_model.pt (size: 368K)

  • Training in progress: Currently at epoch 11 (see train.log for details)

  • ASR WER (IWSLT 2026 test_set): 45.08%

  • ASR CER (IWSLT 2026 test_set): 12.60%

Sample predictions and full results are in results_test_set.txt.

language: - or license: cc-by-4.0 tags: - automatic-speech-recognition - odia - conformer - ctc - ai4bharat datasets: - openslr - MUCS2021 metrics: - wer base_model: ai4bharat/indic-conformer-600m-multilingual

Odia ASR — Fine-tuned IndicConformer 600M

Fine-tuned version of ai4bharat/indic-conformer-600m-multilingual on Odia (ଓଡ଼ିଆ) speech recognition.

Model Details

Property Value
Base Model ai4bharat/indic-conformer-600m-multilingual
Language Odia (or)
Task Automatic Speech Recognition (ASR)
Architecture Conformer + CTC head
Encoder Dim 1024
Vocabulary 91 Odia characters
Sample Rate 16 kHz

Training Data

  • Train: OpenSLR-103 — MUCS 2021 Odia (~94.5h, 59,782 utterances)
  • Test (OpenSLR): OpenSLR-103 test set (~5.5h, 3,471 utterances)
  • Test (IWSLT 2026): OdiaGenAI/iwslt-odia-speech test set (~2.7h, 1,600 utterances)

Training Configuration

Hyperparameter Value
Batch size 8 (effective 32 with grad_accum=4)
Learning rate 1e-4
Optimizer AdamW (weight_decay=1e-4)
Scheduler Cosine with 500 warmup steps
Max epochs 30
Max audio length 10s
GPU NVIDIA A100 80GB

Architecture Notes

The base IndicConformer-600M model uses:

  • Preprocessor: NeMo TorchScript mel-spectrogram extractor (80-dim, runs on CUDA)
  • Encoder: ONNX InferenceSession (1024-dim output, CUDAExecutionProvider)
  • CTC Head: nn.Linear(1024, 91) — the only fine-tuned parameters

The encoder is frozen (ONNX); only the Odia CTC projection head is trained.

Usage

from transformers import AutoModel
import torch

model = AutoModel.from_pretrained(
    "shantipriya/odia-asr",
    trust_remote_code=True,
)

Citation

If you use this model, please cite:

@misc{odia-asr-2026,
  title  = {Fine-tuned IndicConformer for Odia ASR},
  author = {Shantipriya Parida},
  year   = {2026},
  url    = {https://huggingface.co/shantipriya/odia-asr},
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support