India is the second largest English-speaking country in the world, with a speaker base of roughly 130 million. Unfortunately, Indian speakers are underrepresented in many existing English ASR benchmarks such as LibriSpeech, Switchboard, and the Speech Accent Archive. To address this gap, we introduce Svarah—a benchmark that comprises 9.6 hours of transcribed English audio from 117 speakers across 65… See the full description on the dataset page: https://huggingface.co/datasets/ai4bharat/Svarah.
Attribution 4.0 International (CC BY- 4.0)
© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.