Government Of India
Svarah

Bridging the accent gap in English ASR for India

About Dataset

India is the second-largest English-speaking country in the world, with a speaker base of roughly 130 million. Unfortunately, Indian speakers are underrepresented in many existing English ASR benchmarks such as LibriSpeech, Switchboard, and the Speech Accent Archive. To address this gap, we introduce Svarah, a benchmark that comprises 9.6 hours of transcribed English audio from 117 speakers across 65… See the full description on the dataset page: https://huggingface.co/datasets/ai4bharat/Svarah.

Activity Overview

  • Downloads 0
  • Redirect 23
  • Views 97
  • File Size 0

Tags

  • Speech Dataset
  • Speech Recognition

License Control

Attribution 4.0 International (CC BY 4.0)