Government Of India
IndicVoices

Towards building an Inclusive Multilingual Speech Dataset for Indian Languages

About Dataset

INDICVOICES is a dataset of natural and spontaneous speech containing 23.7K hours of read (8%), extempore (76%), and conversational (15%) audio from 51K speakers, covering 400+ Indian districts and 22 languages. See the full description on the dataset page: https://huggingface.co/datasets/ai4bharat/IndicVoices.

Purpose of Dataset

To Build Robust Speech Interfaces

Activity Overview

  • Downloads 0
  • Redirect 23
  • Views 120
  • File Size 0

Tags

  • Speech Dataset

License Control

Attribution 4.0 International (CC BY 4.0)