Indian Flag
Government Of India
A-
A
A+
Gramvaani Hindi ASR Benchmark Dataset for Speech Recognition

Gramvaani Hindi ASR Benchmark Dataset for Speech Recognition

Hindi ASR (Automatic Speech Recognition) benchmark dataset from Bhashini for supporting the development of robust regional speech recognition systems.

About Dataset

This is a Hindi ASR benchmark dataset developed to evaluate and improve Automatic Speech Recognition (ASR) systems for the Hindi language. The dataset includes diverse and high-quality audio samples, focusing on topics such as agriculture, healthcare, and general knowledge. It serves as a critical resource for researchers and developers to build robust ASR models. Submitted by AI4Bharat, this dataset supports advancements in speech recognition technologies for regional languages.

Activity Overview Activity Overview

  • Downloads0
  • Downloads 13
  • Views 506
  • File Size 307.62 MB

Tags Tags

  • NLP Dataset
  • Hindi
  • Benchmark
  • General Domain
  • Automatic Speech Recognition
  • Speech Technology
  • AI4Bharat
  • ASR
  • Regional Languages
  • Audio Processing
  • Healthcare Domain
  • Agriculture Domain

License Control License Control

Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

01-00004-02.wav ( 375.20 KB )


To preview this file, you need to be a registered user. Please complete the registration process to gain access and continue viewing the content.

Data Quality Score BetaData Quality Score Beta

Version Control Version Control

FolderVersion 1(307.62 MB)
  • admin·1 year(s) ago
    • audio/wav
      01-00004-02.wav
    • audio/wav
      01-00008-03.wav
    • audio/wav
      01-00031-03.wav
    • audio/wav
      01-00071-02.wav
    • audio/wav
      01-00078-01.wav
    • audio/wav
      01-00093-01.wav
    • audio/wav
      01-00097-03.wav
    • audio/wav
      01-00119-02.wav
    • audio/wav
      01-00121-02.wav
    • audio/wav
      01-00129-02.wav
    • more_horiz 1024 more