Marathi ASR Validation Dataset with Indian Language Support


A Marathi Automatic Speech Recognition (ASR) validation dataset from Bhashini, intended to support the development of robust regional speech recognition systems.

About Dataset

The Kathbath-Marathi-Valid dataset is the Marathi validation split of the Kathbath corpus and serves as a validation resource for Automatic Speech Recognition (ASR) systems in Marathi. The full Kathbath corpus includes 1684 hours of labeled speech data spanning 12 Indian languages; this split provides general-domain Marathi speech for validating ASR models. Submitted by Tahir Javed, the dataset supports advances in speech recognition for Marathi and other Indian regional languages, contributing to the development of reliable multilingual ASR systems.

Activity Overview

  • Downloads 10
  • Views 124
  • File Size 554.02 MB

Tags

  • NLP Dataset
  • Benchmark
  • General Domain
  • Automatic Speech Recognition
  • Speech Technology
  • ASR
  • Regional Languages
  • Indian Languages
  • Multilingual Dataset
  • Audio Processing
  • Validation Dataset
  • Marathi

License Control

Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

844424930323365-291-f.wav ( 291.05 KB )




Version Control

Version 1 (554.02 MB)
  • admin · 11 month(s) ago
    • Folder: audios
      • 844424930323365-291-f.wav (audio/wav)
      • 844424930323383-291-f.wav (audio/wav)
      • 844424930323403-291-f.wav (audio/wav)
      • 844424930323408-291-f.wav (audio/wav)
      • 844424930323415-291-f.wav (audio/wav)
      • 844424930323417-291-f.wav (audio/wav)
      • 844424930323431-291-f.wav (audio/wav)
      • 844424930323439-291-f.wav (audio/wav)
      • 844424930328299-291-f.wav (audio/wav)
      • 844424930328308-291-f.wav (audio/wav)
      • 2839 more
    • data.json (application/json)
    • params.json (application/json)
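
The layout above (an audios folder of WAV clips next to data.json and params.json) can be inspected with a few lines of Python. The following is a minimal sketch, not an official loader. It assumes the archive has been extracted to a local directory, that the clips are PCM WAV files readable by the standard-library wave module, and that data.json is valid JSON. The listing does not document the schema of data.json, so the sketch reports only its top-level type. The function names and the directory path in the usage note are hypothetical.

```python
import json
import wave
from pathlib import Path


def wav_duration_seconds(path):
    """Return the duration of a PCM WAV file in seconds, read from its header."""
    with wave.open(str(path), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def summarize_dataset(root):
    """Count the clips under audios/ and load data.json without assuming its schema."""
    root = Path(root)
    clips = sorted((root / "audios").glob("*.wav"))
    total_seconds = sum(wav_duration_seconds(clip) for clip in clips)

    # The structure of data.json is not documented in the listing, so it is
    # loaded as generic JSON and only its top-level type is reported.
    with open(root / "data.json", encoding="utf-8") as handle:
        metadata = json.load(handle)

    return {
        "clip_count": len(clips),
        "total_hours": total_seconds / 3600,
        "metadata_type": type(metadata).__name__,
    }
```

Calling summarize_dataset("path/to/extracted/version_1") would return the number of clips found, their combined duration in hours, and whether data.json holds a list or a dict, which is a reasonable first check before wiring the files into an ASR evaluation pipeline.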