Government Of India
Marathi ASR Benchmark Dataset (Kathbath Marathi Test known)

Marathi ASR (Automatic Speech Recognition) benchmark test dataset from Bhashini for supporting the development of robust regional speech recognition systems.

About Dataset

The Kathbath-Marathi-Test-Known dataset is a benchmark for evaluating Automatic Speech Recognition (ASR) systems in Marathi on general-domain speech. It is drawn from Kathbath, a corpus of 1684 hours of labeled speech data spanning 12 Indian languages. Submitted by Tahir Javed, it supports the development of speech recognition for Marathi and other Indian regional languages, including multilingual ASR systems.
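
This page does not say which metric the benchmark uses. ASR test sets are commonly scored with word error rate (WER), so the sketch below assumes WER and shows one minimal way to compute it for a single reference/hypothesis pair. The function name `word_error_rate` and the sample sentences are illustrative only and do not come from the dataset.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: words are separated by whitespace and no text
    normalization (punctuation, numerals) is applied.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Illustrative sentences, not taken from the dataset.
    print(word_error_rate("माझे नाव राम आहे", "माझे नाव आहे"))  # 0.25
```

In practice the per-utterance edit counts are usually summed over the whole test set before dividing by the total number of reference words, rather than averaging per-utterance rates.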

Activity Overview

  • Downloads 11
  • Views 143
  • File Size 552.37 MB

Tags

  • NLP Dataset
  • Benchmark
  • General Domain
  • Automatic Speech Recognition
  • Speech Technology
  • ASR
  • Regional Languages
  • Indian Languages
  • Multilingual Dataset
  • Audio Processing
  • Marathi

License Control

Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

844424930323355-291-f.wav (271.46 KB)


Version Control

Version 1 (552.37 MB)
  • admin · 11 month(s) ago
    • Folder: audios
      • 844424930323355-291-f.wav (audio/wav)
      • 844424930323360-291-f.wav (audio/wav)
      • 844424930323364-291-f.wav (audio/wav)
      • 844424930323369-291-f.wav (audio/wav)
      • 844424930323392-291-f.wav (audio/wav)
      • 844424930323401-291-f.wav (audio/wav)
      • 844424930323414-291-f.wav (audio/wav)
      • 844424930323438-291-f.wav (audio/wav)
      • 844424930323442-291-f.wav (audio/wav)
      • 844424930328269-291-f.wav (audio/wav)
      • 2708 more
    • data.json (application/json)
    • params.json (application/json)
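
The listing above suggests a layout with an audios folder of WAV files next to data.json and params.json. As a rough sketch under that assumption, the following counts the audio files, sums their duration, and loads data.json without assuming anything about its schema, which this page does not describe. The function name `summarize_dataset` and the example path are hypothetical, and the standard-library `wave` module only reads uncompressed PCM WAV files, so other encodings would need a different reader.

```python
import json
import wave
from pathlib import Path


def summarize_dataset(root):
    """Return basic statistics for an extracted copy of the dataset.

    Assumed layout (taken from the file listing, not verified):
        root/audios/*.wav
        root/data.json
    """
    root = Path(root)
    wav_paths = sorted((root / "audios").glob("*.wav"))

    total_seconds = 0.0
    for path in wav_paths:
        # wave.open reads the header; duration = frames / sample rate.
        with wave.open(str(path), "rb") as wav_file:
            total_seconds += wav_file.getnframes() / wav_file.getframerate()

    # The schema of data.json is not documented on this page, so only
    # its top-level type is reported here.
    with open(root / "data.json", encoding="utf-8") as handle:
        metadata = json.load(handle)

    return {
        "num_files": len(wav_paths),
        "total_seconds": total_seconds,
        "metadata_type": type(metadata).__name__,
    }
```

Calling `summarize_dataset("path/to/extracted/dataset")` on a local copy is a quick check that the download is complete; if the listing is accurate, the file count should come to 2718 (the 10 files shown plus 2708 more).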