Marathi ASR Validation Dataset with Indian Language Support

Marathi ASR (Automatic Speech Recognition) validation dataset from Bhashini, intended to support the development of robust regional speech recognition systems.

About Dataset

The Kathbath-Marathi-Valid dataset is a validation resource for Automatic Speech Recognition (ASR) systems in Marathi. It is the Marathi validation split of the Kathbath corpus, which as a whole includes 1684 hours of labeled speech data spanning 12 Indian languages and is intended for validating ASR models in general domains. Submitted by Tahir Javed, this dataset supports work on speech recognition for Marathi and other Indian regional languages, and contributes to the development of reliable multilingual ASR systems.

Dataset Metadata

License

Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

Geographical coverage

Sector

Sector Agnostic

Author

AI4Bharat

Source Organisation

Digital India BHASHINI Division

Uploaded by

Shailendra Pal Singh

Data Quality Score (Beta)

Dataset type

Unstructured

Frequency

Time Granularity

Year range

N.A.

Date & Time

24/02/25 13:23:01

Visibility

Open

Hosted / Redirected

Hosted

Activity Overview

0
16
554.02 MB
230

License Control

Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

844424930323365-291-f.wav ( 291.05 KB )

Version Control

Version 1 (554.02 MB)

admin·1 year(s) ago
- audios
  844424930323365-291-f.wav
  844424930323383-291-f.wav
  844424930323403-291-f.wav
  844424930323408-291-f.wav
  844424930323415-291-f.wav
  844424930323417-291-f.wav
  844424930323431-291-f.wav
  844424930323439-291-f.wav
  844424930328299-291-f.wav
  844424930328308-291-f.wav
  2839 more
- data.json
- params.json
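
The layout above (an `audios` folder of `.wav` files next to `data.json` and `params.json`) can be sanity-checked after download with a short script. The following is a minimal sketch, assuming the archive has been extracted to a local folder with the same layout. The function names `summarize_dataset` and `split_name` are hypothetical helpers, not part of any AIKosh or Bhashini tooling, and the meaning of the hyphen-separated parts of each file name is not documented on this page, so the sketch only splits them without interpreting them.

```python
from pathlib import Path


def summarize_dataset(root):
    """Return a small inventory of an extracted copy of the dataset.

    Assumes (hypothetically) that `root` contains an `audios` folder plus
    `data.json` and `params.json`, as in the listing above.
    """
    root = Path(root)
    # Collect the names of all .wav files directly inside audios/.
    wavs = sorted(p.name for p in (root / "audios").glob("*.wav"))
    return {
        "wav_count": len(wavs),
        "first_wav": wavs[0] if wavs else None,
        "has_data_json": (root / "data.json").is_file(),
        "has_params_json": (root / "params.json").is_file(),
    }


def split_name(filename):
    """Split a name such as '844424930323365-291-f.wav' on hyphens.

    The parts are returned uninterpreted, because their meaning is an
    open assumption rather than something this page documents.
    """
    return Path(filename).stem.split("-")
```

Going by the listing, a complete download should report 2849 audio files (the 10 shown plus 2839 more), so a lower `wav_count` would suggest an incomplete extraction.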
