Home/Datasets/Marathi ASR Benchmark Dataset: Kathbath Marathi noisy test known

ORGANISATION

Marathi ASR Benchmark Dataset: Kathbath Marathi noisy test known

Marathi ASR (Automatic Speech Recognition) noisy test dataset from Bhashini for supporting the development of robust regional speech recognition systems.

About Dataset

This is a Marathi ASR benchmark dataset specifically designed to evaluate and improve Automatic Speech Recognition (ASR) systems in noisy and challenging scenarios, particularly in the general domain. The dataset comprises 1684 hours of labeled speech data across 12 Indian languages, with a focus on Marathi. This dataset variant, known as "Kathbath-Marathi-Noisy-Test-Known_1," provides researchers and developers with a valuable resource for building robust ASR models capable of handling real-world noisy conditions. Submitted by Tahir Javed, it supports advancements in speech recognition technologies for regional languages.

Dataset Metadata

License

Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

Geographical coverage

Sector

Sector Agnostic

Author

AI4Bharat

Source Organisation

Digital India BHASHINI Division

Uploaded by

Shailendra Pal Singh

Data Quality Score (Beta)

Dataset type

Unstructured

Frequency

Time Granularity

Year range

N.A.

Date & Time

24/02/25 13:23:44

Visibility

Open

Hosted / Redirected

Hosted

Activity Overview

0
22
551.57 MB
123

License Control

Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

844424930323045-45-m.wav ( 170.60 KB )

To preview this file, you need to be a registered user. Please complete the registration process to gain access and continue viewing the content.

Data Quality Score Beta

Version Control

Version 1(551.57 MB)

admin·1 year(s) ago
- audios
  844424930323045-45-m.wav
  844424930323048-45-m.wav
  844424930323050-45-m.wav
  844424930323051-45-m.wav
  844424930323052-45-m.wav
  844424930323059-45-m.wav
  844424930323061-45-m.wav
  844424930323063-45-m.wav
  844424930323065-45-m.wav
  844424930323072-45-m.wav
  2596 more
- data.json
- params.json

Accessibility options by UX4G

Marathi ASR Benchmark Dataset: Kathbath Marathi noisy test known

About Dataset

Dataset Metadata

Activity Overview

Tags

License Control

844424930323045-45-m.wav ( 170.60 KB )

Data Quality Score Beta

Version Control

Version 1(551.57 MB)

audios

844424930323045-45-m.wav

844424930323048-45-m.wav

844424930323050-45-m.wav

844424930323051-45-m.wav

844424930323052-45-m.wav

844424930323059-45-m.wav

844424930323061-45-m.wav

844424930323063-45-m.wav

844424930323065-45-m.wav

844424930323072-45-m.wav

data.json

params.json

AIKosh

Resources

Support