Home/Datasets/Punjabi ASR Validation Dataset: Kathbath-Punjabi-Valid

ORGANISATION

Punjabi ASR Validation Dataset: Kathbath-Punjabi-Valid

Punjabi ASR (Automatic Speech Recognition) benchmark validation dataset for supporting the development of robust regional speech recognition systems.

About Dataset

The Kathbath-Punjabi-Valid dataset serves as a validation resource for testing and improving the performance of Automatic Speech Recognition (ASR) systems in Punjabi. With 1684 hours of labeled speech data across 12 Indian languages, this dataset is designed for validating ASR models in general-domain applications. Submitted by Tahir Javed, it provides essential support for advancing speech recognition technologies in Punjabi and other Indian regional languages, contributing to the development of reliable multilingual ASR systems.

Dataset Metadata

License

Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

Geographical coverage

Sector

Sector Agnostic

Author

AI4Bharat

Source Organisation

Digital India BHASHINI Division

Uploaded by

Shailendra Pal Singh

Data Quality Score (Beta)

Dataset type

Unstructured

Frequency

Time Granularity

Year range

N.A.

Date & Time

24/02/25 13:23:15

Visibility

Open

Hosted / Redirected

Hosted

Activity Overview

0
12
552.26 MB
180

License Control

Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

844424930560952-973-f.wav ( 248.96 KB )

To preview this file, you need to be a registered user. Please complete the registration process to gain access and continue viewing the content.

Data Quality Score Beta

Version Control

Version 1(552.26 MB)

admin·1 year(s) ago
- audios
  844424930560952-973-f.wav
  844424930560965-973-f.wav
  844424930560980-973-f.wav
  844424930560981-973-f.wav
  844424930562721-973-f.wav
  844424930562727-973-f.wav
  844424930562737-973-f.wav
  844424930562781-973-f.wav
  844424930562831-812-f.wav
  844424930562882-812-f.wav
  3260 more
- data.json
- params.json

Accessibility options by UX4G

Punjabi ASR Validation Dataset: Kathbath-Punjabi-Valid

About Dataset

Dataset Metadata

Activity Overview

Tags

License Control

844424930560952-973-f.wav ( 248.96 KB )

Data Quality Score Beta

Version Control

Version 1(552.26 MB)

audios

844424930560952-973-f.wav

844424930560965-973-f.wav

844424930560980-973-f.wav

844424930560981-973-f.wav

844424930562721-973-f.wav

844424930562727-973-f.wav

844424930562737-973-f.wav

844424930562781-973-f.wav

844424930562831-812-f.wav

844424930562882-812-f.wav

data.json

params.json

AIKosh

Resources

Support