Home/Datasets/Punjabi ASR Benchmark Dataset (Kathbath Punjabi test unknown)

ORGANISATION

Punjabi ASR Benchmark Dataset (Kathbath Punjabi test unknown)

Punjabi ASR (Automatic Speech Recognition) benchmark test dataset for supporting the development of robust regional speech recognition systems.

About Dataset

The Kathbath-Punjabi-Test-Unknown dataset is a robust benchmark for testing Automatic Speech Recognition (ASR) systems in Punjabi. This dataset comprises 1684 hours of labeled speech data spanning 12 Indian languages, tailored for general-domain scenarios. Submitted by Tahir Javed, it is a vital resource for advancing speech recognition technologies for Punjabi and other regional Indian languages, facilitating the development of reliable multilingual ASR systems.

Dataset Metadata

License

Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

Geographical coverage

Sector

Sector Agnostic

Author

AI4Bharat

Source Organisation

Digital India BHASHINI Division

Uploaded by

Shailendra Pal Singh

Data Quality Score (Beta)

Dataset type

Unstructured

Frequency

Time Granularity

Year range

N.A.

Date & Time

24/02/25 13:23:13

Visibility

Open

Hosted / Redirected

Hosted

Activity Overview

0
9
330.81 MB
133

License Control

Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

844424930437057-34-f.wav ( 111.82 KB )

To preview this file, you need to be a registered user. Please complete the registration process to gain access and continue viewing the content.

Data Quality Score Beta

Version Control

Version 1(330.81 MB)

admin·1 year(s) ago
- audios
  844424930437057-34-f.wav
  844424930437058-34-f.wav
  844424930437059-34-f.wav
  844424930437060-34-f.wav
  844424930437062-34-f.wav
  844424930437063-34-f.wav
  844424930437066-34-f.wav
  844424930437067-34-f.wav
  844424930437068-34-f.wav
  844424930437069-34-f.wav
  1904 more
- data.json
- params.json

Accessibility options by UX4G

Punjabi ASR Benchmark Dataset (Kathbath Punjabi test unknown)

About Dataset

Dataset Metadata

Activity Overview

Tags

License Control

844424930437057-34-f.wav ( 111.82 KB )

Data Quality Score Beta

Version Control

Version 1(330.81 MB)

audios

844424930437057-34-f.wav

844424930437058-34-f.wav

844424930437059-34-f.wav

844424930437060-34-f.wav

844424930437062-34-f.wav

844424930437063-34-f.wav

844424930437066-34-f.wav

844424930437067-34-f.wav

844424930437068-34-f.wav

844424930437069-34-f.wav

data.json

params.json

AIKosh

Resources

Support