Indian Flag
Government Of India
A-
A
A+
Punjabi ASR Benchmark Dataset (Kathbath Punjabi test unknown)

Punjabi ASR Benchmark Dataset (Kathbath Punjabi test unknown)

Punjabi ASR (Automatic Speech Recognition) benchmark test dataset for supporting the development of robust regional speech recognition systems.

About Dataset

The Kathbath-Punjabi-Test-Unknown dataset is a robust benchmark for testing Automatic Speech Recognition (ASR) systems in Punjabi. This dataset comprises 1684 hours of labeled speech data spanning 12 Indian languages, tailored for general-domain scenarios. Submitted by Tahir Javed, it is a vital resource for advancing speech recognition technologies for Punjabi and other regional Indian languages, facilitating the development of reliable multilingual ASR systems.

Activity Overview Activity Overview

  • Downloads0
  • Downloads 9
  • Views 69
  • File Size 330.81 MB

Tags Tags

  • NLP Dataset
  • Benchmark
  • Punjabi
  • General Domain
  • Automatic Speech Recognition
  • Speech Technology
  • ASR
  • Regional Languages
  • Indian Languages
  • Multilingual Dataset
  • Audio Processing

License Control License Control

Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

844424930437057-34-f.wav ( 111.82 KB )


To preview this file, you need to be a registered user. Please complete the registration process to gain access and continue viewing the content.

Data Quality Score BetaData Quality Score Beta

Version Control Version Control

FolderVersion 1(330.81 MB)
  • admin·11 month(s) ago
    • chevron_rightFolder
      audios
      • audio/wav
        844424930437057-34-f.wav
      • audio/wav
        844424930437058-34-f.wav
      • audio/wav
        844424930437059-34-f.wav
      • audio/wav
        844424930437060-34-f.wav
      • audio/wav
        844424930437062-34-f.wav
      • audio/wav
        844424930437063-34-f.wav
      • audio/wav
        844424930437066-34-f.wav
      • audio/wav
        844424930437067-34-f.wav
      • audio/wav
        844424930437068-34-f.wav
      • audio/wav
        844424930437069-34-f.wav
      • more_horiz 1904 more
    • application/json
      data.json
    • application/json
      params.json