Government Of India
Kathbath hard Punjabi ASR Benchmark Dataset

Hard Punjabi ASR (Automatic Speech Recognition) benchmark dataset from Bhashini for supporting the development of robust regional speech recognition systems.

About Dataset

The kathbath_hard_punjabi dataset is a benchmark for Punjabi Automatic Speech Recognition (ASR). It is designed to evaluate ASR models on challenging scenarios, using diverse data from the news and general domains. It is intended for researchers and developers who build and benchmark Punjabi speech recognition systems. Submitted by AI4Bharat, it contributes to advancing speech technology for low-resource languages.
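
This page does not say which metric the benchmark is scored with. As an illustration only, the sketch below computes word error rate (WER), a metric commonly used to compare an ASR transcript against a reference. The function name `word_error_rate` and the whitespace tokenization are assumptions made for this example, not part of the dataset or of any Bhashini or AI4Bharat tooling.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return the word-level edit distance divided by the reference length.

    Assumption: words are separated by whitespace. Real evaluations often
    normalize text (punctuation, numerals) before scoring.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] is the edit distance between the reference words seen so far
    # (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)
```

Called with a reference transcript and a model's output, for example `word_error_rate(reference_text, model_text)`, it returns 0.0 for an exact word-for-word match and grows as substitutions, deletions, and insertions accumulate.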

Activity Overview

  • Downloads 15
  • Views 194
  • File Size 171.68 MB

Tags

  • NLP Dataset
  • Benchmark
  • News Domain
  • Punjabi
  • General Domain
  • Low-Resource Languages
  • Automatic Speech Recognition
  • AI4Bharat
  • ASR
  • Speech Processing

License Control

Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

844424930437057-34-f.wav ( 111.79 KB )

Version Control

Version 1 (171.68 MB)
  • admin · 11 months ago
    • 844424930437057-34-f.wav (audio/wav)
    • 844424930437058-34-f.wav (audio/wav)
    • 844424930437059-34-f.wav (audio/wav)
    • 844424930437060-34-f.wav (audio/wav)
    • 844424930437062-34-f.wav (audio/wav)
    • 844424930437063-34-f.wav (audio/wav)
    • 844424930437066-34-f.wav (audio/wav)
    • 844424930437067-34-f.wav (audio/wav)
    • 844424930437068-34-f.wav (audio/wav)
    • 844424930437069-34-f.wav (audio/wav)
    • 931 more