Indian Flag
Government Of India
A-
A
A+
Odia ASR Benchmark Dataset - Kathbath-Odia-Test-Unknown

Odia ASR Benchmark Dataset - Kathbath-Odia-Test-Unknown

Odia ASR (Automatic Speech Recognition) benchmark test dataset from Bhashini for supporting the development of robust regional speech recognition systems.

About Dataset

The Kathbath-Odia-Test-Unknown dataset is a robust ASR benchmark dataset designed to test Automatic Speech Recognition (ASR) systems in Odia. Comprising 1684 hours of labeled speech data across 12 Indian languages, it is tailored for general-domain testing scenarios. Submitted by Tahir Javed, this dataset provides a valuable resource for advancing ASR technologies for Odia and other Indian regional languages, contributing to the development of multilingual ASR solutions.

Activity Overview Activity Overview

  • Downloads0
  • Downloads 8
  • Views 57
  • File Size 339.18 MB

Tags Tags

  • NLP Dataset
  • Benchmark
  • General Domain
  • Automatic Speech Recognition
  • Odia
  • Speech Technology
  • ASR
  • Regional Languages
  • Indian Languages
  • Multilingual Dataset
  • Audio Processing

License Control License Control

Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

844424930330266-1000-f.wav ( 135.04 KB )


To preview this file, you need to be a registered user. Please complete the registration process to gain access and continue viewing the content.

Data Quality Score BetaData Quality Score Beta

Version Control Version Control

FolderVersion 1(339.18 MB)
  • admin·11 month(s) ago
    • chevron_rightFolder
      audios
      • audio/wav
        844424930330266-1000-f.wav
      • audio/wav
        844424930330267-1000-f.wav
      • audio/wav
        844424930330269-1000-f.wav
      • audio/wav
        844424930330270-1000-f.wav
      • audio/wav
        844424930330275-1000-f.wav
      • audio/wav
        844424930330277-1000-f.wav
      • audio/wav
        844424930330286-1000-f.wav
      • audio/wav
        844424930330287-1000-f.wav
      • audio/wav
        844424930330290-1000-f.wav
      • audio/wav
        844424930330291-1000-f.wav
      • more_horiz 1852 more
    • application/json
      data.json
    • application/json
      params.json