Indian Flag
Government Of India
A-
A
A+
Fluers Malayalam ASR Benchmark Dataset for Speech Recognition Technology

Fluers Malayalam ASR Benchmark Dataset for Speech Recognition Technology

Malayalam ASR (Automatic Speech Recognition) benchmark dataset from Bhashini for supporting the development of robust regional speech recognition systems.

About Dataset

This is a Malayalam ASR benchmark dataset developed to evaluate and improve Automatic Speech Recognition (ASR) systems for the Malayalam language. The dataset includes diverse and high-quality audio samples, providing researchers and developers with a critical resource for building robust ASR models. Submitted by Microsoft, it supports the advancement of speech recognition technologies in regional languages.

Activity Overview Activity Overview

  • Downloads0
  • Downloads 8
  • Views 130
  • File Size 430 MB

Tags Tags

  • NLP Dataset
  • Benchmark
  • General Domain
  • Automatic Speech Recognition
  • Malayalam
  • Speech Technology
  • AI4Bharat
  • ASR
  • Regional Languages
  • Audio Processing

License Control License Control

Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

10013487754489332324.wav ( 356.33 KB )


To preview this file, you need to be a registered user. Please complete the registration process to gain access and continue viewing the content.

Data Quality Score BetaData Quality Score Beta

Version Control Version Control

FolderVersion 1(430 MB)
  • admin·11 month(s) ago
    • audio/wav
      10013487754489332324.wav
    • audio/wav
      10069218717188856999.wav
    • audio/wav
      10078044035084322628.wav
    • audio/wav
      10101537866226983572.wav
    • audio/wav
      10134676116368565149.wav
    • audio/wav
      10168662218067058959.wav
    • audio/wav
      10199103907587183560.wav
    • audio/wav
      10234648330876278951.wav
    • audio/wav
      10267091564765546340.wav
    • audio/wav
      10270044487424671525.wav
    • more_horiz 950 more