Indian Flag
Government Of India
A-
A
A+
Tamil ASR Benchmark Dataset for Speech Recognition(Fluers Tamil)

Tamil ASR Benchmark Dataset for Speech Recognition(Fluers Tamil)

Tamil ASR (Automatic Speech Recognition) benchmark dataset from Bhashini for supporting the development of robust regional speech recognition systems.

About Dataset

This is a Tamil ASR benchmark dataset developed to evaluate and improve Automatic Speech Recognition (ASR) systems for the Tamil language. The dataset includes diverse and high-quality audio samples, focusing on general topics such as literature, cultural narratives, and daily conversations. This provides researchers and developers with a critical resource for building robust ASR models. Submitted by AI4Bharat, it supports advancements in speech recognition technologies for regional languages.

Activity Overview Activity Overview

  • Downloads0
  • Downloads 17
  • Views 354
  • File Size 234.48 MB

Tags Tags

  • NLP Dataset
  • Benchmark
  • Tamil
  • General Domain
  • Automatic Speech Recognition
  • Speech Technology
  • AI4Bharat
  • ASR
  • Regional Languages
  • Audio Processing

License Control License Control

Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

10015420708072669120.wav ( 243.83 KB )


To preview this file, you need to be a registered user. Please complete the registration process to gain access and continue viewing the content.

Data Quality Score BetaData Quality Score Beta

Version Control Version Control

FolderVersion 1(234.48 MB)
  • admin·11 month(s) ago
    • audio/wav
      10015420708072669120.wav
    • audio/wav
      10072217146537983584.wav
    • audio/wav
      10072241588933076862.wav
    • audio/wav
      10151944176129497809.wav
    • audio/wav
      10170056955437280049.wav
    • audio/wav
      10222238337252024030.wav
    • audio/wav
      10265284731185110799.wav
    • audio/wav
      10278812186982315493.wav
    • audio/wav
      10296046279567274586.wav
    • audio/wav
      10367238297771483490.wav
    • more_horiz 583 more