A Punjabi Automatic Speech Recognition (ASR) benchmark test dataset that supports the development of robust regional speech recognition systems.
The Kathbath-Punjabi-Test-Unknown dataset is a benchmark for testing Automatic Speech Recognition (ASR) systems in Punjabi. It is part of the Kathbath collection, which comprises 1,684 hours of labeled speech data spanning 12 Indian languages and is tailored for general-domain scenarios. Submitted by Tahir Javed, the dataset is a resource for advancing speech recognition for Punjabi and other regional Indian languages, and for developing reliable multilingual ASR systems.
Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.