An Automatic Speech Recognition (ASR) model for Urdu, built using the IndicConformer architecture with a Hybrid CTC-RNNT approach for accurate and efficient transcription.
The AI4Bharat IndicConformer-STT-UR-Hybrid-CTC-RNNT-Large model is an Automatic Speech Recognition (ASR) system that transcribes Urdu speech to text. It uses the IndicConformer architecture, which combines Conformer-based feature extraction with a hybrid CTC-RNNT approach (Connectionist Temporal Classification together with a Recurrent Neural Network Transducer). The architecture is designed for robust transcription, including in noisy environments and across diverse speech patterns. Intended applications include voice-driven interfaces, transcription services, accessibility solutions, and multilingual ASR systems that need Urdu speech recognition.
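To make the CTC half of the hybrid approach more concrete, the following is a minimal sketch of greedy CTC decoding: take the most likely label per audio frame, collapse consecutive repeats, then drop the blank symbol. It is an illustration only, not this model's actual decoding code. The function name `ctc_greedy_decode`, the toy vocabulary, and the frame labels are all hypothetical; the real model uses its own tokenizer and can also decode through the RNNT branch.

```python
from typing import List, Optional, Sequence


def ctc_greedy_decode(
    frame_ids: Sequence[int], vocab: Sequence[str], blank_id: int = 0
) -> str:
    """Collapse repeated frame labels, then drop blanks (greedy CTC rule)."""
    out: List[str] = []
    prev: Optional[int] = None
    for idx in frame_ids:
        # Emit a symbol only when the label changes and is not the blank.
        if idx != prev and idx != blank_id:
            out.append(vocab[idx])
        prev = idx
    return "".join(out)


# Hypothetical toy vocabulary (assumption); index 0 is the CTC blank.
VOCAB = ["<blank>", "س", "ل", "ا", "م"]

# Hypothetical per-frame argmax labels, as an acoustic model might emit them.
frames = [0, 1, 1, 0, 2, 3, 3, 0, 0, 4, 4]
print(ctc_greedy_decode(frames, VOCAB))
```

A blank between two identical labels keeps them as separate characters, which is how CTC represents doubled letters: `[1, 0, 1]` decodes to two characters, while `[1, 1]` decodes to one.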
MIT
AI4Bharat
Audio-to-text
N.A.
Open
Sector Agnostic
21/02/25 13:21:27
0