A large-scale Automatic Speech Recognition Model (ASR) model for Marathi, utilizing a hybrid CTC-RNNT decoder.
The ai4bharat/indicconformer_stt_mr_hybrid_ctc_rnnt_large model is an Automatic Speech Recognition (ASR) system designed for the Marathi language. It employs a Conformer-Large architecture with 120 million parameters, featuring 17 conformer blocks and a model dimension of 512. This model processes 16 kHz mono-channel audio (wav files) and outputs transcriptions in Marathi. Its hybrid CTC-RNNT decoder enhances recognition performance for spoken Marathi.
MIT
AI4Bharat
Audio-to-text
N.A.
Open
Sector Agnostic
21/02/25 13:21:35
0
MIT
© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.