This model pairs a Conformer-Large encoder of about 120M parameters with a hybrid CTC-RNNT decoder.
This model is a large-scale Automatic Speech Recognition (ASR) model developed by AI4Bharat for the Nepali language. It employs a hybrid architecture combining Connectionist Temporal Classification (CTC) and Recurrent Neural Network Transducer (RNNT) decoders, built upon a Conformer-Large encoder comprising 17 blocks and approximately 120 million parameters.
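To make the CTC half of the hybrid decoder concrete, here is a minimal sketch of greedy CTC decoding: take the best label per audio frame, collapse consecutive repeats, and drop the blank symbol. It is an illustration only, not AI4Bharat's code. The function name `ctc_greedy_decode`, the toy scores, and the choice of `0` as the blank index are all assumptions made for this example.

```python
from typing import List, Optional, Sequence


def ctc_greedy_decode(frame_scores: Sequence[Sequence[float]], blank_id: int) -> List[int]:
    """Collapse per-frame best labels into a token sequence (greedy CTC rule)."""
    decoded: List[int] = []
    previous: Optional[int] = None
    for scores in frame_scores:
        # Pick the highest-scoring label for this frame.
        label = max(range(len(scores)), key=scores.__getitem__)
        # Emit only when the label differs from the previous frame and is not blank.
        if label != previous and label != blank_id:
            decoded.append(label)
        previous = label
    return decoded


# Hypothetical toy input: 6 frames over a 3-symbol vocabulary, blank index 0 (assumed).
toy_scores = [
    [0.1, 0.8, 0.1],    # label 1
    [0.1, 0.7, 0.2],    # label 1 again, collapsed as a repeat
    [0.9, 0.05, 0.05],  # blank
    [0.2, 0.7, 0.1],    # label 1, a new emission because a blank intervened
    [0.1, 0.2, 0.7],    # label 2
    [0.8, 0.1, 0.1],    # blank
]
print(ctc_greedy_decode(toy_scores, blank_id=0))  # prints [1, 1, 2]
```

The RNNT branch differs in that it conditions each emission on previously emitted tokens through a prediction network, so it cannot be decoded with this frame-independent rule.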
MIT
N.A.
Automatic Speech Recognition Model
N.A.
Open
Sector Agnostic
02/05/25 11:01:13
0