Automatic Speech Recognition (ASR) model for Nagamese speech, designed to transcribe spoken Nagamese into text for real-world usage.
This is an Automatic Speech Recognition (ASR) model for Nagamese, a widely spoken creole language of Northeast India. The model processes 16 kHz audio and generates text transcriptions reflecting natural, conversational Nagamese speech. The system is built on the Whisper-Small architecture and adapted specifically for Nagamese using real speech recordings. The model supports informal speech patterns, fillers, repetitions, and everyday vocabulary commonly used by Nagamese speakers. To improve fluency and transcription stability, the model was further refined using controlled synthetic speech data, while evaluation and validation were consistently performed on real Nagamese speech. This model is intended for: speech-to-text applications accessibility tools language technology research prototyping conversational and voice-enabled systems in Nagamese
Attribution 4.0 International (CC BY- 4.0)
MWirelabs
Transformers
PyTorch
Open
Social
23/01/26 13:03:49
0
Attribution 4.0 International (CC BY- 4.0)
© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.