Indian Flag
Government Of India
A-
A
A+

Nagamese Speech-to-Text

Automatic Speech Recognition (ASR) model for Nagamese speech, designed to transcribe spoken Nagamese into text for real-world usage.

About Model

This is an Automatic Speech Recognition (ASR) model for Nagamese, a widely spoken creole language of Northeast India. The model processes 16 kHz audio and generates text transcriptions reflecting natural, conversational Nagamese speech. The system is built on the Whisper-Small architecture and adapted specifically for Nagamese using real speech recordings. The model supports informal speech patterns, fillers, repetitions, and everyday vocabulary commonly used by Nagamese speakers. To improve fluency and transcription stability, the model was further refined using controlled synthetic speech data, while evaluation and validation were consistently performed on real Nagamese speech. This model is intended for: speech-to-text applications accessibility tools language technology research prototyping conversational and voice-enabled systems in Nagamese

Nagamese Speech-to-Text

Metadata Metadata

Attribution 4.0 International (CC BY- 4.0)

MWirelabs

Transformers

PyTorch

Open

MWire Labs

Social

23/01/26 13:03:49

Badal Nyalang

0

Activity Overview Activity Overview

  • Downloads0
  • Redirect 3
  • File Size 0
  • Views 121

Tags Tags

  • Automatic Speech Recognition
  • ASR
  • Speech Recognition
  • Nagamese
  • whisper
  • low-resource-language

License Control License Control

Attribution 4.0 International (CC BY- 4.0)

Related Datasets Related Datasets

Updated 11 month(s) ago
VAANI: Multi-modal, Multi-lingual Dataset
VAANI: Multi-modal, Multi-lingual Dataset
Information-
VAANI is a multi-modal, multi-lingual dataset designed to represent the rich linguistic diversity of India. It currently includes data from two phases—Phase 1 (80 districts) and Phase 2 (40 districts)—spanning a total of ~21,500 hours of spontaneous, image-prompted speech collected from more than 110K speakers across 120 districts, describing 210K images in 86 languages. From this, 835 hours of transcribed audio data is available, distributed nearly evenly across all 120 districts.
Bengali
Gujarati
Kannada
Nepali
Punjabi
Telugu
Urdu
Sindhi
English
Tamil
Low-Resource Languages
Marathi
Malayalam
Odia
speech transcription
spontaneous speech
Sangtam
Ao
Halbi
Malvi
Sadri
Chhattisgarhi
Bhili
Malvani
multilingual corpus
Sumi
Bagheli
Khorth
Nyishi
multimodal dataset
Shekhawati
Bagri
Mewati
Meitei
86 Indian languages
Surgujia
audio-visual dataset
Garo
MagadhiMagahi
TTS training
Wancho
Awadhi
Galo
Oriya
speech + image + text
Tenyidie
regional dialects
Rajbanshi
Nagamese
manual transcription
Thethi
LLM speech integration
KhorthKhotta
Konkani
quality evaluated
speaker identification
diverse demographics
telemedicine AI applications
Marwadi
speaker diversity
Tulu
Assamese
Harauti
Rajasthani
language identification
120+ districts
Tagin
Marwari
Gondi
Bajjika
22 Indian states
Surjapuri
dialect diversity
Kokborok
image-prompted data
Santali
speech enhancement
Khariboli
Rengma
geo-centric data collection
Hajong
Hindi
ASR training
Wagdi
Bhatri
Dorli
multi-modal language resources
reallife recording environments
Mewari
linguistic diversity
Nimadi
Khandeshi
data for conversational AI
benchmarking dataset
Lotha
Kurmali
Bhojpuri
NissiDafla
Angika
Lambani
Magahi
Jaipuri
Bihari
Chakhesang
Magadhi
Angami
Chakma
Duruwa
Bearybashe
Khortha
Kumaoni
Kurukh
Garhwali
Maithili
Agariya
Bundeli
large-scale speech corpus
  • See Upvoters1
  • Downloads139
  • File Size0
  • Views1,407

INDIAN INSTITUTE OF SCIENCE (IISC), BANGALORE

More Models from MWire Labs More Models from MWire Labs

Mizo OCR - Text Recognition for Mizo Language
OCR model for the Mizo language achieving 90.68% character accuracy on synthetic and curated printed text
Mizo
northeast-india
trocr
low-resource
OCR
Image-to-Text
  • See Upvoters0
  • Downloads2
  • File Size0
  • Views171
Updated 2 month(s) ago

MWIRE LABS

NE-OCR
NE-OCR is a multilingual Optical Character Recognition model developed by MWire Labs to accurately recognize printed text from documents in Northeast Indian languages. The model supports Assamese, Bodo, English, Garo, Hindi, Khasi, Kokborok, Meitei (Bengali script), Meitei (Meitei Mayek script), Mizo, Nagamese, and Nyishi. It is designed to enable reliable digitization of books, newspapers, government records, educational materials, and cultural archives from Northeast India where mainstream OCR
Printed Text Recognition
Northeast India OCR
Multilingual OCR
vitstr
doctr
BODO
Optical Character Recognition
Mizo
khasi
northeast-india
Nagamese
Meitei
Nyishi
Kokborok
Garo
OCR
  • See Upvoters0
  • Downloads7
  • File Size0
  • Views128
Updated 2 month(s) ago

MWIRE LABS

Nagamese Speech-to-Text
Automatic Speech Recognition (ASR) model for Nagamese speech, designed to transcribe spoken Nagamese into text for real-world usage.
whisper
Speech Recognition
ASR
Automatic Speech Recognition
low-resource-language
Nagamese
  • See Upvoters0
  • Downloads3
  • File Size0
  • Views122
Updated 2 month(s) ago

MWIRE LABS

Garo OCR - Text Recognition for Garo
OCR model for the Garo language achieving 93.13% character accuracy.
Garo
florence-2
northeast-india
OCR
Image-to-Text
  • See Upvoters0
  • Downloads0
  • File Size0
  • Views163
Updated 2 month(s) ago

MWIRE LABS

Northeast Language Identification
NE-LID is a fast and accurate language identification model for Northeast Indian languages using character level features. It is designed for low resource and script diverse text and achieves high accuracy on short sentences.
Multilingual
language identification
low-resource
northeast-india
MWire Labs
fasttext
fastText
  • See Upvoters1
  • Downloads11
  • File Size0
  • Views596
Updated 4 month(s) ago

MWIRE LABS

NortheastNER
NortheastNER is a token classification model built on XLM-RoBERTa and fine-tuned on ~25k sentences from gazetteers, news, and cultural texts across Northeast India. It detects region-specific entities, places, tribes, festivals, tourist sites, flora, fauna, and experimental local names; ideal for low-resource NER, regional search, cultural analytics, and knowledge graph applications.
Conservation
Token Classification
NER
XLM-RoBERTa
low-resource
northeast-india
Meghalaya
Northeast India
  • See Upvoters0
  • Downloads12
  • File Size0
  • Views307
Updated 6 month(s) ago

MWIRE LABS

Kren-M
Northeast India's first AI language model. Kren-M is a 2.6B parameter bilingual model for Khasi-English, built on Gemma-2-2B. Features Kren-NE custom tokenizer covering 7 NE languages (Khasi, Garo, Mizo, Assamese, Manipuri, Nagamese, Nyishi) with 35.7% efficiency gain. Trained on 5.43M Khasi sentences. Capabilities: bidirectional translation, natural conversation, cultural context. Designed for language preservation across Northeast India
Northeast India Languages
Tokenizer
Foundational model
Kren-M
bilingual
continued-pretraining
Northeast India
khasi
northeast-india
Garo
low-resource
Instruction-Tuning
Indian Languages
  • See Upvoters0
  • Downloads30
  • File Size0
  • Views754
Updated 6 month(s) ago

MWIRE LABS

NE-BERT
NE-BERT is Northeast India's first domain-specific multilingual foundation model. Built on the ModernBERT architecture and trained on 8.3 million sentences, it supports 9 regional languages: Assamese, Khasi, Garo, Manipuri (Meitei), Mizo, Nyishi, Nagamese, Kokborok, and Pnar. It achieves State-of-the-Art performance on regional benchmarks and offers 1.6x faster inference, bridging the digital divide for low-resource languages.
kokborok
Mizo
token-efficiency
northeast bert
low-resource-NLP
Masked Language Modeling
Assamese
Garo
Nyishi
Meitei
Nagamese
northeast-india
khasi
A'chik
modernbert
mwirelabs
Pnar
  • See Upvoters0
  • Downloads20
  • File Size0
  • Views555
Updated 6 month(s) ago

MWIRE LABS

KhasiBERT
Khasi language model trained on 3.6M sentences using RoBERTa architecture. 110M parameters. Supports NLP tasks for Khasi text processing.
austroasiatic
roberta
foundational-model
digital-india
Bert
safetensors
endpoints_compatible
autotrain_compatible
Fill-Mask
Indian Language
low-resource
region:us
kha
khasi
Meghalaya
masked-lm
  • See Upvoters1
  • Downloads27
  • File Size0
  • Views861
Updated 8 month(s) ago

MWIRE LABS

Khasi English Semantic Search Model
Khasi-English semantic search model, trained on 66,794 pairs with 0.69-0.74 similarity. ~90MB, supports Meghalaya tourism/culture. By MWirelabs
Meghalaya
safetensors
Sentence Similarity
cross-lingual
en
autotrain_compatible
semantic search
sentence-transformers
license:cc0-1.0
kha
khasi
text-embeddings-inference
khasi-culture
  • See Upvoters0
  • Downloads24
  • File Size0
  • Views785
Updated 8 month(s) ago

MWIRE LABS