Indic-Conformer model for ASR

The Indo-Aryan Indic-Conformer is a multilingual speech recognition model for North Indian languages. It is based on the Conformer-large architecture and has 115M parameters.

About Model

The Indo-Aryan Indic-Conformer, published on Bhashini, is a multilingual automatic speech recognition (ASR) model designed for North Indian languages. It is based on the Conformer-large architecture, which combines convolution and self-attention layers to process speech signals. The model has 115 million parameters.
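To put the 115 million parameter figure in practical terms, the short sketch below estimates how much storage the weights alone would need in a few common numeric formats. The bytes-per-parameter values are the standard sizes of those formats; the function name `weight_memory_mb` is made up for this illustration, and the estimate ignores activations and runtime overhead, so treat it as a rough lower bound rather than a measured figure for this model.

```python
# Rough weight-storage estimate for a 115M-parameter model.
# Only the parameter count comes from the model description; the rest is
# plain arithmetic on standard numeric format sizes.
PARAM_COUNT = 115_000_000

# Size in bytes of one parameter in each format.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "int8": 1}


def weight_memory_mb(param_count: int, dtype: str) -> float:
    """Return approximate weight storage in megabytes (1 MB = 10**6 bytes)."""
    return param_count * BYTES_PER_PARAM[dtype] / 1e6


if __name__ == "__main__":
    for dtype in BYTES_PER_PARAM:
        print(f"{dtype}: ~{weight_memory_mb(PARAM_COUNT, dtype):.0f} MB")
```

Under these assumptions the weights come to roughly 460 MB in 32-bit floats and about half that in 16-bit floats.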

This ASR model was trained on the Shrutilipi dataset, a speech dataset built to improve automatic speech recognition for Indian languages. The model primarily supports Odia and was developed by AI4Bharat, a research initiative focused on AI solutions for Indian languages.

The model is set up for batch processing, which suits large-scale speech-to-text workloads across general domains. It can be used for speech transcription, voice-enabled interfaces, digital accessibility, and natural language processing (NLP) research. Given the growing demand for multilingual ASR systems, it serves as a foundation for improving speech technology across India’s diverse linguistic landscape.
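As an illustration of what batch processing can look like on the client side, here is a minimal sketch that splits a list of audio file paths into fixed-size batches and passes each batch to a transcription function. Everything named here is an assumption for the example: `batched`, `transcribe_all`, the `transcribe_batch` callable, and the default batch size of 8 are hypothetical and are not taken from the model's actual API or repository.

```python
from typing import Callable, Dict, Iterator, List, Sequence


def batched(items: Sequence[str], batch_size: int) -> Iterator[List[str]]:
    """Yield consecutive slices of `items`, each with at most `batch_size` entries."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    for start in range(0, len(items), batch_size):
        yield list(items[start:start + batch_size])


def transcribe_all(
    audio_paths: Sequence[str],
    transcribe_batch: Callable[[List[str]], List[str]],  # hypothetical ASR call
    batch_size: int = 8,  # assumed value, tune for the available hardware
) -> Dict[str, str]:
    """Transcribe every path batch by batch and map each path to its text."""
    results: Dict[str, str] = {}
    for batch in batched(audio_paths, batch_size):
        texts = transcribe_batch(batch)
        if len(texts) != len(batch):
            raise ValueError("transcriber returned a different number of results")
        results.update(zip(batch, texts))
    return results


if __name__ == "__main__":
    # Stand-in transcriber so the sketch runs without a model.
    def fake_transcriber(paths: List[str]) -> List[str]:
        return [f"<transcript of {p}>" for p in paths]

    print(transcribe_all(["a.wav", "b.wav", "c.wav"], fake_transcriber, batch_size=2))
```

Passing the transcriber in as a callable keeps the batching logic independent of how the model is actually served, so the same loop would work with a local model or a remote endpoint.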

The Indo-Aryan Indic-Conformer is open source, and its implementation is available on GitHub, making it accessible to researchers, developers, and AI practitioners working on Indian-language speech processing.

For more details on using the model, see the GitHub repository: https://github.com/AI4Bharat/IndicTrans2/tree/main

Metadata

MIT

AI4Bharat

Speech Recognition Model

Other

Open

Sector Agnostic

05/03/25 15:23:44

Admin

64.91 KB

Activity Overview

  • Downloads2
  • Downloads 86
  • Views 2,172
  • File Size 64.91 KB

Tags

  • Automatic Speech Recognition
  • Speech Technology
  • Speech Processing
  • Speech Lab
  • Bhashini

License Control

MIT

Version Control

Version 1 (64.91 KB)
  • admin · 1 year(s) ago
    • indic-asr-api-backend-master/
      • serving/
      • .gitignore
      • api.py
      • conformer.json (application/json)
      • example_ai4b_asr_rest_api.py
      • LICENSE
      • README.md (text/markdown)
      • requirements.txt (text/plain)
