Indian Flag
Government Of India
A-
A
A+

Indic-Conformer model for ASR

Indo-Aryan Indic-Conformer is a multilingual speech model for North-Indian languages. This model is based on Conformer large architecture, with 115M parameters.

About Model

Bhashini - The Indo-Aryan Indic-Conformer is a multilingual automatic speech recognition (ASR) model designed specifically for North-Indian languages. It is based on the Conformer large architecture, which is known for its efficiency and accuracy in processing speech signals. The model contains 115 million parameters, enabling it to effectively transcribe spoken language into text with high precision.

This ASR model has been trained on the Shrutlip dataset, a rich dataset designed to enhance automatic speech recognition capabilities in Indian languages. The model primarily supports the Odia language and has been developed by AI4Bharat, a leading research initiative focused on advancing AI-driven solutions for Indian languages.

With a batch processing setup, this model is optimized for large-scale speech-to-text tasks across general domains. It is a valuable resource for applications in speech transcription, voice-enabled interfaces, digital accessibility, and natural language processing (NLP) research. Given the increasing demand for multilingual ASR systems, this model serves as a foundational tool for improving speech technology in India’s diverse linguistic landscape.

The Indo-Aryan Indic-Conformer is open-source, and its implementation is available on GitHub, making it accessible for researchers, developers, and AI practitioners working in the domain of Indian language speech processing.

For more details about the use of model, refer to github: https://github.com/AI4Bharat/IndicTrans2/tree/main

Indic-Conformer model for ASR

Metadata Metadata

MIT

AI4Bharat

Speech Recognition Model

Other

Open

Sector Agnostic

05/03/25 15:23:44

Admin

64.91 KB

Activity Overview Activity Overview

  • Downloads2
  • Downloads 79
  • Views 1,741
  • File Size 64.91 KB

Tags Tags

  • Automatic Speech Recognition
  • Speech Technology
  • Speech Processing
  • Speech Lab
  • Bhashini

License Control License Control

MIT

Version Control Version Control

FolderVersion 1(64.91 KB)
  • admin·1 year(s) ago
    • chevron_rightFolder
      indic-asr-api-backend-master
      • chevron_rightFolder
        serving
      • undefined
        .gitignore
      • undefined
        api.py
      • application/json
        conformer.json
      • undefined
        example_ai4b_asr_rest_api.py
      • undefined
        LICENSE
      • text/markdown
        README.md
      • text/plain
        requirements.txt

More Models from TechCorp More Models from TechCorp

SPRING-INX-DATA2VEC-AQC-BENGALI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
IITM
spring_lab
Data2vec_aqc
ssl
low-resource-languages
SSL_finetunning
bengali
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views52
Updated 19 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-BODO
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
Data2vec_aqc
ssl
IITM
spring_lab
SSL_finetunning
low-resource-language
BODO
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views59
Updated 19 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-BHOJPURI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
SSL_finetunning
Bhojpuri
ssl
IITM
spring_lab
Data2vec_aqc
low-resource-language
  • See Upvoters0
  • Downloads1
  • File Size3.52 GB
  • Views52
Updated 19 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-MALAYALAM
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
ssl
malayalam
IITM
spring_lab
Data2vec_aqc
SSL_finetunning
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views39
Updated 19 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-KANNADA
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
IITM
low-resource-language
SSL_finetunning
Data2vec_aqc
kannada
spring_lab
ssl
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views45
Updated 19 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-MARATHI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
Marathi
low-resource-language
SSL_finetunning
Data2vec_aqc
spring_lab
IITM
ssl
  • See Upvoters0
  • Downloads1
  • File Size3.52 GB
  • Views52
Updated 19 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-SANSKRIT
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
ssl
IITM
spring_lab
Sanskrit
Data2vec_aqc
SSL_finetunning
  • See Upvoters0
  • Downloads1
  • File Size3.52 GB
  • Views49
Updated 19 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-PUNJABI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
ssl
IITM
spring_lab
PUNJABI
Data2vec_aqc
SSL_finetunning
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views40
Updated 19 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-ODIA
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
spring_lab
Odia
ssl
IITM
Data2vec_aqc
SSL_finetunning
low-resource-language
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views43
Updated 19 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING LAB TAMIL-STREAMING
Automatic Speech Recognition (ASR) model for Tamil speech recognition, processing audio and transcribing spoken content into text.
Icefall-K2
ASR
tamil
IITM
spring_lab
streaming
MODELS
zipformer
  • See Upvoters0
  • Downloads8
  • File Size260.42 MB
  • Views168
Updated 1 month(s) ago

DIGITAL INDIA BHASHINI DIVISION