Indian Flag
Government Of India
A-
A
A+
ORGANISATION

Indic-Conformer model for ASR

Indo-Aryan Indic-Conformer is a multilingual speech model for North-Indian languages. This model is based on Conformer large architecture, with 115M parameters.

About Model

Bhashini - The Indo-Aryan Indic-Conformer is a multilingual automatic speech recognition (ASR) model designed specifically for North-Indian languages. It is based on the Conformer large architecture, which is known for its efficiency and accuracy in processing speech signals. The model contains 115 million parameters, enabling it to effectively transcribe spoken language into text with high precision.

This ASR model has been trained on the Shrutlip dataset, a rich dataset designed to enhance automatic speech recognition capabilities in Indian languages. The model primarily supports the Odia language and has been developed by AI4Bharat, a leading research initiative focused on advancing AI-driven solutions for Indian languages.

With a batch processing setup, this model is optimized for large-scale speech-to-text tasks across general domains. It is a valuable resource for applications in speech transcription, voice-enabled interfaces, digital accessibility, and natural language processing (NLP) research. Given the increasing demand for multilingual ASR systems, this model serves as a foundational tool for improving speech technology in India’s diverse linguistic landscape.

The Indo-Aryan Indic-Conformer is open-source, and its implementation is available on GitHub, making it accessible for researchers, developers, and AI practitioners working in the domain of Indian language speech processing.

For more details about the use of model, refer to github: https://github.com/AI4Bharat/IndicTrans2/tree/main

Indic-Conformer model for ASR

Metadata Metadata

MIT

AI4Bharat

Speech Recognition Model

Other

Open

Sector Agnostic

05/03/25 15:23:44

Admin

64.91 KB

indic-asr-api-backend-master ( 7 files, 1 directories )


Directory
serving

2 files, 7 directories

undefined
.gitignore

1.76 KB

undefined
api.py

4.95 KB

application/json
conformer.json

211 Bytes

undefined
example_ai4b_asr_rest_api.py

1.67 KB

undefined
LICENSE

1.04 KB

text/markdown
README.md

1.26 KB

text/plain
requirements.txt

66 Bytes

Activity Overview Activity Overview

  • Downloads2
  • Downloads 98
  • File Size 64.91 KB
  • Views 2,947

Tags Tags

  • Automatic Speech Recognition
  • Speech Technology
  • Speech Processing
  • Speech Lab
  • Bhashini

License Control License Control

MIT

Version Control Version Control

FolderVersion 1(64.91 KB)
  • admin·1 year(s) ago
    • chevron_rightFolder
      indic-asr-api-backend-master
      • chevron_rightFolder
        serving
      • undefined
        .gitignore
      • undefined
        api.py
      • application/json
        conformer.json
      • undefined
        example_ai4b_asr_rest_api.py
      • undefined
        LICENSE
      • text/markdown
        README.md
      • text/plain
        requirements.txt

More Models from TechCorp More Models from TechCorp

SPRING-INX-DATA2VEC-AQC-SANSKRIT
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text. The inference code, installation requirements, and usage instructions are available in the SPRING Lab, IIT Madras GitHub repository: https://github.com/Speech-Lab-IITM/Fairseq-Inference
low-resource-language
Sanskrit
ssl
IITM
spring_lab
Data2vec_aqc
SSL_finetunning
  • See Upvoters0
  • Downloads5
  • File Size3.52 GB
  • Views192
Updated 4 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-PUNJABI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text. The inference code, installation requirements, and usage instructions are available in the SPRING Lab, IIT Madras GitHub repository: https://github.com/Speech-Lab-IITM/Fairseq-Inference
low-resource-language
SSL_finetunning
Data2vec_aqc
PUNJABI
spring_lab
IITM
ssl
  • See Upvoters0
  • Downloads3
  • File Size3.52 GB
  • Views186
Updated 4 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-ODIA
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text. The inference code, installation requirements, and usage instructions are available in the SPRING Lab, IIT Madras GitHub repository: https://github.com/Speech-Lab-IITM/Fairseq-Inference
low-resource-language
SSL_finetunning
Data2vec_aqc
spring_lab
IITM
ssl
Odia
  • See Upvoters0
  • Downloads4
  • File Size3.52 GB
  • Views155
Updated 4 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-MALAYALAM
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text. The inference code, installation requirements, and usage instructions are available in the SPRING Lab, IIT Madras GitHub repository: https://github.com/Speech-Lab-IITM/Fairseq-Inference
SSL_finetunning
low-resource-language
Data2vec_aqc
spring_lab
IITM
malayalam
ssl
  • See Upvoters0
  • Downloads5
  • File Size3.52 GB
  • Views200
Updated 4 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-MARATHI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text. The inference code, installation requirements, and usage instructions are available in the SPRING Lab, IIT Madras GitHub repository: https://github.com/Speech-Lab-IITM/Fairseq-Inference
SSL_finetunning
Marathi
ssl
IITM
spring_lab
Data2vec_aqc
low-resource-language
  • See Upvoters0
  • Downloads6
  • File Size3.52 GB
  • Views148
Updated 4 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-KANNADA
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text. The inference code, installation requirements, and usage instructions are available in the SPRING Lab, IIT Madras GitHub repository: https://github.com/Speech-Lab-IITM/Fairseq-Inference
SSL_finetunning
Data2vec_aqc
kannada
spring_lab
IITM
ssl
low-resource-language
  • See Upvoters0
  • Downloads5
  • File Size3.52 GB
  • Views129
Updated 4 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-BHOJPURI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text. The inference code, installation requirements, and usage instructions are available in the SPRING Lab, IIT Madras GitHub repository: https://github.com/Speech-Lab-IITM/Fairseq-Inference
SSL_finetunning
Bhojpuri
Data2vec_aqc
spring_lab
IITM
ssl
low-resource-language
  • See Upvoters0
  • Downloads7
  • File Size3.52 GB
  • Views184
Updated 4 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-BODO
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text. The inference code, installation requirements, and usage instructions are available in the SPRING Lab, IIT Madras GitHub repository: https://github.com/Speech-Lab-IITM/Fairseq-Inference
IITM
BODO
low-resource-language
SSL_finetunning
Data2vec_aqc
spring_lab
ssl
  • See Upvoters0
  • Downloads3
  • File Size3.52 GB
  • Views193
Updated 4 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-BENGALI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text. The inference code, installation requirements, and usage instructions are available in the SPRING Lab, IIT Madras GitHub repository: https://github.com/Speech-Lab-IITM/Fairseq-Inference
SSL_finetunning
ssl
bengali
IITM
low-resource-languages
spring_lab
Data2vec_aqc
  • See Upvoters0
  • Downloads5
  • File Size3.52 GB
  • Views168
Updated 4 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-TAMIL
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text. The inference code, installation requirements, and usage instructions are available in the SPRING Lab, IIT Madras GitHub repository: https://github.com/Speech-Lab-IITM/Fairseq-Inference
SSL_finetunning
ssl
tamil
IITM
spring_lab
Data2vec_aqc
low-resource-language
  • See Upvoters0
  • Downloads4
  • File Size3.52 GB
  • Views123
Updated 4 day(s) ago

DIGITAL INDIA BHASHINI DIVISION