Indian Flag
Government Of India
A-
A
A+

SPRING LAB ODIA-STREAMING

Automatic Speech Recognition (ASR) model for Odia speech recognition, processing audio and transcribing spoken content into text.

About Model

Automatic Speech Recognition (ASR) model for Odia speech recognition, developed using the Icefall toolkit with the Zipformer architecture. The model is trained on a dataset consisting of approximately 100 hours of labelled speech. It is trained on 16 kHz audio, including naturally occurring code-mixed speech, enabling robust recognition of bilingual Indian speech patterns. The system is based on a 65M-parameter Zipformer-Medium encoder, paired with an RNN-T prediction network and joiner, forming a low-latency streaming ASR model with 16 encoder layers and a 512-dimensional representation.

SPRING LAB ODIA-STREAMING

Metadata Metadata

Attribution 4.0 International (CC BY- 4.0)

SPRING LAB IITM

Speech -to-text Conversion

PyTorch

Open

Science, Technology and Research

12/12/25 07:32:04

Gokulapriya

253.36 MB

Activity Overview Activity Overview

  • Downloads0
  • Downloads 3
  • Views 133
  • File Size 253.36 MB

Tags Tags

  • ASR
  • IITM
  • spring_lab
  • streaming
  • MODELS
  • Odia
  • Icefall-K2
  • zipformer

License Control License Control

Attribution 4.0 International (CC BY- 4.0)

Version Control Version Control

FolderVersion 1(253.36 MB)
  • admin·4 month(s) ago
    • undefined
      jit_script_chunk_32_left_128.pt
    • text/plain
      tokens.txt

More Models from Digital India BHASHINI Division More Models from Digital India BHASHINI Division

SANTHAM-Gemma3-4B-SH-Seg-Poetry-Finetuned
SANTHAM-Gemma3-4B-SH-Seg-Poetry-Finetuned is a model designed to translate Sanskrit into Tamil specialized on Segmented text obtained using Sanskrit Heritage segmenter.
translation
poetry
santham
Segmened
language:tam
language:san
  • See Upvoters0
  • Downloads0
  • File Size115.62 MB
  • Views55
Updated 25 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SANTHAM-Gemma3-4B-Finetuned
SANTHAM-Gemma3-4B-Finetuned is a Sanskrit → Tamil translation model built on the Gemma 3 (4B) architecture. It is trained on a parallel corpus developed as part of the Sanskrit Knowledge Accessor project, enabling it to capture linguistic nuances and generate fluent Tamil translations from classical Sanskrit inputs.
translation
language:san
language:tam
santham
  • See Upvoters0
  • Downloads2
  • File Size2.08 GB
  • Views81
Updated 25 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SANTHAM-Gemma3-4B-Anvaya-Poetry-Finetuned
SANTHAM-Gemma3-4B-Anvaya-Potery-Finetuned is a model designed to translate Sanskrit into Tamil specialized on Anvaya translation in Poetry.
poetry
santham
anvaya
language:tam
language:san
translation
  • See Upvoters0
  • Downloads2
  • File Size2.09 GB
  • Views48
Updated 25 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-URDU
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
spring_lab
Data2vec_aqc
low-resource-language
SSL_finetunning
ssl
urdu
IITM
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views71
Updated 1 month(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-TELUGU
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
spring_lab
low-resource-language
SSL_finetunning
Data2vec_aqc
IITM
telugu
ssl
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views60
Updated 1 month(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-TAMIL
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
SSL_finetunning
Data2vec_aqc
spring_lab
IITM
tamil
ssl
  • See Upvoters0
  • Downloads1
  • File Size3.52 GB
  • Views58
Updated 1 month(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-BENGALI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
Data2vec_aqc
IITM
spring_lab
ssl
low-resource-languages
SSL_finetunning
bengali
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views98
Updated 2 month(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-BODO
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
IITM
spring_lab
SSL_finetunning
low-resource-language
BODO
Data2vec_aqc
ssl
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views124
Updated 2 month(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-BHOJPURI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
SSL_finetunning
Data2vec_aqc
spring_lab
IITM
ssl
Bhojpuri
low-resource-language
  • See Upvoters0
  • Downloads4
  • File Size3.52 GB
  • Views115
Updated 2 month(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-MALAYALAM
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
SSL_finetunning
ssl
malayalam
IITM
spring_lab
Data2vec_aqc
low-resource-language
  • See Upvoters0
  • Downloads1
  • File Size3.52 GB
  • Views124
Updated 2 month(s) ago

DIGITAL INDIA BHASHINI DIVISION