Indian Flag
Government Of India
A-
A
A+

SPRING LAB GUJARATI-STREAMING

Automatic Speech Recognition (ASR) model for Gujarati speech recognition, processing audio and transcribing spoken content into text.

About Model

Automatic Speech Recognition (ASR) model for Gujarati speech recognition, developed using the Icefall toolkit with the Zipformer architecture. The model is trained on the Gujarati dataset consisting of approximately 300 hours of labelled speech. It is trained on 16 kHz audio, including naturally occurring code-mixed speech, enabling robust recognition of bilingual Indian speech patterns. The system is based on a 65M-parameter Zipformer-Medium encoder, paired with an RNN-T prediction network and joiner, forming a low-latency streaming ASR model with 16 encoder layers and a 512-dimensional representation.

SPRING LAB GUJARATI-STREAMING

Metadata Metadata

Attribution 4.0 International (CC BY- 4.0)

SPRING LAB IITM

Speech -to-text Conversion

Other

Open

Science, Technology and Research

16/12/25 06:40:49

Gokulapriya

258.07 MB

Activity Overview Activity Overview

  • Downloads0
  • Downloads 11
  • Views 176
  • File Size 258.07 MB

Tags Tags

  • ASR
  • IITM
  • spring_lab
  • streaming
  • MODELS
  • zipformer
  • Icefall-K2
  • Pytorch

License Control License Control

Attribution 4.0 International (CC BY- 4.0)

Version Control Version Control

FolderVersion 1(258.07 MB)
  • admin·2 month(s) ago
    • undefined
      jit_script_chunk_32_left_128.pt
    • text/plain
      tokens.txt

More Models from Digital India BHASHINI Division More Models from Digital India BHASHINI Division

SPRING-INX-DATA2VEC-AQC-URDU
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
ssl
urdu
IITM
spring_lab
Data2vec_aqc
SSL_finetunning
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views22
Updated 4 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-TELUGU
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
ssl
telugu
IITM
spring_lab
Data2vec_aqc
SSL_finetunning
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views11
Updated 4 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-TAMIL
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
SSL_finetunning
Data2vec_aqc
spring_lab
IITM
tamil
ssl
  • See Upvoters0
  • Downloads1
  • File Size3.52 GB
  • Views10
Updated 4 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-BENGALI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
bengali
SSL_finetunning
low-resource-languages
ssl
Data2vec_aqc
spring_lab
IITM
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views57
Updated 28 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-BODO
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
Data2vec_aqc
ssl
IITM
spring_lab
SSL_finetunning
low-resource-language
BODO
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views70
Updated 28 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-BHOJPURI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
Bhojpuri
low-resource-language
SSL_finetunning
Data2vec_aqc
spring_lab
IITM
ssl
  • See Upvoters0
  • Downloads2
  • File Size3.52 GB
  • Views65
Updated 28 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-MALAYALAM
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
ssl
malayalam
IITM
spring_lab
Data2vec_aqc
SSL_finetunning
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views49
Updated 28 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-KANNADA
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
ssl
IITM
spring_lab
kannada
Data2vec_aqc
SSL_finetunning
  • See Upvoters0
  • Downloads1
  • File Size3.52 GB
  • Views52
Updated 28 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-MARATHI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
SSL_finetunning
Data2vec_aqc
spring_lab
IITM
ssl
Marathi
  • See Upvoters0
  • Downloads1
  • File Size3.52 GB
  • Views58
Updated 28 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-SANSKRIT
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
SSL_finetunning
Sanskrit
ssl
IITM
spring_lab
Data2vec_aqc
low-resource-language
  • See Upvoters0
  • Downloads2
  • File Size3.52 GB
  • Views56
Updated 28 day(s) ago

DIGITAL INDIA BHASHINI DIVISION