Indian Flag
Government Of India
A-
A
A+

SPRING-INX-DATA2VEC-AQC-TELUGU

Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.

About Model

Data2vec-aqc is a self-supervised learning (SSL) model for speech representation learning, specifically designed to improve Automatic Speech Recognition (ASR) in low-resource settings. It is built using the Fairseq toolkit and extends the original data2vec framework by introducing three key modules: a quantizer (similar to wav2vec 2.0), a clustering module (from ccc-wav2vec 2.0), and a cross-contrastive loss mechanism. The model uses a Transformer-based architecture, consistent with data2vec, and operates in a teacher-student training setup. The student network processes randomly augmented versions of audio samples, while the teacher network (an exponentially moving average of the student) provides target representations. The student learns to predict the teacher’s contextualised latent representations, enabling robust feature learning.

SPRING-INX-DATA2VEC-AQC-TELUGU

Metadata Metadata

Attribution 4.0 International (CC BY- 4.0)

SPRING LAB IITM

Fine-Tuned Model

PyTorch

Open

Science, Technology and Research

03/02/26 05:18:00

Gokulapriya

3.52 GB

Activity Overview Activity Overview

  • Downloads0
  • Downloads 0
  • Views 6
  • File Size 3.52 GB

Tags Tags

  • ssl
  • telugu
  • IITM
  • spring_lab
  • Data2vec_aqc
  • SSL_finetunning
  • low-resource-language

License Control License Control

Attribution 4.0 International (CC BY- 4.0)

Version Control Version Control

FolderVersion 1(3.52 GB)
  • admin·21 day(s) ago
    • undefined
      SPRING_INX_data2vec_aqc_Telugu.pt
    • text/plain
      SPRING_INX_Telugu_dict.txt

More Models from Digital India BHASHINI Division More Models from Digital India BHASHINI Division

SPRING-INX-DATA2VEC-AQC-URDU
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
ssl
urdu
IITM
spring_lab
Data2vec_aqc
SSL_finetunning
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views14
Updated 1 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-TELUGU
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
ssl
telugu
IITM
spring_lab
Data2vec_aqc
SSL_finetunning
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views7
Updated 1 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-TAMIL
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
SSL_finetunning
Data2vec_aqc
spring_lab
IITM
tamil
ssl
  • See Upvoters0
  • Downloads1
  • File Size3.52 GB
  • Views7
Updated 1 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-BENGALI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
bengali
SSL_finetunning
low-resource-languages
ssl
Data2vec_aqc
spring_lab
IITM
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views57
Updated 25 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-BODO
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
Data2vec_aqc
ssl
IITM
spring_lab
SSL_finetunning
low-resource-language
BODO
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views64
Updated 25 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-BHOJPURI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
Bhojpuri
low-resource-language
SSL_finetunning
Data2vec_aqc
spring_lab
IITM
ssl
  • See Upvoters0
  • Downloads1
  • File Size3.52 GB
  • Views59
Updated 25 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-MALAYALAM
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
ssl
malayalam
IITM
spring_lab
Data2vec_aqc
SSL_finetunning
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views44
Updated 25 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-KANNADA
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
ssl
IITM
spring_lab
kannada
Data2vec_aqc
SSL_finetunning
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views50
Updated 25 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-MARATHI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
SSL_finetunning
Data2vec_aqc
spring_lab
IITM
ssl
Marathi
  • See Upvoters0
  • Downloads1
  • File Size3.52 GB
  • Views58
Updated 25 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-SANSKRIT
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
SSL_finetunning
Sanskrit
ssl
IITM
spring_lab
Data2vec_aqc
low-resource-language
  • See Upvoters0
  • Downloads2
  • File Size3.52 GB
  • Views55
Updated 25 day(s) ago

DIGITAL INDIA BHASHINI DIVISION