Indian Flag
Government Of India
A-
A
A+

SPRING-INX-DATA2VEC-AQC-MARATHI

Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.

About Model

Data2vec-aqc is a self-supervised learning (SSL) model for speech representation learning, specifically designed to improve Automatic Speech Recognition (ASR) in low-resource settings. It is built using the Fairseq toolkit and extends the original data2vec framework by introducing three key modules: a quantizer (similar to wav2vec 2.0), a clustering module (from ccc-wav2vec 2.0), and a cross-contrastive loss mechanism. The model uses a Transformer-based architecture, consistent with data2vec, and operates in a teacher-student training setup. The student network processes randomly augmented versions of audio samples, while the teacher network (an exponentially moving average of the student) provides target representations. The student learns to predict the teacher’s contextualised latent representations, enabling robust feature learning.

SPRING-INX-DATA2VEC-AQC-MARATHI

Metadata Metadata

Attribution 4.0 International (CC BY- 4.0)

SPRING LAB IITM

Fine-Tuned Model

PyTorch

Open

Science, Technology and Research

30/01/26 06:08:55

Gokulapriya

3.52 GB

Activity Overview Activity Overview

  • Downloads0
  • Downloads 1
  • Views 42
  • File Size 3.52 GB

Tags Tags

  • Marathi
  • ssl
  • IITM
  • spring_lab
  • Data2vec_aqc
  • SSL_finetunning
  • low-resource-language

License Control License Control

Attribution 4.0 International (CC BY- 4.0)

Version Control Version Control

FolderVersion 1(3.52 GB)
  • admin·10 day(s) ago
    • undefined
      SPRING_INX_data2vec_aqc_Marathi.pt
    • text/plain
      SPRING_INX_Marathi_dict.txt

More Models from Digital India BHASHINI Division More Models from Digital India BHASHINI Division

SPRING-INX-DATA2VEC-AQC-BENGALI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
IITM
spring_lab
Data2vec_aqc
ssl
low-resource-languages
SSL_finetunning
bengali
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views38
Updated 10 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-BODO
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
Data2vec_aqc
ssl
IITM
spring_lab
SSL_finetunning
low-resource-language
BODO
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views45
Updated 10 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-BHOJPURI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
SSL_finetunning
Bhojpuri
ssl
IITM
spring_lab
Data2vec_aqc
low-resource-language
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views27
Updated 10 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-MALAYALAM
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
ssl
malayalam
IITM
spring_lab
Data2vec_aqc
SSL_finetunning
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views22
Updated 10 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-KANNADA
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
IITM
low-resource-language
SSL_finetunning
Data2vec_aqc
kannada
spring_lab
ssl
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views35
Updated 10 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-MARATHI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
Marathi
low-resource-language
SSL_finetunning
Data2vec_aqc
spring_lab
IITM
ssl
  • See Upvoters0
  • Downloads1
  • File Size3.52 GB
  • Views43
Updated 10 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-SANSKRIT
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
ssl
IITM
spring_lab
Sanskrit
Data2vec_aqc
SSL_finetunning
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views32
Updated 10 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-PUNJABI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
ssl
IITM
spring_lab
PUNJABI
Data2vec_aqc
SSL_finetunning
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views27
Updated 10 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-ODIA
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
spring_lab
Odia
ssl
IITM
Data2vec_aqc
SSL_finetunning
low-resource-language
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views30
Updated 10 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING LAB TAMIL-STREAMING
Automatic Speech Recognition (ASR) model for Tamil speech recognition, processing audio and transcribing spoken content into text.
Icefall-K2
ASR
tamil
IITM
spring_lab
streaming
MODELS
zipformer
  • See Upvoters0
  • Downloads7
  • File Size260.42 MB
  • Views128
Updated 25 day(s) ago

DIGITAL INDIA BHASHINI DIVISION