Home/Models/SPRING-INX-DATA2VEC-AQC-MARATHI

ORGANISATION

SPRING-INX-DATA2VEC-AQC-MARATHI

Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.

Digital India BHASHINI Division
Gpriya

About Model

Data2vec-aqc is a self-supervised learning (SSL) model for speech representation learning, specifically designed to improve Automatic Speech Recognition (ASR) in low-resource settings. It is built using the Fairseq toolkit and extends the original data2vec framework by introducing three key modules: a quantizer (similar to wav2vec 2.0), a clustering module (from ccc-wav2vec 2.0), and a cross-contrastive loss mechanism. The model uses a Transformer-based architecture, consistent with data2vec, and operates in a teacher-student training setup. The student network processes randomly augmented versions of audio samples, while the teacher network (an exponentially moving average of the student) provides target representations. The student learns to predict the teacher’s contextualised latent representations, enabling robust feature learning.

SPRING-INX-DATA2VEC-AQC-MARATHI

Metadata

License

Attribution 4.0 International (CC BY- 4.0)

Hosted By

SPRING LAB IITM

Model Type

Fine-Tuned Model

Model Format

PyTorch

Visibility

Open

Source Organisation

Digital India BHASHINI Division

Sector

Science, Technology and Research

Updated Date & Time

30/01/26 06:08:55

Created By

Gokulapriya

Size

3.52 GB

SPRING_INX_data2vec_aqc_Marathi.pt ( 3.52 GB )

To preview this file, you need to be a registered user. Please complete the registration process to gain access and continue viewing the content.

Activity Overview

0
6
3.52 GB
136

License Control

Attribution 4.0 International (CC BY- 4.0)

Version Control

Version 1(3.52 GB)

admin·4 month(s) ago
- SPRING_INX_data2vec_aqc_Marathi.pt
- SPRING_INX_Marathi_dict.txt

More Models from Digital India BHASHINI Division

SANTHAM-Gemma3-4B-SH-Seg-Poetry-Finetuned

SANTHAM-Gemma3-4B-SH-Seg-Poetry-Finetuned is a model designed to translate Sanskrit into Tamil specialized on Segmented text obtained using Sanskrit Heritage segmenter.

translation

poetry

santham

Segmened

language:tam

language:san

0
17
115.62 MB
119

Updated 3 month(s) ago

DIGITAL INDIA BHASHINI DIVISION

View Details

SANTHAM-Gemma3-4B-Finetuned

SANTHAM-Gemma3-4B-Finetuned is a Sanskrit → Tamil translation model built on the Gemma 3 (4B) architecture. It is trained on a parallel corpus developed as part of the Sanskrit Knowledge Accessor project, enabling it to capture linguistic nuances and generate fluent Tamil translations from classical Sanskrit inputs.

translation

language:san

language:tam

santham

0
13
2.08 GB
197

Updated 3 month(s) ago

DIGITAL INDIA BHASHINI DIVISION

View Details

SANTHAM-Gemma3-4B-Anvaya-Poetry-Finetuned

SANTHAM-Gemma3-4B-Anvaya-Potery-Finetuned is a model designed to translate Sanskrit into Tamil specialized on Anvaya translation in Poetry.

poetry

santham

anvaya

language:tam

language:san

translation

0
13
2.09 GB
138

Updated 3 month(s) ago

DIGITAL INDIA BHASHINI DIVISION

View Details

SPRING-INX-DATA2VEC-AQC-URDU

Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.

spring_lab

Data2vec_aqc

low-resource-language

SSL_finetunning

ssl

urdu

IITM

0
3
3.52 GB
126

Updated 4 month(s) ago

DIGITAL INDIA BHASHINI DIVISION

View Details

SPRING-INX-DATA2VEC-AQC-TELUGU

Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.

spring_lab

low-resource-language

SSL_finetunning

Data2vec_aqc

IITM

telugu

ssl

0
3
3.52 GB
113

Updated 4 month(s) ago

DIGITAL INDIA BHASHINI DIVISION

View Details

SPRING-INX-DATA2VEC-AQC-TAMIL

Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.

low-resource-language

SSL_finetunning

Data2vec_aqc

spring_lab

IITM

tamil

ssl

0
4
3.52 GB
112

Updated 4 month(s) ago

DIGITAL INDIA BHASHINI DIVISION

View Details

SPRING-INX-DATA2VEC-AQC-BENGALI

Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.

IITM

ssl

bengali

low-resource-languages

spring_lab

Data2vec_aqc

SSL_finetunning

0
5
3.52 GB
156

Updated 4 month(s) ago

DIGITAL INDIA BHASHINI DIVISION

View Details

SPRING-INX-DATA2VEC-AQC-BODO

Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.

IITM

spring_lab

Data2vec_aqc

SSL_finetunning

low-resource-language

BODO

ssl

0
3
3.52 GB
178

Updated 4 month(s) ago

DIGITAL INDIA BHASHINI DIVISION

View Details

SPRING-INX-DATA2VEC-AQC-BHOJPURI

Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.

SSL_finetunning

Data2vec_aqc

spring_lab

IITM

ssl

Bhojpuri

low-resource-language

0
7
3.52 GB
172

Updated 4 month(s) ago

DIGITAL INDIA BHASHINI DIVISION

View Details

SPRING-INX-DATA2VEC-AQC-MALAYALAM

Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.

SSL_finetunning

ssl

malayalam

IITM

spring_lab

Data2vec_aqc

low-resource-language

0
5
3.52 GB
186

Updated 4 month(s) ago

DIGITAL INDIA BHASHINI DIVISION

View Details

Accessibility options by UX4G

SPRING-INX-DATA2VEC-AQC-MARATHI

About Model

SPRING-INX-DATA2VEC-AQC-MARATHI

Metadata

SPRING_INX_data2vec_aqc_Marathi.pt ( 3.52 GB )

Activity Overview

Tags

License Control

Version Control

Version 1(3.52 GB)

SPRING_INX_data2vec_aqc_Marathi.pt

SPRING_INX_Marathi_dict.txt

More Models from Digital India BHASHINI Division

AIKosh

Resources

Support