SPRING-INX-DATA2VEC-AQC-TELUGU

An Automatic Speech Recognition (ASR) model for Telugu that transcribes spoken audio into text. The inference code, installation requirements, and usage instructions are available in the SPRING Lab, IIT Madras GitHub repository: https://github.com/Speech-Lab-IITM/Fairseq-Inference

About Model

Data2vec-aqc is a self-supervised learning (SSL) model for speech representation learning, designed to improve Automatic Speech Recognition (ASR) in low-resource settings. It is built with the Fairseq toolkit and extends the original data2vec framework with three modules: a quantizer (similar to wav2vec 2.0), a clustering module (from ccc-wav2vec 2.0), and a cross-contrastive loss mechanism. The model uses a Transformer-based architecture, consistent with data2vec, and is trained in a teacher-student setup. The student network processes randomly augmented versions of audio samples, while the teacher network (an exponential moving average of the student) provides target representations. The student learns to predict the teacher’s contextualised latent representations, which encourages robust feature learning.
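The teacher update described above can be illustrated with a minimal sketch. It is not taken from the data2vec-aqc code: the function name `ema_update`, the `decay` value, and the use of plain Python lists in place of model tensors are all assumptions made for illustration.

```python
from typing import Dict, List


def ema_update(
    teacher: Dict[str, List[float]],
    student: Dict[str, List[float]],
    decay: float,
) -> None:
    """Update teacher weights in place: t <- decay * t + (1 - decay) * s.

    Hypothetical helper: parameters are stored as name -> list of floats
    so the sketch runs without any deep learning library.
    """
    if not 0.0 <= decay <= 1.0:
        raise ValueError("decay must be in [0, 1]")
    for name, student_values in student.items():
        teacher_values = teacher[name]
        for i, s in enumerate(student_values):
            teacher_values[i] = decay * teacher_values[i] + (1.0 - decay) * s


# Toy weights (illustrative only): the teacher starts at zero and
# drifts slowly toward the student after each update.
teacher = {"w": [0.0, 0.0]}
student = {"w": [1.0, 2.0]}
ema_update(teacher, student, decay=0.9)
print(teacher)
```

A decay close to 1 makes the teacher change slowly, so its target representations stay stable while the student is still being optimised.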

Metadata

License: Attribution 4.0 International (CC BY 4.0)
Hosted By: SPRING LAB IITM
Task Type: Fine-Tuned Model
Model Format: PyTorch
Visibility: Open
Source Organisation: Digital India BHASHINI Division
Sector: Science, Technology and Research
Updated Date & Time: 03/02/26 05:18:00
Created By: Gokulapriya
Size: 3.52 GB
File: SPRING_INX_data2vec_aqc_Telugu.pt (3.52 GB)

Version Control

Version 1 (3.52 GB)

admin · 5 month(s) ago
- SPRING_INX_data2vec_aqc_Telugu.pt
- SPRING_INX_Telugu_dict.txt
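
The release ships a dictionary file alongside the checkpoint. The sketch below shows one way such a file could be inspected, assuming it follows the usual Fairseq dictionary layout of one `<symbol> <count>` pair per line. That layout is an assumption about `SPRING_INX_Telugu_dict.txt`, not something stated on this page, and the function name `parse_fairseq_dict` and the sample entries are hypothetical.

```python
from typing import Dict


def parse_fairseq_dict(text: str) -> Dict[str, int]:
    """Parse '<symbol> <count>' lines into a symbol -> count mapping.

    Simplified, hypothetical helper: lines carrying extra flags after
    the count are not handled and will raise ValueError.
    """
    entries: Dict[str, int] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        # Split on the last space so the count is always the final field.
        symbol, _, count = line.rpartition(" ")
        if not symbol:
            raise ValueError(f"malformed dictionary line: {line!r}")
        entries[symbol] = int(count)
    return entries


# Made-up sample content, not the real dictionary.
sample = "| 1200\nఅ 950\nక 870\n"
vocab = parse_fairseq_dict(sample)
print(len(vocab), "symbols")
```

To inspect the real file, its contents could be read with `open(path, encoding="utf-8").read()` and passed to the same function.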
