AI4Bharat - IndicSeamless

IndicSeamless is a multilingual, sequence-to-sequence pre-trained model based on SeamlessM4T-v2 and fine-tuned on the BhasaAnuvaad dataset.

About Model

IndicSeamless is a multilingual, sequence-to-sequence pre-trained model that builds on Meta’s SeamlessM4T-v2 architecture and is fine-tuned on AI4Bharat’s large BhasaAnuvaad corpus for speech-to-text (STT) across 13 Indian languages and English. It preserves SeamlessM4T-v2’s unified handling of multiple modalities and languages while specializing in Indic speech data.
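Because the model is described as a fine-tune of SeamlessM4T-v2, it can plausibly be loaded with the SeamlessM4T-v2 classes in Hugging Face Transformers. The following is a minimal sketch, not the publisher's reference code. It assumes the checkpoint is published under the repository id `ai4bharat/indic-seamless`, that `torch`, `torchaudio`, and `transformers` are installed, and that input audio should be mono at 16 kHz; the function name `transcribe` and the default target language code `hin` are illustrative choices.

```python
TARGET_SAMPLE_RATE = 16_000  # assumption: SeamlessM4T feature extraction expects 16 kHz audio
MODEL_ID = "ai4bharat/indic-seamless"  # assumption: Hugging Face repository id of the checkpoint


def transcribe(wav_path: str, tgt_lang: str = "hin", model_id: str = MODEL_ID) -> str:
    """Return the text decoded from one audio file, in the language given by `tgt_lang`."""
    # Heavy imports live inside the function so that importing this module stays cheap.
    import torch
    import torchaudio
    from transformers import (
        SeamlessM4TFeatureExtractor,
        SeamlessM4TTokenizer,
        SeamlessM4Tv2ForSpeechToText,
    )

    device = "cuda" if torch.cuda.is_available() else "cpu"
    model = SeamlessM4Tv2ForSpeechToText.from_pretrained(model_id).to(device)
    feature_extractor = SeamlessM4TFeatureExtractor.from_pretrained(model_id)
    tokenizer = SeamlessM4TTokenizer.from_pretrained(model_id)

    # torchaudio returns a (channels, samples) tensor plus the file's sample rate.
    waveform, sample_rate = torchaudio.load(wav_path)
    waveform = waveform.mean(dim=0, keepdim=True)  # downmix to mono
    if sample_rate != TARGET_SAMPLE_RATE:
        waveform = torchaudio.functional.resample(
            waveform, orig_freq=sample_rate, new_freq=TARGET_SAMPLE_RATE
        )

    # Pass a 1-D float array so the feature extractor treats it as a single utterance.
    inputs = feature_extractor(
        waveform.squeeze(0).numpy(),
        sampling_rate=TARGET_SAMPLE_RATE,
        return_tensors="pt",
    ).to(device)

    with torch.no_grad():
        token_ids = model.generate(**inputs, tgt_lang=tgt_lang)[0].cpu().numpy().squeeze()
    return tokenizer.decode(token_ids, skip_special_tokens=True)
```

A call such as `transcribe("sample.wav", tgt_lang="hin")` would download the checkpoint on first use; `sample.wav` is a placeholder path. Loading the model once and reusing it across files would be preferable for batch work, and the listing's Creative Commons Attribution Non Commercial 4.0 license applies to any use.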

Metadata

Creative Commons Attribution Non Commercial 4.0

Sparsh Jain and Ashwin Sankar and Devilal Choudhary and Dhairya Suman and Nikhil Narasimhan and Mohammed Safi Ur Rahman Khan and Anoop Kunchukuttan and Mitesh M Khapra and Raj Dabre

Automatic Speech Recognition

N.A.

Open

AI4Bharat

Sector Agnostic

02/05/25 11:01:11

0

Activity Overview

  • Downloads 0
  • Redirect 31
  • File Size 0
  • Views 173

Tags

  • Transformers
  • Automatic Speech Recognition
  • seamless_m4t_v2

License Control

Creative Commons Attribution Non Commercial 4.0

More Models from AI4Bharat

AI4Bharat- 500 M - RomanSetu Multilingual Native-to-Roman Model
RomanSetu is a multilingual continual pretrained transformer model designed for transliteration across six Indic languages
Llama
Instruction-Tuning
Multilingual
LLaMA2
  • See Upvoters 1
  • Downloads 43
  • File Size 0
  • Views 767
Updated 10 month(s) ago

AI4BHARAT

AI4Bharat- 400 M - RomanSetu Multilingual Native-to-Roman Model
RomanSetu is a multilingual continual pretrained transformer model designed for transliteration across six Indic languages
Multilingual
LLaMA2
Llama
Instruction-Tuning
  • See Upvoters 1
  • Downloads 75
  • File Size 0
  • Views 912
Updated 10 month(s) ago

AI4BHARAT

AI4Bharat- Maithili - IndicConformer Automatic Speech Recognition (ASR) Model
This model takes in mono-channel audio files at a 16,000 Hz sampling rate (WAV format) and outputs the transcribed text of the speech contained in the audio.
Automatic Speech Recognition
Speech-to-Text
NLP
  • See Upvoters 0
  • Downloads 24
  • File Size 0
  • Views 612
Updated 10 month(s) ago

AI4BHARAT

AI4Bharat- Konkani - IndicConformer Automatic Speech Recognition (ASR) Model
Automatic Speech Recognition (ASR) model for Konkani that takes 16,000 Hz mono WAV audio and transcribes the spoken content into text.
Speech-to-Text
NLP
Automatic Speech Recognition
  • See Upvoters 0
  • Downloads 29
  • File Size 0
  • Views 617
Updated 10 month(s) ago

AI4BHARAT

AI4Bharat- Kashmiri - IndicConformer Automatic Speech Recognition (ASR) Model
This Automatic Speech Recognition (ASR) model transcribes Kashmiri speech from 16,000 Hz mono WAV audio files into text.
NLP
Speech-to-Text
Kashmiri
Automatic Speech Recognition
  • See Upvoters 0
  • Downloads 19
  • File Size 0
  • Views 621
Updated 10 month(s) ago

AI4BHARAT

AI4Bharat - Romansetu-200M - Multilingual LLM for Indian languages using romanization
RomanSetu efficiently unlocks the multilingual (Indian-language) capabilities of Large Language Models via romanization.
Llama
Instruction-Tuning
Multilingual
LLaMA2
  • See Upvoters 0
  • Downloads 5
  • File Size 0
  • Views 245
Updated 10 month(s) ago

AI4BHARAT

AI4Bharat - Romansetu-100M - Multilingual LLM for Indian languages using romanization
RomanSetu efficiently unlocks the multilingual (Indian-language) capabilities of Large Language Models via romanization.
Multilingual
LLaMA2
Llama
Instruction-Tuning
  • See Upvoters 0
  • Downloads 9
  • File Size 0
  • Views 444
Updated 10 month(s) ago

AI4BHARAT

AI4Bharat- Kannada - IndicConformer Automatic Speech Recognition (ASR) Model
This Kannada Automatic Speech Recognition (ASR) model transcribes 16 kHz mono-channel audio into text. It uses a Conformer-Large architecture with 120M parameters and a hybrid CTC-RNNT decoder for high-accuracy speech recognition.
Automatic Speech Recognition
Audio Processing
NLP
  • See Upvoters 0
  • Downloads 22
  • File Size 0
  • Views 632
Updated 10 month(s) ago

AI4BHARAT

AI4Bharat – Romanized Path – Base to Supervised Fine-Tuning (SFT)
The RomanSetu model is built on a base pretrained model that is supervised fine-tuned on instruction-following tasks using romanized Indian languages.
Instruction-Tuning
Llama
LLaMA2
Multilingual
  • See Upvoters 0
  • Downloads 3
  • File Size 0
  • Views 186
Updated 10 month(s) ago

AI4BHARAT

AI4Bharat - IndicTrans2 Large-1B - English-to-Hindi (Devanagari): Language Translation Model
A large-scale neural machine translation (NMT) model with 1 billion parameters for high-quality translation from English to Hindi (Devanagari).
Multilingual
Transformer
cross-lingual
high-quality-translation
Large Model
low-resource-NLP
NLP
Machine Translation
  • See Upvoters 0
  • Downloads 31
  • File Size 0
  • Views 754
Updated 10 month(s) ago

AI4BHARAT