Indian Flag
Government Of India
A-
A
A+

AI4Bharat-IndicTrans2 Distilled-200M -English-to-Hindi (Devanagari): Language Translation Model

A lightweight transformer-based neural machine translation (NMT) model designed for translating English to Hindi (Devanagari), optimized for efficiency with a 200M parameter size.

About Model

The IndicTrans2 Indic-to-English (200M) model by AI4Bharat is a distilled version of the IndicTrans2 series, designed for high-quality translation from multiple Indic languages into English. This model balances translation accuracy with computational efficiency, making it suitable for real-time applications on resource-constrained devices. IndicTrans2 builds on the strengths of its predecessor, IndicTrans, incorporating improvements in pretraining, tokenization, and model architecture. The 200M parameter version is a compact yet powerful option, optimized to deliver robust multilingual translations while reducing inference latency. It supports a wide range of Indic languages, making it valuable for research, education, and multilingual communication.

AI4Bharat-IndicTrans2 Distilled-200M -English-to-Hindi (Devanagari): Language Translation Model

Metadata Metadata

MIT

Jay Gala and Pranjal A Chitale and A K Raghavan and Varun Gumma and Sumanth Doddapaneni and Aswanth Kumar M and Janki Atul Nawale and Anupama Sujatha and Ratish Puduppully and Vivek Raghavan and Pratyush Kumar and Mitesh M Khapra and Raj Dabre and Anoop Kunchukuttan

text2text-translation

N.A.

Open

AI4Bharat

Sector Agnostic

21/02/25 13:21:18

0

Activity Overview Activity Overview

  • Downloads0
  • Redirect 17
  • File Size 0
  • Views 482

Tags Tags

  • Transformers
  • Translation
  • Multilingual
  • Hindi
  • English
  • Indic-Trans

License Control License Control

MIT

More Models from AI4Bharat More Models from AI4Bharat

AI4Bharat- 500 M - RomanSetu Multilingual Native-to-Roman Model
RomanSetu is a multilingual continual pretrained transformer model designed for transliteration across six Indic languages
Llama
Instruction-Tuning
Multilingual
LLaMA2
  • See Upvoters1
  • Downloads43
  • File Size0
  • Views767
Updated 10 month(s) ago

AI4BHARAT

AI4Bharat- 400 M - RomanSetu Multilingual Native-to-Roman Model
RomanSetu is a multilingual continual pretrained transformer model designed for transliteration across six Indic languages
Multilingual
LLaMA2
Llama
Instruction-Tuning
  • See Upvoters1
  • Downloads75
  • File Size0
  • Views913
Updated 10 month(s) ago

AI4BHARAT

AI4Bharat- Maithili - IndicConformer Automatic Speech Recognition (ASR) Model
This model takes in mono-channel audio files at a 16,000 Hz sampling rate (WAV format) and outputs the transcribed text of the speech contained in the audio.
Automatic Speech Recognition
Speech-to-Text
NLP
  • See Upvoters0
  • Downloads24
  • File Size0
  • Views612
Updated 10 month(s) ago

AI4BHARAT

AI4Bharat- Konkani - IndicConformer Automatic Speech Recognition (ASR) Model
Automatic Speech Recognition (ASR) model for Konkani speech recognition, processing 16,000 KHz mono WAV audio and transcribing spoken content into text
Speech-to-Text
NLP
Automatic Speech Recognition
  • See Upvoters0
  • Downloads29
  • File Size0
  • Views617
Updated 10 month(s) ago

AI4BHARAT

AI4Bharat- Kashmiri - IndicConformer Automatic Speech Recognition (ASR) Model
This Automatic Speech Recognition (ASR) model transcribes Kashmiri speech from 16,000 KHz mono WAV audio files into text
NLP
Speech-to-Text
Kashmiri
Automatic Speech Recognition
  • See Upvoters0
  • Downloads19
  • File Size0
  • Views621
Updated 10 month(s) ago

AI4BHARAT

AI4Bharat - Romansetu-200M -Multilingual LLM for Indian langauges using romanization
RomanSetu is Efficiently unlocking multilingual (Indian Languages) capabilities of Large Language Models via Romanization.
Llama
Instruction-Tuning
Multilingual
LLaMA2
  • See Upvoters0
  • Downloads5
  • File Size0
  • Views246
Updated 10 month(s) ago

AI4BHARAT

AI4Bharat - Romansetu-100M - Multilingual LLM for Indian langauges using romanization
RomanSetu is Efficiently unlocking multilingual (Indian Languages) capabilities of Large Language Models via Romanization.
Multilingual
LLaMA2
Llama
Instruction-Tuning
  • See Upvoters0
  • Downloads9
  • File Size0
  • Views444
Updated 10 month(s) ago

AI4BHARAT

AI4Bharat- Kannada - IndicConformer Automatic Speech Recognition (ASR) Model
This Kannada Automatic Speech Recognition (ASR) model transcribes 16kHz mono-channel audio into text. It utilizes a Conformer-Large architecture with 120M parameters and a hybrid CTC-RNNT decoder for high-accuracy speech recognition.
Automatic Speech Recognition
Audio Processing
NLP
  • See Upvoters0
  • Downloads22
  • File Size0
  • Views632
Updated 10 month(s) ago

AI4BHARAT

AI4Bharat – Romanized Path – Base to Supervised Fine-Tuning (SFT)
Romansetu model is built on base pretrained model which is supervised fine tuned on instuction-following tasks using romanized Indian languages.
Instruction-Tuning
Llama
LLaMA2
Multilingual
  • See Upvoters0
  • Downloads3
  • File Size0
  • Views186
Updated 10 month(s) ago

AI4BHARAT

AI4Bharat-IndicTrans2 Large-1B -English-to-Hindi (Devanagari) – : Language Translation Model
A large-scale neural machine translation (NMT) model for translating English to Hindi (Devanagari) language, leveraging 1 billion parameters for high-quality translations.
Multilingual
Transformer
cross-lingual
high-quality-translation
Large Model
low-resource-NLP
NLP
Machine Translation
  • See Upvoters0
  • Downloads31
  • File Size0
  • Views757
Updated 10 month(s) ago

AI4BHARAT