Indian Flag
Government Of India
A-
A
A+

AI4Bharat-IndicTrans2 -Large-1B -Hindi (Devanagari)-to-English: Language Translation Model

A large-scale neural machine translation (NMT) model for translating Hindi (Devanagari) to English, utilizing 1 billion parameters for high-quality and context-aware translations.

About Model

The IndicTrans2 Indic-to-English (1B) model by AI4Bharat is an advanced transformer-based neural machine translation (NMT) model designed to translate text from Hindi (Devanagari) to English. With 1 billion parameters, this model ensures high fluency, contextual accuracy, and improved handling of low-resource languages. It builds upon previous IndicTrans versions with enhanced multilingual training, improved tokenization, and better handling of linguistic diversity. This model is useful for content translation, academic research, multilingual applications, and localization efforts where accurate Indic-to-English translation is required.

AI4Bharat-IndicTrans2 -Large-1B -Hindi (Devanagari)-to-English: Language Translation Model

Metadata Metadata

MIT

Jay Gala and Pranjal A Chitale and A K Raghavan and Varun Gumma and Sumanth Doddapaneni and Aswanth Kumar M and Janki Atul Nawale and Anupama Sujatha and Ratish Puduppully and Vivek Raghavan and Pratyush Kumar and Mitesh M Khapra and Raj Dabre and Anoop Kunchukuttan

text2text-translation

N.A.

Open

AI4Bharat

Sector Agnostic

21/02/25 13:21:22

0

Activity Overview Activity Overview

  • Downloads0
  • Redirect 18
  • Views 541
  • File Size 0

Tags Tags

  • Machine Translation
  • Hindi-to-English
  • Multilingual
  • NLP
  • Transformer
  • Large Model
  • high-quality-translation
  • cross-lingual
  • low-resource-NLP

License Control License Control

MIT

More Models from AI4Bharat More Models from AI4Bharat

AI4Bharat- 500 M - RomanSetu Multilingual Native-to-Roman Model
RomanSetu is a multilingual continual pretrained transformer model designed for transliteration across six Indic languages
Instruction-Tuning
LLaMA2
Multilingual
Llama
  • See Upvoters1
  • Downloads39
  • File Size0
  • Views640
Updated 8 month(s) ago

AI4BHARAT

AI4Bharat- 400 M - RomanSetu Multilingual Native-to-Roman Model
RomanSetu is a multilingual continual pretrained transformer model designed for transliteration across six Indic languages
Multilingual
Llama
Instruction-Tuning
LLaMA2
  • See Upvoters1
  • Downloads68
  • File Size0
  • Views788
Updated 8 month(s) ago

AI4BHARAT

AI4Bharat- Maithili - IndicConformer Automatic Speech Recognition (ASR) Model
This model takes in mono-channel audio files at a 16,000 Hz sampling rate (WAV format) and outputs the transcribed text of the speech contained in the audio.
Automatic Speech Recognition
Speech-to-Text
NLP
  • See Upvoters0
  • Downloads21
  • File Size0
  • Views486
Updated 8 month(s) ago

AI4BHARAT

AI4Bharat- Konkani - IndicConformer Automatic Speech Recognition (ASR) Model
Automatic Speech Recognition (ASR) model for Konkani speech recognition, processing 16,000 KHz mono WAV audio and transcribing spoken content into text
Speech-to-Text
NLP
Automatic Speech Recognition
  • See Upvoters0
  • Downloads24
  • File Size0
  • Views519
Updated 8 month(s) ago

AI4BHARAT

AI4Bharat- Kashmiri - IndicConformer Automatic Speech Recognition (ASR) Model
This Automatic Speech Recognition (ASR) model transcribes Kashmiri speech from 16,000 KHz mono WAV audio files into text
Kashmiri
Speech-to-Text
NLP
Automatic Speech Recognition
  • See Upvoters0
  • Downloads17
  • File Size0
  • Views514
Updated 8 month(s) ago

AI4BHARAT

AI4Bharat - Romansetu-200M -Multilingual LLM for Indian langauges using romanization
RomanSetu is Efficiently unlocking multilingual (Indian Languages) capabilities of Large Language Models via Romanization.
Instruction-Tuning
LLaMA2
Llama
Multilingual
  • See Upvoters0
  • Downloads5
  • File Size0
  • Views200
Updated 8 month(s) ago

AI4BHARAT

AI4Bharat - Romansetu-100M - Multilingual LLM for Indian langauges using romanization
RomanSetu is Efficiently unlocking multilingual (Indian Languages) capabilities of Large Language Models via Romanization.
Llama
Multilingual
Instruction-Tuning
LLaMA2
  • See Upvoters0
  • Downloads8
  • File Size0
  • Views324
Updated 8 month(s) ago

AI4BHARAT

AI4Bharat- Kannada - IndicConformer Automatic Speech Recognition (ASR) Model
This Kannada Automatic Speech Recognition (ASR) model transcribes 16kHz mono-channel audio into text. It utilizes a Conformer-Large architecture with 120M parameters and a hybrid CTC-RNNT decoder for high-accuracy speech recognition.
Automatic Speech Recognition
Audio Processing
NLP
  • See Upvoters0
  • Downloads18
  • File Size0
  • Views487
Updated 8 month(s) ago

AI4BHARAT

AI4Bharat – Romanized Path – Base to Supervised Fine-Tuning (SFT)
Romansetu model is built on base pretrained model which is supervised fine tuned on instuction-following tasks using romanized Indian languages.
LLaMA2
Instruction-Tuning
Multilingual
Llama
  • See Upvoters0
  • Downloads3
  • File Size0
  • Views149
Updated 8 month(s) ago

AI4BHARAT

AI4Bharat-IndicTrans2 Large-1B -English-to-Hindi (Devanagari) – : Language Translation Model
A large-scale neural machine translation (NMT) model for translating English to Hindi (Devanagari) language, leveraging 1 billion parameters for high-quality translations.
Machine Translation
Transformer
low-resource-NLP
high-quality-translation
Large Model
cross-lingual
NLP
Multilingual
  • See Upvoters0
  • Downloads27
  • File Size0
  • Views591
Updated 8 month(s) ago

AI4BHARAT