Indian Flag
Government Of India
A-
A
A+

IndicXlit

A Transformer-based multilingual transliteration model

About Model

Bhashini - IndicXlit is a Transformer-based multilingual transliteration model, trained on Aksharantar dataset which is the largest publicly available parallel transliteration corpora collection for Indic languages at the time of writing (20 May 2022). It is used to convert any roman text written in Indian language (like Hinglish) to the native Indic-script (like Devanagari for Hindi). It supports 21 Indic languages: Assamese, Bangla, Bodo, Gujarati, Hindi, Kannada, Kashmiri, Konkani, Maithili, Malayalam, Manipuri, Marathi, Nepali, Oriya, Panjabi, Sanskrit, Sindhi, Sinhala, Tamil, Telugu, Urdu.

IndicXlit

Metadata Metadata

MIT

AI4Bharat

Machine Translation Model

Other

Open

Sector Agnostic

05/03/25 15:25:05

Admin

3.94 MB

Activity Overview Activity Overview

  • Downloads0
  • Downloads 20
  • Views 619
  • File Size 3.94 MB

Tags Tags

  • Language Modeling
  • Multilingual Translation
  • Machine Translation
  • Regional Languages
  • Indian Languages
  • NLP
  • transliteration

License Control License Control

MIT

Version Control Version Control

FolderVersion 1(3.94 MB)
  • admin·1 year(s) ago
    • chevron_rightFolder
      IndicXlit-master
      • chevron_rightFolder
        ablation_study
      • chevron_rightFolder
        app
      • chevron_rightFolder
        Checker
      • chevron_rightFolder
        corpus_preprocessing
      • chevron_rightFolder
        data_mining
      • chevron_rightFolder
        Dataset_Format
      • chevron_rightFolder
        inference
      • chevron_rightFolder
        model_training_scripts
      • undefined
        .gitignore
      • undefined
        LICENSE
      • more_horiz 5 more

More Models from TechCorp More Models from TechCorp

SPRING-INX-DATA2VEC-AQC-BENGALI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
IITM
spring_lab
Data2vec_aqc
ssl
low-resource-languages
SSL_finetunning
bengali
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views55
Updated 23 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-BODO
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
Data2vec_aqc
ssl
IITM
spring_lab
SSL_finetunning
low-resource-language
BODO
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views62
Updated 23 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-BHOJPURI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
SSL_finetunning
Bhojpuri
ssl
IITM
spring_lab
Data2vec_aqc
low-resource-language
  • See Upvoters0
  • Downloads1
  • File Size3.52 GB
  • Views57
Updated 23 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-MALAYALAM
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
ssl
malayalam
IITM
spring_lab
Data2vec_aqc
SSL_finetunning
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views44
Updated 23 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-KANNADA
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
IITM
low-resource-language
SSL_finetunning
Data2vec_aqc
kannada
spring_lab
ssl
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views48
Updated 23 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-MARATHI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
Marathi
low-resource-language
SSL_finetunning
Data2vec_aqc
spring_lab
IITM
ssl
  • See Upvoters0
  • Downloads1
  • File Size3.52 GB
  • Views54
Updated 23 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-SANSKRIT
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
ssl
IITM
spring_lab
Sanskrit
Data2vec_aqc
SSL_finetunning
  • See Upvoters0
  • Downloads1
  • File Size3.52 GB
  • Views50
Updated 23 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-PUNJABI
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
low-resource-language
ssl
IITM
spring_lab
PUNJABI
Data2vec_aqc
SSL_finetunning
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views42
Updated 23 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING-INX-DATA2VEC-AQC-ODIA
Automatic Speech Recognition (ASR) model for speech recognition, processing audio and transcribing spoken content into text.
spring_lab
Odia
ssl
IITM
Data2vec_aqc
SSL_finetunning
low-resource-language
  • See Upvoters0
  • Downloads0
  • File Size3.52 GB
  • Views45
Updated 23 day(s) ago

DIGITAL INDIA BHASHINI DIVISION

SPRING LAB TAMIL-STREAMING
Automatic Speech Recognition (ASR) model for Tamil speech recognition, processing audio and transcribing spoken content into text.
Icefall-K2
ASR
tamil
IITM
spring_lab
streaming
MODELS
zipformer
  • See Upvoters0
  • Downloads8
  • File Size260.42 MB
  • Views179
Updated 1 month(s) ago

DIGITAL INDIA BHASHINI DIVISION