A transformer-based model pre-trained on multiple Indian languages using Masked Language Modeling and Sampling-based Translation Language Modeling techniques.
IndicBERTv2-MLM-Sam-TLM is a multilingual language model pre-trained on multiple Indian languages using Masked Language Modeling (MLM) and Sampling-based Translation Language Modeling (Sam-TLM) techniques. This training regimen enables the model to effectively handle translation tasks and understand semantic relationships across diverse Indian languages.
MIT
Doddapaneni, Sumanth and Aralikatte, Rahul and Ramesh, Gowtham and Goyal, Shreya and Khapra, Mitesh M. and Kunchukuttan, Anoop and Kumar, Pratyush.
Multilingual Language Model
N.A.
Open
Sector Agnostic
21/02/25 13:21:07
0
MIT
© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.