MultiIndicWikiBioSS is a multilingual sequence-to-sequence model fine-tuned on the IndicWikiBio dataset for nine Indian languages
MultiIndicWikiBioSS is a multilingual, sequence-to-sequence model fine-tuned from an IndicBARTSS checkpoint on the IndicWikiBio dataset, supporting biography generation across nine Indian languages: Hindi, Bengali, Punjabi, Tamil, and Telugu. Unlike models such as mBART50 and mT5, MultiIndicWikiBioSS retains native scripts, removing the need for script mapping to or from Devanagari. It is optimized for efficiency, being significantly smaller and computationally less expensive for fine-tuning and decoding than mBART and mT5-base. Trained on a dataset of 34,653 examples, the model provides strong multilingual capabilities tailored to Indic languages, making it a valuable tool for biography generation tasks.
MIT
Aman Kumar and Himani Shrotriya and Prachi Sahu and Raj Dabre and Ratish Puduppully and Anoop Kunchukuttan and Amogh Mishra and Mitesh M. Khapra and Pratyush Kumar
Summarization Model
N.A.
Open
Sector Agnostic
21/02/25 13:21:56
0
MIT
© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.