MultiIndicWikiBioUnified is a multilingual, sequence-to-sequence pre-trained model fine-tuned on the IndicWikiBio dataset, enabling biography generation across multiple Indian languages.
The model is based on IndicBART and fine-tuned on the IndicWikiBio dataset of 34,653 examples. It supports nine Indian languages, including Hindi, Marathi, Punjabi, Tamil, Telugu, Bengali, and Gujarati. Because it is smaller than mBART and mT5, it is less computationally expensive to fine-tune and to decode. All data is represented in Devanagari script to encourage transfer learning among related languages.
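The single-script representation mentioned above can be illustrated with a minimal sketch. It assumes a simple offset-based mapping between Unicode Indic script blocks, which are laid out in parallel and span 128 code points each. The function name `to_devanagari` and the table `SCRIPT_BLOCK_START` are hypothetical names chosen for this illustration. This is not the authors' preprocessing code, which may rely on a dedicated transliteration library and handle script-specific characters more carefully.

```python
# Illustrative sketch only: offset-based mapping of Indic scripts to Devanagari.
# Assumption: characters are mapped by their position within the Unicode block,
# which works for most letters but not for every script-specific character.

# First code point of several Unicode Indic script blocks.
SCRIPT_BLOCK_START = {
    "devanagari": 0x0900,
    "bengali": 0x0980,
    "gurmukhi": 0x0A00,
    "gujarati": 0x0A80,
    "oriya": 0x0B00,
    "tamil": 0x0B80,
    "telugu": 0x0C00,
    "kannada": 0x0C80,
    "malayalam": 0x0D00,
}
BLOCK_SIZE = 0x80  # each of these blocks spans 128 code points


def to_devanagari(text: str, source_script: str) -> str:
    """Shift characters from the source script block into the Devanagari block."""
    start = SCRIPT_BLOCK_START[source_script]
    target = SCRIPT_BLOCK_START["devanagari"]
    out = []
    for ch in text:
        cp = ord(ch)
        if start <= cp < start + BLOCK_SIZE:
            # Same offset within the block, different block base.
            out.append(chr(cp - start + target))
        else:
            # Spaces, digits, Latin text, and punctuation pass through unchanged.
            out.append(ch)
    return "".join(out)


if __name__ == "__main__":
    # Bengali letter KA (U+0995) maps to Devanagari letter KA (U+0915).
    print(to_devanagari("\u0995", "bengali"))
```

A pure offset mapping is a deliberate simplification: it keeps the sketch dependency-free, but some characters have no clean counterpart in Devanagari, so a production pipeline would need per-script exceptions.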
MIT
Aman Kumar, Himani Shrotriya, Prachi Sahu, Raj Dabre, Ratish Puduppully, Anoop Kunchukuttan, Amogh Mishra, Mitesh M. Khapra, Pratyush Kumar
Text Generation
N.A.
Open
Sector Agnostic
21/02/25 13:20:59