It is a multilingual, sequence-to-sequence pre-trained model fine-tuned on the IndicSentenceSummarization dataset for sentence summarization across 11 Indian languages.
IndicSentenceSummarization is a sequence-to-sequence pre-trained model based on IndicBART, fine-tuned on the IndicSentenceSummarization dataset, supporting 11 Indian languages: Hindi, Marathi, Punjabi, Tamil, Telugu, Bengali, Gujarati etc. Smaller than mBART and mT5, the model is more efficient for decoding. Trained on a large corpus of 431K sentences, it uses Devanagari script for all languages, enhancing transfer learning across related languages and making it more accessible for multilingual summarization tasks.
MIT
Aman Kumar, Himani Shrotriya, Prachi Sahu, Raj Dabre, Ratish Puduppully, Anoop Kunchukuttan, Amogh Mishra, Mitesh M. Khapra, Pratyush Kumar
Text Summarization
N.A.
Open
Sector Agnostic
21/02/25 13:20:57
0
MIT
© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.