MultiIndicHeadlineGeneration is a multilingual sequence-to-sequence model, fine-tuned from the IndicBART checkpoint, for headline generation and summarization across 11 Indian languages.
MultiIndicHeadlineGeneration is a sequence-to-sequence model covering 11 Indian languages, including Hindi, Marathi, Punjabi, Tamil, Telugu, Bengali, and Gujarati. Fine-tuned from the IndicBART checkpoint, it is suited to natural language generation applications such as headline generation, summarization, and related tasks. It is smaller than mBART and mT5, which makes it less computationally expensive to fine-tune and decode. The model was trained on a corpus of 1.316 million paragraphs and 5.9 million unique tokens, with all languages represented in Devanagari script to support transfer learning among related languages.
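As a rough illustration of how such a model might be used for headline generation, the sketch below loads it through the Hugging Face `transformers` library. Several details are assumptions that do not appear in this listing: the model ID `ai4bharat/MultiIndicHeadlineGeneration`, the IndicBART-style input format (`text </s> <2xx>`), the language tags, and the generation settings. The helper names `build_model_input` and `generate_headline` are hypothetical. Because all languages are represented in Devanagari script, text in other scripts would likely need to be converted to Devanagari first; that step is not shown.

```python
# Sketch only: model ID, tag format, and generation settings are assumptions,
# not details confirmed by this listing.

# Assumed language tags for the languages named in the description.
LANG_TAGS = {
    "Hindi": "<2hi>",
    "Marathi": "<2mr>",
    "Punjabi": "<2pa>",
    "Tamil": "<2ta>",
    "Telugu": "<2te>",
    "Bengali": "<2bn>",
    "Gujarati": "<2gu>",
}


def build_model_input(text: str, language: str) -> str:
    """Append the end-of-sentence marker and language tag (assumed format)."""
    return f"{text.strip()} </s> {LANG_TAGS[language]}"


def generate_headline(
    text: str,
    language: str = "Hindi",
    model_id: str = "ai4bharat/MultiIndicHeadlineGeneration",  # assumed ID
    max_length: int = 32,
) -> str:
    """Generate a headline for `text`; downloads the model on first call."""
    # Imported lazily so the formatting helper works without transformers.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(
        model_id, do_lower_case=False, use_fast=False, keep_accents=True
    )
    model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

    # The markers are already in the string, so no extra special tokens.
    inputs = tokenizer(
        build_model_input(text, language),
        add_special_tokens=False,
        return_tensors="pt",
    )
    output_ids = model.generate(
        inputs.input_ids,
        num_beams=4,
        max_length=max_length,
        early_stopping=True,
        # Decoding is assumed to start from the target-language tag.
        decoder_start_token_id=tokenizer.convert_tokens_to_ids(LANG_TAGS[language]),
    )
    return tokenizer.decode(
        output_ids[0], skip_special_tokens=True, clean_up_tokenization_spaces=False
    )
```

Calling `generate_headline(article_text, "Hindi")` would return a single headline string, provided the assumptions above hold. Beam search with four beams is an arbitrary starting point rather than a recommended setting.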
MIT
Aman Kumar, Himani Shrotriya, Prachi Sahu, Raj Dabre, Ratish Puduppully, Anoop Kunchukuttan, Amogh Mishra, Mitesh M. Khapra, Pratyush Kumar
Text Summarization
N.A.
Open
Sector Agnostic
21/02/25 13:20:53
0
MIT
© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.