Indian Flag
Government Of India
A-
A
A+
ORGANISATION
IndicParam

IndicParam

IndicParam is a graduate-level benchmark of 13,207 MCQs from UGC-NET exams, covering 11 Indic languages and a Sanskrit–English code-mixed subset. It evaluates LLMs on low-resource languages across native scripts, measuring linguistic understanding and domain knowledge. With diverse question formats, it enables fine-grained analysis and highlights gaps in multilingual and cross-lingual performance. Paper: https://arxiv.org/pdf/2512.00333

About Dataset

Paper: https://arxiv.org/pdf/2512.00333 IndicParam is a large-scale, graduate-level benchmark dataset designed to evaluate the performance of Large Language Models (LLMs) on low-resource and extremely low-resource Indic languages. The dataset consists of 13,207 multiple-choice questions (MCQs) collected from official UGC-NET language examination papers and their corresponding answer keys. These questions span 11 Indic languages—Nepali, Marathi, Gujarati, Odia, Maithili, Konkani, Santali, Bodo, Dogri, Rajasthani, and Sanskrit along with an additional Sanskrit–English code-mixed subset. Each data instance represents a single MCQ and includes: - A question in the native script of the target language - Four answer options (A–D) - The correct answer label - Metadata such as subject, exam name, and question type The dataset covers a wide range of question formats, including: - Standard multiple-choice questions - Assertion–Reason - List Matching - Fill in the Blanks - Identify Incorrect Statement - Ordering IndicParam is specifically structured to evaluate both: - Language Understanding (LU): linguistic knowledge such as grammar, syntax, and semantics - General Knowledge (GK): domain knowledge including literature, history, and cultural context All questions are preserved in their original scripts (Devanagari, Gujarati, Odia, and Ol Chiki), ensuring authentic evaluation of multilingual capabilities without reliance on transliteration. The dataset is released as a single test split (13,207 samples) and is intended exclusively for evaluation purposes, enabling standardized and reproducible benchmarking of LLMs across diverse Indic languages. Overall, IndicParam provides a comprehensive and challenging evaluation suite for measuring multilingual understanding, cross-lingual generalization, and cultural competence in modern language models.

Purpose of Dataset

The Primary Purpose Of Indicparam Is To Provide A Rigorous, Standardized Benchmark For Evaluating Large Language Models (Llms) On Low- And Extremely Low-resource Indic Languages, Addressing The Gap Where Models Perform Well On High-resource Languages But Struggle To Generalize. Indicparam Enables Evaluation Of Both Language Understanding (Morphology, Syntax, Semantics, Discourse) And Domain Knowledge (Literature, Culture, History). Through Diverse Mcq Formats Such As Normal Mcqs, Assertion–reason, List Matching, Fill In The Blanks, And Ordering, It Supports Fine-grained Analysis Beyond Simple Question Answering. Covering 11 Indic Languages And A Sanskrit–english Code-mixed Variant, The Dataset Allows Per-language Benchmarking And Comparison Across Scripts And Linguistic Settings, Helping Identify Disparities Between Low- And Extremely Low-resource Languages. Since All Questions Are Presented In Native Scripts, It Evaluates True Multilingual Capability Without Reliance On Transliteration. As A Test-only Benchmark With Deterministic Evaluation And Accuracy-based Metrics, Indicparam Ensures Standardized And Reproducible Comparisons Across Models. It Also Highlights Limitations In Cross-lingual Transfer From High-resource Languages Like English. Overall, Indicparam Aims To Drive The Development Of More Inclusive, Robust, And Culturally Grounded Ai Systems, While Informing Future Multilingual Pretraining, Data Collection, And Evaluation Strategies.

Activity Overview Activity Overview

  • Downloads0
  • Downloads 0
  • File Size 2.62 MB
  • Views 26

Tags Tags

  • Nepali
  • Odia
  • Indian Languages
  • Sanskrit
  • Gujarati
  • Marathi
  • Hindi
  • Santali
  • Maithili
  • Konkani
  • Indic Languages
  • LLM Evaluation
  • Dogri
  • Bodo
  • LLM Benchmark
  • Low Resource Languages
  • Multilingual NLP
  • Language Evaluation
  • Question Answering
  • MCQ Dataset
  • Linguistic Analysis
  • Cross-lingual Transfer
  • AI Benchmark
  • NLP Benchmark
  • Reasoning Dataset
  • UGC NET Questions
  • Academic QA
  • Sanskrit–English

License Control License Control

Attribution-Non-Commercial 4.0 International (CC BY-NC 4.0)

No Record(s) Found

Select a file to preview its contents.

Data Quality Score BetaData Quality Score Beta

Version Control Version Control

FolderVersion 1(2.62 MB)
  • Vivekkumar Vasudevbhai Patel· Today
    • undefined
      IndicParam.parquet

Related Datasets Related Datasets

Updated 6 month(s) ago
BharatGen : MHQA Dataset
BharatGen : MHQA Dataset
Information
MHQA: A Mental Health Solution for Healthcare
HealthCare
AI in healthcare
  • See Upvoters0
  • Downloads274
  • File Size44.06 MB
  • Views3,886

BHARATGEN

Updated 6 month(s) ago
BhashaBench-Finance
BhashaBench-Finance
Information-
BhashaBench-Finance (BBF): Benchmarking AI on Indian Financial Knowledge
library:pandas
language:en
modality:text
library:datasets
region:us
library:polars
library:mlcroissant
format:parquet
license:cc-by-4.0
size_categories:10K<n<100K
source_datasets:original
task_categories:multiple-choice
task_categories:question-answering
arxiv:2510.25409
language:hi
  • See Upvoters1
  • Downloads129
  • File Size0
  • Views948

BHARATGEN

Updated 6 month(s) ago
BhashaBench-Krishi
BhashaBench-Krishi
Information-
BhashaBench-Krishi (BBK): Benchmarking AI on Indian Agricultural Knowledge
language:hi
arxiv:2510.25409
task_categories:question-answering
task_categories:multiple-choice
source_datasets:original
size_categories:10K<n<100K
license:cc-by-4.0
format:parquet
library:mlcroissant
library:polars
region:us
library:datasets
modality:text
language:en
library:pandas
  • See Upvoters0
  • Downloads31
  • File Size0
  • Views193

BHARATGEN

Updated 6 month(s) ago
BhashaBench-Legal
BhashaBench-Legal
Information-
BhashaBench-Legal (BBL): Benchmarking AI on Indian Legal Knowledge
library:pandas
language:hi
language:en
modality:text
library:datasets
region:us
library:polars
library:mlcroissant
format:parquet
license:cc-by-4.0
size_categories:10K<n<100K
source_datasets:original
task_categories:multiple-choice
task_categories:question-answering
arxiv:2510.25409
  • See Upvoters1
  • Downloads98
  • File Size0
  • Views954

BHARATGEN

Updated 6 month(s) ago
BhashaBench-Ayur
BhashaBench-Ayur
Information-
BhashaBench-Ayur (BBA): Pioneering India’s Ayurvedic AI Benchmark
library:pandas
arxiv:2510.25409
task_categories:question-answering
task_categories:multiple-choice
source_datasets:original
size_categories:10K<n<100K
license:cc-by-4.0
format:parquet
library:mlcroissant
library:polars
region:us
library:datasets
modality:text
language:en
language:hi
  • See Upvoters0
  • Downloads51
  • File Size0
  • Views296

BHARATGEN

Related Models Related Models

BharatGen - Param 1 Indic-Scale Bilingual Foundation Model
Param1 is a 2.9 billion parameter language model pretrained on English and Hindi, designed for text completion.
Large Language Model
  • See Upvoters4
  • Downloads708
  • File Size13.79 GB
  • Views20,245
Updated 1 month(s) ago

BHARATGEN

Param2-17B-Thinking
BharatGen presents Param-2-17B-MoE-A2.4B, a large-scale Mixture-of-Experts (MoE) language model designed to deliver high model capacity while retaining the inference efficiency of a much smaller dense model. It uses a Hybrid MoE architecture with 17B total parameters, while activating only 2.4B parameters per token.
Mixture of Experts
Multilingual Text
pretrained
  • See Upvoters1
  • Downloads60
  • File Size57.29 GB
  • Views2,021
Updated 3 month(s) ago

BHARATGEN

BharatGen - Param-1-7B-MoE Advancing Multilingual GenAI for India
Param-1-7B-MoE is a multilingual large language model developed under the Param-1 family as part of BharatGen – A Suite of Generative AI Technologies for India. With 7 billion parameters and a Mixture of Experts (MoE) architecture, the model is designed to better understand and generate text across English, Hindi, and 14 additional Indian languages. The model is pretrained from scratch with a strong focus on linguistic diversity, cultural context, and large-scale multilingual representation.
safetensors
region:us
mixtral
  • See Upvoters1
  • Downloads83
  • File Size0
  • Views1,394
Updated 5 month(s) ago

BHARATGEN