
The Odia Language, Culture, Literature, History, and Geography Dataset is a curated collection of Odia text by the Odia Virtual Academy (OVA). It covers Odia language studies, cultural traditions, literary works, regional history, and physical and cultural geography. Designed for language modelling and NLP research, the dataset supports domain adaptation, information extraction, and AI-assisted education across Odia linguistic, cultural, historical, and geographical studies.
The Odia Language, Culture, Literature, History, and Geography Dataset is a meticulously curated collection of Odia text by the Odia Virtual Academy (OVA). It brings together material from Odia linguistics, cultural traditions, literary heritage, historical studies, and regional geography to reflect the breadth and depth of these interconnected domains. Topics include grammar and linguistic analysis, folklore and oral traditions, classical and modern literature, historical movements and personalities, regional geography, cultural practices, art forms, social institutions, and the evolution of Odia identity over time. The dataset also covers scholarly interpretations, archival records, community narratives, ethnographic accounts, geographical descriptions, and traditional knowledge embedded in Odia-speaking communities. It emphasizes domain-relevant terminology, literary language, regional expressions, and discipline-specific vocabularies to support accurate language modelling, information extraction, and domain adaptation in Odia NLP research.
The Odia Language, Culture, Literature, History, and Geography Dataset aims to provide Odia text sourced from digitized literary works, historical documents, cultural records, geographical surveys, and related academic materials. It is suitable for a range of applications, including training language models; building domain-aware NLP tools (such as named-entity recognition for authors, historical figures, places, cultural practices, and geographical features; relation extraction; and multilingual grounding); implementing AI-assisted education and cultural outreach for students and researchers; and enabling content generation that aligns with Odia linguistic, literary, historical, and geographical contexts.
Attribution 4.0 International (CC BY 4.0)
5 files