Dataset of tribes and sub-tribes across Northeast India with details on regions, clans, languages, and linguistic families.
This dataset provides a structured compilation of tribes and sub-tribes across the eight states of Northeast India: Arunachal Pradesh, Assam, Manipur, Meghalaya, Mizoram, Nagaland, Tripura, and Sikkim. Each entry includes the state, tribe, sub-tribes or clans, regional distribution, languages spoken, and linguistic family classification. The dataset has been compiled from publicly available sources such as the Census of India, Ministry of Tribal Affairs documents, ethnographic studies, and community references. It is intended as a reference resource for cultural studies, linguistic analysis, education, and computational applications including natural language processing.
Attribution 4.0 International (CC BY- 4.0)
© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.