PoliWAM is a large-scale corpus of WhatsApp political discussions collected during the Indian General Elections 2019. It consists of both raw and annotated data, enabling research in political discourse, misinformation, propaganda, and multilingual code-mixing.
Dataset Usage Disclaimer ‼️
This dataset contains real-world political discussions collected from public WhatsApp groups. It may include biased, offensive, or potentially harmful content, such as hate speech, misinformation, or political propaganda. The dataset is released strictly for academic research and should be used with appropriate ethical safeguards and content moderation strategies.
Citation:
If you use this dataset, please cite the following work:
@inproceedings{srivastava-singh-2021-poliwam, title = "{P}oli{WAM}: An Exploration of a Large Scale Corpus of Political Discussions on {W}hats{A}pp Messenger", author = "Srivastava, Vivek and Singh, Mayank", editor = "Xu, Wei and Ritter, Alan and Baldwin, Tim and Rahimi, Afshin", booktitle = "Proceedings of the Seventh Workshop on Noisy User-generated Text (W-NUT 2021)", month = nov, year = "2021", address = "Online", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2021.wnut-1.15/", doi = "10.18653/v1/2021.wnut-1.15", pages = "120--130" }
Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.