A large dataset of emails from the Enron Corporation, useful for studying email communication and developing NLP models for email data.
The Enron Email Dataset is a collection of real corporate emails from employees of the Enron Corporation, released as part of legal investigations and curated for research use by Carnegie Mellon University. The dataset contains hundreds of thousands of emails, including internal communication, threads, and metadata. The language reflects professional, semi-formal communication and organizational workflows, offering a rare real-world snapshot of corporate email usage.
This Dataset Is Primarily Used For Research In Email Classification, Communication Analysis, And Natural Language Processing Tasks Involving Enterprise Text. It Supports Studies On Topic Modeling, Social Network Analysis, Information Flow, And Document Classification. For Language Models, It Provides Exposure To Structured Workplace Communication, Helping Improve Performance On Tasks Like Email Summarization, Intent Detection, And Enterprise Search Systems.
Other
© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.