Enron Email Analysis

License: MIT

This analysis of the Enron Email Dataset focuses on three tasks: Anomaly Detection, Social Network Analysis, and Email Body Analysis.


Enron Email Dataset

  • The collapse of Enron was one of the largest corporate meltdowns in history. In 2000, Enron was one of the largest energy companies in America. After its fraud was exposed, it spiraled into bankruptcy within a year.

  • The Enron Email Dataset contains about 500,000 emails from roughly 150 former Enron employees, mostly senior executives. It is one of the few large public collections of real emails, which makes it especially valuable for research. The dataset can be found here.
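
As a rough illustration of what one record in such a dataset looks like once parsed, the sketch below reads a single raw message with Python's standard `email` module. The sample message, the addresses in it, and the `parse_email` helper are hypothetical and are not taken from this project's code.

```python
from email import message_from_string
from email.utils import getaddresses, parsedate_to_datetime

# Hypothetical sample message; not an actual email from the dataset.
RAW = """\
Message-ID: <1.example@thyme>
Date: Mon, 14 May 2001 16:39:00 -0700
From: alice@enron.com
To: bob@enron.com, carol@enron.com
Subject: Forecast

Here is the forecast.
"""

def parse_email(raw):
    """Extract the metadata and body used by the tasks in this README."""
    msg = message_from_string(raw)
    sent = parsedate_to_datetime(msg["Date"])
    # getaddresses splits a comma-separated header into (name, address) pairs.
    recipients = [addr for _, addr in getaddresses(msg.get_all("To", []))]
    return {
        "sender": msg["From"],
        "recipients": recipients,
        "hour": sent.hour,  # hour in the sender's stated time zone
        "subject": msg["Subject"],
        "body": msg.get_payload().strip(),
    }

record = parse_email(RAW)
print(record["sender"], record["recipients"], record["hour"])
```

Real messages may be multipart or have missing headers, so a fuller parser would need to handle those cases.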


Tasks performed

  • Anomaly Detection: Map the distribution of emails sent and received by hour and try to detect abnormal behavior leading up to the public scandal.
  • Social Network Analysis: Build a model of communication between employees to find key influencers.
  • Email Analysis: Analyze the message bodies in conjunction with email metadata to classify emails by purpose. A word cloud is also generated from the email content, giving a visual representation of word frequency and highlighting the most-used keywords.
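
The three tasks above can be sketched in miniature with the Python standard library. This is a toy illustration under stated assumptions, not the project's implementation: the `EMAILS` records, the function names, the "rare hour" threshold, the contact-count measure of influence, and the stop-word list are all hypothetical simplifications.

```python
import re
from collections import Counter, defaultdict

# Hypothetical records: (sender, recipient, hour_sent, body)
EMAILS = [
    ("alice", "bob", 9, "Please review the trading report"),
    ("alice", "carol", 9, "Trading report attached"),
    ("bob", "alice", 10, "Meeting moved to Friday"),
    ("carol", "alice", 10, "Report looks fine"),
    ("dave", "alice", 3, "Call me about the report"),
]

STOP_WORDS = {"the", "to", "me", "about", "please"}  # assumed, minimal list

def hourly_counts(emails):
    """Anomaly detection, step 1: emails sent in each hour of the day (0-23)."""
    counts = Counter(hour for _, _, hour, _ in emails)
    return [counts.get(h, 0) for h in range(24)]

def rare_hour_emails(emails, min_share=0.25):
    """Anomaly detection, step 2: emails sent in hours that carry less than
    `min_share` of all traffic (a simple stand-in for a real detector)."""
    counts = hourly_counts(emails)
    total = sum(counts)
    return [e for e in emails if counts[e[2]] / total < min_share]

def contact_counts(emails):
    """Social network analysis: number of distinct correspondents per person,
    treating each email as an undirected link."""
    neighbors = defaultdict(set)
    for sender, recipient, _, _ in emails:
        neighbors[sender].add(recipient)
        neighbors[recipient].add(sender)
    return {person: len(links) for person, links in neighbors.items()}

def word_frequencies(emails):
    """Email analysis: word counts across all bodies, the input to a word cloud."""
    words = Counter()
    for _, _, _, body in emails:
        tokens = re.findall(r"[a-z]+", body.lower())
        words.update(t for t in tokens if t not in STOP_WORDS)
    return words

print(rare_hour_emails(EMAILS))
print(contact_counts(EMAILS))
print(word_frequencies(EMAILS).most_common(3))
```

On real data, the share threshold would be replaced by a proper statistical baseline, and the contact count by a graph-based centrality measure.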

Find the detailed documentation here.


Results

The results of all the tasks can be viewed in the documentation above or by running the code below in Google Colab:


Contributors

Mihir Gandhi - mihir-m-gandhi

Jasdeep Singh Grover - jasdeep100

Hardik Chodvadiya - willyhardik

Jay Gala - jaygala25

Sanjana Joshi - sanjana-j


License

This project is licensed under the MIT License; see the LICENSE file for details.