This analysis of the Enron Email Dataset focusses on 3 tasks: Anomaly Detection, Social Network Analysis, and Email Body Analysis.

mihir-m-gandhi/Enron-Email-Analysis

Enron Email Analysis

License: MIT

This analysis of the Enron Email Dataset focusses on 3 tasks: Anomaly Detection, Social Network Analysis, and Email Body Analysis.


Enron Email Dataset

  • The Enron scandal and collapse was one of the largest corporate meltdowns in history. In the year 2000, Enron was one of the largest energy companies in America. Then, after being outed for fraud, it spiraled downward into bankruptcy within a year.

  • The Enron Email Dataset contains about 500,000 emails from roughly 150 former Enron employees, mostly senior executives. It is one of the few large public collections of real corporate email, which makes it especially valuable for analysis. The dataset can be found here.
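
Each message in the dataset is a plain-text email with standard headers, so the metadata used in the tasks below can be pulled out with Python's standard library. The following is a minimal sketch, not the project's own loader: the sample message is made up for illustration, and the `parse_email` helper and its field names are assumptions.

```python
from email import message_from_string
from email.utils import getaddresses, parsedate_to_datetime

# A made-up message with standard email headers (not a real Enron email).
RAW = """Message-ID: <1.example@enron.com>
Date: Mon, 14 May 2001 16:39:00 -0700
From: alice@enron.com
To: bob@enron.com, carol@enron.com
Subject: Schedule

Please see the attached schedule.
"""


def parse_email(raw):
    """Return sender, recipients, timestamp, subject and body of one raw message.

    Hypothetical helper: assumes a single-part plain-text message.
    """
    msg = message_from_string(raw)
    # getaddresses splits a comma-separated header into (name, address) pairs.
    recipients = [addr for _, addr in getaddresses(msg.get_all("To", []))]
    return {
        "sender": msg["From"],
        "recipients": recipients,
        "sent_at": parsedate_to_datetime(msg["Date"]),
        "subject": msg["Subject"],
        "body": msg.get_payload().strip(),
    }


record = parse_email(RAW)
print(record["sender"], record["recipients"], record["sent_at"].hour)
```

Records in this shape give the hour of day for the anomaly task, the sender and recipient pairs for the network task, and the body text for the content task.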


Tasks performed

  • Anomaly Detection : Map the distribution of emails sent and received by hour and try to detect abnormal behavior leading up to the public scandal.
  • Social Network Analysis : Build a model for communication between employees to find key influencers.
  • Email Analysis : Analyze the message bodies together with the email metadata to classify emails by purpose. A word cloud is also generated from the email content, giving a visual representation of word frequency and highlighting the most-used keywords.
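
The anomaly detection task above can be sketched roughly as follows. This is an illustration under stated assumptions, not the method used in the notebooks: the z-score rule, the threshold of 2.0, the helper names `hourly_counts` and `flag_anomalies`, and all sample numbers are hypothetical.

```python
from collections import Counter
from datetime import datetime
from statistics import mean, pstdev


def hourly_counts(timestamps):
    """Count emails per hour of day; returns a list of 24 integers."""
    counts = Counter(ts.hour for ts in timestamps)
    return [counts.get(hour, 0) for hour in range(24)]


def flag_anomalies(volumes, threshold=2.0):
    """Return indices whose volume lies more than `threshold` standard
    deviations from the mean (a simple z-score rule, assumed here)."""
    mu = mean(volumes)
    sigma = pstdev(volumes)
    if sigma == 0:
        return []  # all values identical, nothing stands out
    return [i for i, v in enumerate(volumes) if abs(v - mu) / sigma > threshold]


# Hypothetical send times for three emails.
timestamps = [
    datetime(2001, 10, 1, 9, 5),
    datetime(2001, 10, 1, 9, 40),
    datetime(2001, 10, 1, 23, 15),
]
by_hour = hourly_counts(timestamps)

# Hypothetical emails-per-day totals with one spike on the last day.
daily_totals = [40, 42, 38, 41, 39, 43, 40, 120]
unusual_days = flag_anomalies(daily_totals)
print(by_hour[9], by_hour[23], unusual_days)
```

A z-score is only one of several possible rules. It is easy to explain, but a single large spike inflates the standard deviation, so a median-based rule may be more robust on real data.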

Find the detailed documentation here.
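
For the social network task listed above, one simple way to rank potential key influencers is degree centrality on a sender-to-recipient graph. The sketch below uses only the standard library. The centrality measure, the helper names, and the sample messages are assumptions for illustration and may differ from the model built in the notebooks.

```python
from collections import defaultdict


def build_graph(messages):
    """Build a weighted directed graph where
    graph[sender][recipient] is the number of emails sent."""
    graph = defaultdict(lambda: defaultdict(int))
    for sender, recipients in messages:
        for recipient in recipients:
            if recipient != sender:  # ignore self-addressed mail
                graph[sender][recipient] += 1
    return graph


def degree_centrality(graph):
    """Score each person by the number of distinct contacts
    (sent to or received from), normalized by n - 1."""
    contacts = defaultdict(set)
    for sender, targets in graph.items():
        for recipient in targets:
            contacts[sender].add(recipient)
            contacts[recipient].add(sender)
    n = len(contacts)
    if n < 2:
        return {person: 0.0 for person in contacts}
    return {person: len(peers) / (n - 1) for person, peers in contacts.items()}


# Hypothetical (sender, recipients) pairs.
messages = [
    ("alice", ["bob", "carol"]),
    ("bob", ["alice"]),
    ("dave", ["alice"]),
    ("carol", ["alice", "bob"]),
]
scores = degree_centrality(build_graph(messages))
top = max(scores, key=scores.get)
print(top, scores[top])
```

Degree centrality only counts direct contacts. Measures such as betweenness or PageRank also account for a person's position in the wider network, at higher computational cost.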


Results

The results of all the tasks can be viewed in the documentation above or by running the code below in Google Colab:


Contributors

Mihir Gandhi - mihir-m-gandhi

Jasdeep Singh Grover - jasdeep100

Hardik Chodvadiya - willyhardik

Jay Gala - jaygala25

Sanjana Joshi - sanjana-j


License

This project is licensed under the MIT License. See the LICENSE file for details.
