Skip to content

A project utilizing data sets from various popular movie studios to provide actionable insights to a new upcoming one. Focuses on data cleaning and preparation as well as visualisations

License

Notifications You must be signed in to change notification settings

Sandrakiptumm/DSC-Phase-One-Project

Repository files navigation

Film Studio Industry Analysis for Microsoft

Overview

This repository contains the code for a data science project analyzing various aspects of the film studio industry. The project aims to provide insights into factors influencing the success of movies, such as production costs, revenue, popularity, directorial experience, genre, and ratings.

Table of Contents

  1. Introduction
  2. Business Understanding
  3. Data
  4. Analysis
  5. Results
  6. Conclusion

Introduction

In this project, we utilize data science techniques to address real-world problems faced by stakeholders in the film industry. By analyzing historical data on film production, performance, and audience reception, we aim to provide actionable insights that can inform decision-making and improve outcomes within the industry.

Business Understanding

Our analysis focuses on several key questions relevant to stakeholders in the film industry, including:

  • Is there a correlation between production costs and revenue generated?
  • How does the popularity of a film relate to its ratings?
  • What impact does a director's experience have on the success of their films?
  • Is there a relationship between genre and audience ratings?

Data

We utilize publicly available datasets containing information on film production, box office performance, audience ratings, directorial credits, and genre classifications. The data is sourced from reputable sources such as IMDb, The Movie Database (TMDb), and Box Office .

Analysis

Our analysis involves exploratory data analysis (EDA) and data visualization techniques to uncover patterns, correlations, and trends within the data. We employ tools such as Python programming language, pandas, NumPy, matplotlib for data processing, analysis, and visualization.

Results

The results of our analysis provide valuable insights into the factors influencing the success of films in terms of revenue, audience reception, and directorial experience. These insights can be used by stakeholders such as production companies, directors and investors to make informed decisions and optimize their strategies within the industry.

Conclusion

In conclusion, this data science project offers actionable insights into the complexities of the film industry, addressing real-world problems and providing value to stakeholders. By leveraging data-driven approaches, we aim to contribute to the continued growth and success of the film industry.

About

A project utilizing data sets from various popular movie studios to provide actionable insights to a new upcoming one. Focuses on data cleaning and preparation as well as visualisations

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published