Skip to content

Harry-Atulah/dsc-phase-1-project-v2-4

 
 

Repository files navigation

Title:Data Analysis To Guide Microsoft New Movie Studio

Overview

The goal of the project is to leverage data-driven insights to inform strategic decisions regarding film production. Microsoft aims at using various data provided so as to explore what types of films are currently doing the best at the box office.

Importance of Data-Driven Decision-Making:

  • In today's competitive entertainment industry, understanding audience preferences and market trends is crucial for success.
  • By analyzing comprehensive data from various sources, we can identify patterns, anticipate audience demand, and optimize our film production strategy
  • Ultimately, data-driven decision-making enables us to mitigate risks, maximize returns, and enhance the overall competitiveness of Microsoft's new movie studio

Business Understanding

Problem Statement

Microsoft is venturing into the movie industry with the establishment of its new movie studio. However, lacking expertise in the field, Microsoft faces the challenge of determining which film genres are most likely to yield success at the box office. Without this critical insight, Microsoft risks investing resources in projects that may not resonate with audiences or generate significant returns.

Objective

Our goal is to assist Microsoft's new movie studio in understanding the current landscape of the film industry to make informed decisions about the types of films to produce.

Stakeholders

Executives, Marketing teams, Production teams, Data Analysts, Customers

Business Questions

  • What genres of movies are currently popular at the box office?
  • What factors contribute to the success of a movie?
  • What audience demographics should Microsoft target?

Data Understanding

Dataset

Our analysis utilizes data from various sources, including IMDb,TheMovieDB, and The Numbers. These sources provide comprehensive information on box office performance, audience ratings, critical reviews, and film metadata.

Overview Of Dataset

  • IMDb furnishes comprehensive information encompassing movie titles, genres, release dates, and details regarding cast and crew. Within its database, the movie_basics table offers specifics on titles, genres, and release years, while the movie_ratings table houses IMDb user ratings and reviews.
  • TheMovieDB serves as an extensive repository containing details about movies and TV series, encompassing data such as titles, genres, release dates, vote averages, vote counts, and official languages. This dataset enhances our analysis by offering supplementary metadata and promotional content for films, aiding in the comprehension of their production and promotional tactics.
  • The Numbers is a movie industry data website that offers detailed box office revenue information, including domestic and international earnings, production budgets, movie titles,release dates and marketing expenses.

Data Analysis

We read the data from the various datasets We then managed to clean and analyze the data by removing null columns and rows. We then looked at the number of the final rows and columns of each dataset we used, checked the information that the dataset has, and also described the data to have the count, mean and standard deviation of the various datasets. Through our data analysis, we gained valuable insights into audience preferences, box office trends, and financial metrics, which will inform our recommendations for Microsoft's new movie studio.

Recommendations

image image image image image

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%