The goal of the project is to leverage data-driven insights to inform strategic decisions regarding film production. Microsoft aims at using various data provided so as to explore what types of films are currently doing the best at the box office.
Importance of Data-Driven Decision-Making:
- In today's competitive entertainment industry, understanding audience preferences and market trends is crucial for success.
- By analyzing comprehensive data from various sources, we can identify patterns, anticipate audience demand, and optimize our film production strategy
- Ultimately, data-driven decision-making enables us to mitigate risks, maximize returns, and enhance the overall competitiveness of Microsoft's new movie studio
Microsoft is venturing into the movie industry with the establishment of its new movie studio. However, lacking expertise in the field, Microsoft faces the challenge of determining which film genres are most likely to yield success at the box office. Without this critical insight, Microsoft risks investing resources in projects that may not resonate with audiences or generate significant returns.
Our goal is to assist Microsoft's new movie studio in understanding the current landscape of the film industry to make informed decisions about the types of films to produce.
Executives, Marketing teams, Production teams, Data Analysts, Customers
- What genres of movies are currently popular at the box office?
- What factors contribute to the success of a movie?
- What audience demographics should Microsoft target?
Our analysis utilizes data from various sources, including IMDb,TheMovieDB, and The Numbers. These sources provide comprehensive information on box office performance, audience ratings, critical reviews, and film metadata.
- IMDb furnishes comprehensive information encompassing movie titles, genres, release dates, and details regarding cast and crew. Within its database, the movie_basics table offers specifics on titles, genres, and release years, while the movie_ratings table houses IMDb user ratings and reviews.
- TheMovieDB serves as an extensive repository containing details about movies and TV series, encompassing data such as titles, genres, release dates, vote averages, vote counts, and official languages. This dataset enhances our analysis by offering supplementary metadata and promotional content for films, aiding in the comprehension of their production and promotional tactics.
- The Numbers is a movie industry data website that offers detailed box office revenue information, including domestic and international earnings, production budgets, movie titles,release dates and marketing expenses.
We read the data from the various datasets We then managed to clean and analyze the data by removing null columns and rows. We then looked at the number of the final rows and columns of each dataset we used, checked the information that the dataset has, and also described the data to have the count, mean and standard deviation of the various datasets. Through our data analysis, we gained valuable insights into audience preferences, box office trends, and financial metrics, which will inform our recommendations for Microsoft's new movie studio.