
saba99/Video-Scraper


Web Scraping Popular YouTube Tech Channels with Selenium

Data Mining, Data Wrangling, and Exploratory Data Analysis


About the Data

The data was scraped from the top 10 tech channels on YouTube using Selenium, a browser automation tool controlled from Python that is commonly used for web scraping and web testing.
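
As a rough illustration of the scraping step, the sketch below opens a channel's Videos tab in a headless browser and collects the visible video titles. The URL pattern, the `#video-title` CSS selector, and the function names `channel_videos_url` and `scrape_video_titles` are assumptions made for this example and are not taken from this repository. YouTube's markup changes often, so the selector may need adjusting.

```python
from urllib.parse import quote


def channel_videos_url(handle: str) -> str:
    """Build the (assumed) URL of a channel's Videos tab from its handle."""
    return f"https://www.youtube.com/@{quote(handle.lstrip('@'))}/videos"


def scrape_video_titles(handle: str, max_titles: int = 20) -> list[str]:
    """Open the channel page in headless Chrome and return visible video titles."""
    # Imported here so the URL helper above works without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.common.by import By
    from selenium.webdriver.support.ui import WebDriverWait

    options = webdriver.ChromeOptions()
    options.add_argument("--headless=new")
    driver = webdriver.Chrome(options=options)
    try:
        driver.get(channel_videos_url(handle))
        # Wait until at least one title element is present.
        # The "#video-title" selector is an assumption about the page markup.
        WebDriverWait(driver, 15).until(
            lambda d: d.find_elements(By.CSS_SELECTOR, "#video-title")
        )
        elements = driver.find_elements(By.CSS_SELECTOR, "#video-title")
        titles = [e.text.strip() for e in elements if e.text.strip()]
        return titles[:max_titles]
    finally:
        # Always close the browser, even if the page fails to load.
        driver.quit()
```

Calling `scrape_video_titles("example")` requires the `selenium` package and a local Chrome installation. Only the videos loaded on the first screen are returned; collecting around 200 videos per channel would also need scrolling, which is left out here.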


Sample of Data Collected

The average number of videos per channel was around 200. In total, data from 2000 videos was scraped.

Sample Output (Clean Data)

Screenshot (3979)

Video Scraping Visualization



Takeaways

  1. Video comment counts have very little correlation with any other data obtained in this project.

  2. The following pairs seem to be highly correlated:

    • Channel Views and Subscribers
    • Interactions and Video Views
  3. Video titles fall into 5 topic groups.

    K-means and PCA were used to create clusters for video titles:

    • iPhone (kmeans 0)
    • Samsung (kmeans 1)
    • Reviews (kmeans 2)
    • Unboxing (kmeans 3)
    • How-to (kmeans 4)
  4. 70% of the most viewed videos are about phones.

  5. Join date (the date a YouTube channel was created) does not seem to have any relationship to number of subscribers or overall cha
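
The correlation claims in the list above can be checked with a Pearson correlation coefficient between pairs of columns. The sketch below is a minimal plain-Python version. The column names and all numbers are made up for illustration and are not values from the scraped data, and the notebook may well compute correlations differently.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of products of deviations (covariance up to a constant factor).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant column")
    return cov / sqrt(var_x * var_y)


# Made-up numbers for illustration only; not values from the scraped data.
toy = {
    "channel_views": [10, 20, 30, 40, 50],
    "subscribers": [1, 2, 3, 4, 5],
    "comments": [5, 1, 4, 1, 5],
}

for col in ("subscribers", "comments"):
    print(col, round(pearson(toy["channel_views"], toy[col]), 3))
```

A coefficient near 1 or -1 indicates a strong linear relationship, while a value near 0 indicates little or none. Pearson correlation only captures linear association, so a low value does not rule out other kinds of dependence.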



Colab Link

Data Analysis of Youtube Tech Channels