You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
scraping data also leads to problematic territory like terms of service and them just straight up blocking your ass for taking their data, etc... And not all sites have nice apis for scraping, so some general scraper would be hard.
Ideally this is modular but it would be good to have a boilerplate for scraping data from:
-twitter
-youtube
-tiktok
-instagram
Or even just google drive & pipe-lining it into a PyTorch dataloader.
The text was updated successfully, but these errors were encountered: