Skip to content

Alphx-rgb/Web-Crawler-CLI-Tool

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Web-Crawler

Project of WOC

  • This project is about a web crawler,a tool which extracts information about webpages.
  • I will use Python language for the project.
  • This Crawler crawls over the internet and stores links,images and screenshots of linkavailable onthe website.
  • for further help: use command "python WCSC.py man" or "python WCSC.py -help"

Modules/libraries used:

  • tldextract

  • selenium

  • os

  • bs4

  • requests

  • sys

  • termcolor

  • itertools

  • keyboard

  • time

  • re Below are some Snippets of working of the tool:

  • man_page man_page

  • Headers Headers

Note: Enter email : https://github.com and depth : 1 or greater than 1

About

A CLI Tool for Web Crawling and Scrapping

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages