Skip to content

Mini web scraper for archiving blogspot/wordpress blog posts

Notifications You must be signed in to change notification settings

honeytoast/BlogPostArchiver

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BlogPostArchiver

Mini web scraper for archiving blogspot/wordpress blog posts.
Outputs one html file in the local directory corresponding for each URL in sample_article_links.txt.

Requirements

Python3
pip

Set up the Virtual Environment

$ pip install venv
$ python3 -m venv venv
$ source venv/bin/activate
$ pip install -r requirements.txt

Run the tool

Edit and save sample_article_links.txt with the URLs you want to archive. Each line should be a separate URL.
$ python3 scrape.py

About

Mini web scraper for archiving blogspot/wordpress blog posts

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages