Smartproxy/python-scrapy-amazon

Python Scrapy Amazon Scraper

Scrape Amazon product listings using Scrapy and residential proxies.

Prerequisites

To get started, install Scrapy using one of the methods described in the official Scrapy documentation.

Authentication & Proxy setup

Once you have an active subscription, you can find your credentials and proxy addresses under Dashboard > Residential > Proxy Setup.

Open settings.py in the /amazon/amazon/ folder and set the following values to authenticate:

SMARTPROXY_USER = 'SPusername' ## Smartproxy Username (Sub-user)
SMARTPROXY_PASSWORD = 'SPpassword' ## Password for your user
SMARTPROXY_ENDPOINT = 'gate.smartproxy.com' ## Endpoint you'd like to use
SMARTPROXY_PORT = '7000' ## Port of the endpoint you are using.
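
As a rough illustration of how the four values above fit together, they can be combined into a single proxy URL, which Scrapy's built-in HttpProxyMiddleware accepts through the `proxy` key of a request's `meta`. This is only a sketch: the helper `build_proxy_url` is hypothetical and is not taken from this repository, which may wire the settings in differently.

```python
from urllib.parse import quote

# Placeholder values mirroring the settings.py entries above.
SMARTPROXY_USER = "SPusername"
SMARTPROXY_PASSWORD = "SPpassword"
SMARTPROXY_ENDPOINT = "gate.smartproxy.com"
SMARTPROXY_PORT = "7000"


def build_proxy_url(user, password, endpoint, port, scheme="http"):
    """Hypothetical helper: return scheme://user:password@endpoint:port."""
    # Percent-encode the credentials so characters such as '@' or ':'
    # cannot be mistaken for URL delimiters.
    safe_user = quote(user, safe="")
    safe_password = quote(password, safe="")
    return f"{scheme}://{safe_user}:{safe_password}@{endpoint}:{port}"


proxy_url = build_proxy_url(
    SMARTPROXY_USER, SMARTPROXY_PASSWORD, SMARTPROXY_ENDPOINT, SMARTPROXY_PORT
)

# Inside a spider, the URL could then be attached to each request, e.g.:
#   yield scrapy.Request(url, meta={"proxy": proxy_url})
```

Keeping the credentials in settings.py and building the URL in one place means that switching endpoint or port only requires changing the settings.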

Running the scraper

Navigate to the project folder and run the following command:

scrapy crawl amazon_search
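
If the crawl needs to be started from another Python script (for example, a scheduler), the same command can be launched with the standard library. This is a minimal sketch; the function names and the `project_dir` argument are assumptions for illustration and are not part of the repository.

```python
import subprocess


def build_crawl_command(spider="amazon_search"):
    """Return the argument list for the same command shown above."""
    return ["scrapy", "crawl", spider]


def run_crawl(project_dir, spider="amazon_search"):
    """Run the spider from project_dir and return the process exit code.

    project_dir is assumed to be the folder containing scrapy.cfg.
    """
    completed = subprocess.run(
        build_crawl_command(spider), cwd=project_dir, check=False
    )
    return completed.returncode


# Example (hypothetical path):
#   exit_code = run_crawl("amazon")
```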

Results

Amazon search results are saved to the /amazon/data folder in .csv format.
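
To inspect the output afterwards, the CSV files can be read back with Python's standard library. The sketch below assumes only that the data folder contains one or more .csv files with a header row; the exact file names and columns depend on the spider and are not specified here.

```python
import csv
from pathlib import Path


def load_results(data_dir):
    """Read every .csv file in data_dir and return the rows as dictionaries."""
    rows = []
    for csv_path in sorted(Path(data_dir).glob("*.csv")):
        # newline="" lets the csv module handle line endings itself.
        with csv_path.open(newline="", encoding="utf-8") as handle:
            rows.extend(csv.DictReader(handle))
    return rows


# Example (path relative to the repository root is an assumption):
#   for row in load_results("amazon/data"):
#       print(row)
```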
