Crawling Tweets as a Network.
Given a list of keywords, TwiSpider crawls real-time tweets that match them. If a matching tweet is a retweet, it saves the original tweet. It then monitors the original tweets on a regular basis.
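The retweet handling described above can be sketched as follows. This is a minimal illustration, not TwiSpider's actual code: it assumes tweets arrive as Twitter API v1.1-style dictionaries, where a retweet embeds its source tweet under the `retweeted_status` key. The function name `original_tweet` and the sample payloads are hypothetical.

```python
from typing import Any, Dict


def original_tweet(tweet: Dict[str, Any]) -> Dict[str, Any]:
    """Return the source tweet of a retweet, or the tweet itself otherwise."""
    # Assumption: v1.1-style payload, where only retweets carry
    # a "retweeted_status" object holding the original tweet.
    return tweet.get("retweeted_status") or tweet


# Hypothetical sample payloads, trimmed to the fields used here.
retweet = {
    "id": 2,
    "text": "RT @someone: hpv vaccine news",
    "retweeted_status": {"id": 1, "text": "hpv vaccine news"},
}
plain = {"id": 3, "text": "hello"}

print(original_tweet(retweet)["id"])
print(original_tweet(plain)["id"])
```

Storing the resolved original rather than each retweet keeps one row per source tweet, which is the natural unit to revisit when monitoring.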
postgresql@11
pip install -r requirements.txt
- Fill in configs. Make a copy of each .template file. For example, copy twitter.ini.template, name the copy twitter.ini, and fill in your API information.
- Edit src/keywords.txt to set the monitored keywords, one per line. A space means boolean AND: the line "hpv vaccine" monitors tweets that contain both "hpv" and "vaccine".
- Create tables with src/sqls/create_tables.sql
- Run with python3 src/main.py
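To make the keyword file semantics concrete, here is a small sketch of how the lines in src/keywords.txt could be interpreted. It is an illustration under assumptions, not the project's own matching logic: the helper names `parse_keywords` and `matches` are hypothetical, matching is done as case-insensitive substring search, and separate lines are treated as alternatives (OR). In practice the filtering may be done by the Twitter API itself.

```python
from typing import List


def parse_keywords(text: str) -> List[List[str]]:
    """Turn the contents of a keywords file into a list of term groups.

    Each non-blank line becomes one group; the space-separated terms
    inside a group must all be present (boolean AND).
    """
    groups = []
    for line in text.splitlines():
        terms = line.lower().split()
        if terms:  # skip blank lines
            groups.append(terms)
    return groups


def matches(tweet_text: str, groups: List[List[str]]) -> bool:
    """Return True if the tweet satisfies at least one keyword line."""
    lowered = tweet_text.lower()
    # Assumption: lines are OR-ed together; terms within a line are AND-ed.
    return any(all(term in lowered for term in terms) for terms in groups)


# Hypothetical file contents: two keyword lines and a blank line.
groups = parse_keywords("hpv vaccine\n\nflu\n")
print(groups)
print(matches("New HPV vaccine study released", groups))
print(matches("hpv only", groups))
```

Substring matching is the simplest choice for a sketch; it would also match terms embedded in longer words, so a stricter tokenizer may be preferable for real filtering.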