Skip to content

snutiise/Twitter-Crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

14 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

ํŠธ์œ„ํ„ฐ ์ด๋ฏธ์ง€ ํฌ๋กค๋Ÿฌ์ž…๋‹ˆ๋‹ค.

์‚ฌ์šฉํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” selenium, scrapy, pymongo, configparser๋ฅผ ์„ค์น˜ํ•ด์•ผํ•ฉ๋‹ˆ๋‹ค.

$ git clone https://github.com/snutiise/Twitter-Crawler.git

$ sudo pip install configparser pymongo selenium scrapy

$ cd Twitter-Crawler

$ scrapy crawl twitter


๋ฉ”ํƒ€๋ฐ์ดํ„ฐ ์ €์žฅ์‹œ ๋ชฝ๊ณ ๋””๋น„๋ฅผ ์ด์šฉํ•˜๋ฏ€๋กœ ๋ชฝ๊ณ ๋””๋น„๋„ ์„ค์น˜ํ•ด์•ผํ•ฉ๋‹ˆ๋‹ค.

mongodb config -> settings.py ํŒŒ์ผ์ฐธ์กฐ



config ํŒŒ์ผ์—์„œ ์ˆ˜์ง‘ํ•˜๊ณ  ์‹ถ์€ ์ด๋ฏธ์ง€์— ๋Œ€ํ•œ ํ‚ค์›Œ๋“œ์™€ ํŽ˜์ด์ง€ ์ˆ˜, ๊ทธ๋ฆฌ๊ณ  ํฌ๋กค๋Ÿฌ๊ฐ€ ์œ„์น˜ํ•œ ์ ˆ๋Œ€๊ฒฝ๋กœ๋ฅผ ์„ค์ •ํ•ด์ฃผ๋ฉด ๋ฉ๋‹ˆ๋‹ค.

ex)

keyword=๋Ÿฌ๋ธ”๋ฆฌ์ฆˆ

page=10

rootPath=/home/jsh/git/Twitter-Crawler/

About

Twitter Crawler

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages