Skip to content

A bunch of crawlers for extracting data from various sites (site name is mentioned for each one)

License

Notifications You must be signed in to change notification settings

armiro/crawlers

Repository files navigation

License Status Commits repo size

A Set of Crawlers

Each crawler is built as part of another project. Different crawler techs are used:

  • Selenium
  • BeautifulSoup
  • Scrapy
  • Scholarly

Other possible crawlers that may speed up code flow (not used yet):

  • serpAPI
  • Octoparse

Data Collections

Canadian Top University Researchers Data

license download doi

Dataset consists of 32,240 records of Google Scholar profiles from researchers affiliated with top 20 universities in Canada. Columns are GUID, full name, list of research interests, university name, and number of total citations per researcher.

About

A bunch of crawlers for extracting data from various sites (site name is mentioned for each one)

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages