Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Useful sources #6

Open
jure opened this issue Apr 5, 2014 · 11 comments
Open

Useful sources #6

jure opened this issue Apr 5, 2014 · 11 comments

Comments

@jure
Copy link
Member

jure commented Apr 5, 2014

I'll use this issue to compile a list of useful sources for getting citations of code.

These sources must have full-text search, as this significantly increases the sensitivity of the search.

Having an API is optional, scraping can also be useful.

@sckott
Copy link

sckott commented Apr 5, 2014

Springer

Has some APIs http://dev.springer.com/, but in my experience have been a mess, and they don't respond to bug reports, etc.

@jure
Copy link
Member Author

jure commented Apr 5, 2014

Elsevier

Corpus: 7 million articles

Example of search: http://api.elsevier.com/content/search/index:SCIDIR?query="vegan: community ecology package"&APIkey=

API key needed, also notorious for its licensing of this data, so perhaps we should stay clear. It's a very good resource in terms of scope though!

@jure
Copy link
Member Author

jure commented Apr 5, 2014

Microsoft Academic Search

Corpus: 10 million articles?

Example of search: http://academic.research.microsoft.com/Search?query=r-project.org%2fweb

I don't think MAS has 10 million full text indexed articles. Has an API.

@sckott
Copy link

sckott commented Apr 5, 2014

@jure I heard Microsoft was deprecating that API, but perhaps I'm wrong?

@sckott
Copy link

sckott commented Apr 5, 2014

@jure I think @hpiwowar has experiencing working with the Elsevier API?

@jure
Copy link
Member Author

jure commented Apr 6, 2014

You could be right, @sckott, looks like the last update to MAS in general was in January 2013. That's aeons ago.

It would be awesome if @hpiwowar pitched in, although with Elsevier it's not really using the API that's the question, it's their strict licensing that is bothersome. But then again, licensing is always an issue.

@sckott
Copy link

sckott commented Apr 7, 2014

@jure curious if in this

These sources must have full-text search, as this significantly increases the sensitivity of the search.

you really need full text. Couldn't you just have the list of references from the paper? Wonder if there are some APIs that provide that in their metadata?

@jure
Copy link
Member Author

jure commented Apr 14, 2014

From what I've seen, APIs that have reference lists also have full-text. I agree that if we can get reference lists for a paper, that would be a fantastic start and full-text is not needed. But again, usually these two are coupled.

@pietsch
Copy link

pietsch commented Jun 23, 2014

BASE

Just use the search API of BASE (Bielefeld Academic Search Engine): http://www.base-search.net/about/en/about_develop.php?menu=2
BASE indexes >60 million publications, many of them open access.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants