[Feature Request] API for wiki tables #35

Open
skyprince999 opened this issue May 11, 2020 · 3 comments
Labels
mediawiki-issue It is not supported by mediawiki API, reopen with proper API call.

Comments

@skyprince999

Is it possible to add an API to extract tables from a wiki page?

skyprince999 changed the title from "[Feature Request]" to "[Feature Request] API for wiki tables" on May 11, 2020
@martin-majlis
Owner

martin-majlis commented Nov 23, 2020

Wikipedia-API uses the MediaWiki extracts API call, which does not return tables. For example, the page "List of masters world records in road running" contains a lot of tables, but when I load that page via the API, no tables are returned.

There is a different API call, parse, that returns the content as HTML with all the tags. Maybe this one could be used instead. :)
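
To make the suggestion concrete, here is a minimal sketch of what a parse request could look like. It is not part of Wikipedia-API; the helper names `build_parse_params` and `fetch_page_html`, the User-Agent string, and the choice of the English Wikipedia endpoint are assumptions made for illustration. It uses only the standard library and the MediaWiki `action=parse` call with `prop=text`.

```python
import json
import urllib.parse
import urllib.request

# Assumption: English Wikipedia; other wikis expose the same api.php path.
API_URL = "https://en.wikipedia.org/w/api.php"


def build_parse_params(title):
    """Return query parameters for a MediaWiki action=parse request."""
    return {
        "action": "parse",     # parse the page instead of using extracts
        "page": title,         # page title, e.g. "Python (programming language)"
        "prop": "text",        # only ask for the rendered HTML
        "format": "json",
        "formatversion": "2",  # with version 2, parse.text is a plain string
    }


def fetch_page_html(title, api_url=API_URL):
    """Fetch the rendered HTML of a page (hypothetical helper, needs network)."""
    query = urllib.parse.urlencode(build_parse_params(title))
    request = urllib.request.Request(
        f"{api_url}?{query}",
        # Placeholder User-Agent; replace it with one identifying your tool.
        headers={"User-Agent": "wiki-tables-sketch/0.1 (example)"},
    )
    with urllib.request.urlopen(request) as response:
        payload = json.load(response)
    return payload["parse"]["text"]
```

The HTML string returned by `fetch_page_html` still contains the `<table>` elements, so a table extractor could be layered on top of it.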

martin-majlis added the mediawiki-issue label (It is not supported by mediawiki API, reopen with proper API call.) on Nov 23, 2020
@ScoobyQ

ScoobyQ commented May 25, 2021

Agreed. This is a highly popular action. The interesting part will be handling tables that are frequently irregularly shaped. I'm guessing something similar to how pandas read_html handles them? To make this really useful, there should be a way to target specific table(s).
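
As a rough illustration of both points (irregular shapes and targeting), here is a standard-library sketch. It is not pandas and not part of Wikipedia-API; `TableExtractor`, `extract_tables` and the `match` argument are hypothetical names, with `match` loosely modeled on the parameter of the same name in pandas read_html. It assumes well-formed HTML with closed `<td>`/`<th>` tags and does not handle nested tables.

```python
from html.parser import HTMLParser


def _span_value(attrs, name):
    """Read a rowspan/colspan attribute, falling back to 1."""
    value = dict(attrs).get(name) or "1"
    return max(1, int(value)) if value.isdigit() else 1


class TableExtractor(HTMLParser):
    """Collect each <table> as a grid of strings, expanding row/col spans."""

    def __init__(self):
        super().__init__()
        self.tables = []     # finished tables, each a list of rows
        self._rows = None    # rows of the table currently being read
        self._pending = {}   # column index -> (rows left, text) from rowspans
        self._row = None     # cells of the current row
        self._cell = None    # text fragments of the current cell
        self._span = (1, 1)  # (rowspan, colspan) of the current cell

    def _fill_pending(self):
        # Copy down cells from earlier rows whose rowspan covers this column.
        while len(self._row) in self._pending:
            col = len(self._row)
            left, text = self._pending.pop(col)
            self._row.append(text)
            if left > 1:
                self._pending[col] = (left - 1, text)

    def handle_starttag(self, tag, attrs):
        if tag == "table" and self._rows is None:
            self._rows, self._pending = [], {}
        elif tag == "tr" and self._rows is not None:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []
            self._span = (_span_value(attrs, "rowspan"),
                          _span_value(attrs, "colspan"))

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            text = " ".join("".join(self._cell).split())
            self._cell = None
            self._fill_pending()
            rowspan, colspan = self._span
            for _ in range(colspan):  # repeat the text across spanned columns
                col = len(self._row)
                self._row.append(text)
                if rowspan > 1:       # remember it for the rows below
                    self._pending[col] = (rowspan - 1, text)
        elif tag == "tr" and self._row is not None:
            self._fill_pending()
            self._rows.append(self._row)
            self._row = None
        elif tag == "table" and self._rows is not None:
            self.tables.append(self._rows)
            self._rows = None


def extract_tables(html, match=None):
    """Return all tables, or only those with a cell containing `match`."""
    parser = TableExtractor()
    parser.feed(html)
    parser.close()
    if match is None:
        return parser.tables
    return [table for table in parser.tables
            if any(match in cell for row in table for cell in row)]
```

Repeating a spanned cell's text in every position it covers keeps each row the same width, which is one simple way to regularize such tables; a real implementation might prefer a different policy.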

@caniko

caniko commented Jan 25, 2022

This should probably use pandas along with fletcher to store Python objects in the DataFrame efficiently.
