Stages not triggered when importing to Postgres #254

Open
cheetah90 opened this issue Dec 6, 2015 · 7 comments

@cheetah90

None of the other stages in org.wikibrain.download.loader are triggered when Postgres is selected as the data source in the GUILoader. (I'm not sure whether this is specific to GUILoader.)

@cheetah90
Author

No worries, I figured it out. It looks like the Postgres tables such as "local_articles" cannot distinguish between different language editions, so once the tables have been created for one language, those stages are skipped when importing other languages. Did I miss something, or is this really an issue?
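
To make the hypothesis above concrete, here is a minimal Java sketch, assuming the loader records completion per stage (or table) name with no language component. Every name in it is hypothetical and none is taken from WikiBrain's code; it only illustrates how such a check would skip the second language.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical illustration only; these names do not come from WikiBrain.
class StageSkipSketch {
    // Assumption: completion is recorded per stage name, with no language key.
    private final Set<String> completedStages = new HashSet<>();

    // Returns true if the stage actually ran, false if it was skipped.
    boolean runStage(String stage, String language) {
        if (completedStages.contains(stage)) {
            // Skipped: the check ignores which language is being imported.
            return false;
        }
        completedStages.add(stage);
        return true;
    }

    public static void main(String[] args) {
        StageSkipSketch loader = new StageSkipSketch();
        System.out.println(loader.runStage("local_articles", "en")); // prints "true"
        System.out.println(loader.runStage("local_articles", "es")); // prints "false"
    }
}
```

If completion were instead keyed on the (stage, language) pair, the second call would run. Whether the real loader behaves this way is exactly the open question here.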

@cheetah90
Author

Still waiting for an answer on this. I've found that after importing one language into Postgres, importing a second language edition halts because the loader thinks that the language has already been parsed.

@bjhecht
Collaborator

bjhecht commented Dec 8, 2015

We don't need spatial for this project, so we can probably skip this.

The intended use though is that you import them all at the same time.


@cheetah90
Author

Oh okay. I just want to leverage the speed of Postgres to handle the non-spatial data. H2 is slow on datasets over 1M entries. I'll try H2 then.

@shilad
Owner

shilad commented Dec 9, 2015

Sorry for the delayed response! The only way to add languages right now is to reinstall from scratch with all the languages you want. This is the number one feature on our summer to-do list, but it isn't easy to do efficiently with the current setup. I would recommend that you stick to Postgres. It will definitely be faster for es.

@bjhecht
Collaborator

bjhecht commented Dec 9, 2015

Ah, read the e-mail too quickly. Using postgres makes sense.

The key thing here is that they have to be imported at the same time, I believe.


@cheetah90
Author

@shilad I currently run the stages manually to import data for different language editions. This works for me: I successfully imported EN and ES into the database. However, if I use the Loader class, it seems that some tables get dropped.
