Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Databases charset #4

Open
Walter-Correa opened this issue Jun 10, 2020 · 6 comments
Open

Databases charset #4

Walter-Correa opened this issue Jun 10, 2020 · 6 comments

Comments

@Walter-Correa
Copy link

Hello.
Could you set the databases charset to UTF8?
Some names of cities / provinces in Latin America do not appear correctly. I appreciate it if you can do that!

@George480
Copy link
Owner

Could you please provide their geoname_id or part of name so I can make sure they will be correct on the next update? Thank you.

@Walter-Correa
Copy link
Author

Hello.

All geoname_ids from all south América countries: http://www.geonames.org/childrenJSON?geonameId=6255150

All geoname_ids from all state/province from Brazil: http://www.geonames.org/childrenJSON?geonameId=3469034

My server is Brazilian, if you add utf8 to provinces and cities I thank you so much. If you use the second link you will get all 26 states/provinces from Brazil and their names. If you put the geoid of each state at the end of the link you will get all cities.

If you need some more info just tell me. Sorry my english!

@George480
Copy link
Owner

I can view the names correctly with a browser so I think it might be an issue with server encoding, the way it displays them.

Can you open the include and add:

db_free_result(db_query(GEOLITE_province_db, "PRAGMA encoding=\"UTF-8\";"));

at the end of if (fexist("maxmind_province.db")) block? Let me know if it has any difference.

@Walter-Correa
Copy link
Author

None difference. Unfortunately it didn't work. :(

@George480
Copy link
Owner

George480 commented Jul 6, 2020

Writing the output to a file has the correct encoding, however printing in console outputs random characters. I am clueless how to make it work with non-English languages.

@Walter-Correa
Copy link
Author

I understand... I'm using the utf8decode() from strlib.inc, but I found bugs: oscar-broman/strlib#21

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants