Import calendar era information for CJK from the DILA Time Authority Database #27

Open
ronaldtse opened this issue Jul 14, 2021 · 8 comments
Labels
enhancement New feature or request

Comments

@ronaldtse
Contributor

ronaldtse commented Jul 14, 2021

https://github.com/DILA-edu/Authority-Databases/tree/master/authority_time/SQL

Active site: https://authority.dila.edu.tw/time/

@ronaldtse ronaldtse added the enhancement New feature or request label Jul 14, 2021
@eroux
Contributor

eroux commented Jul 14, 2021

Just as a side note, here are all the 1400 eras (Chinese and others) recorded on Wikidata:

https://query.wikidata.org/#select%20%3Fs%20where%20%7B%20%3Ft%20wdt%3AP279%2a%20wd%3AQ256408%20.%20%3Fs%20wdt%3AP31%20%3Ft%20%7D%0A
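For readers who want to see the query behind that link, here is a minimal sketch that decodes it using only the Python standard library. The helper name `decode_query` is made up for illustration. The decoded text asks for every item `?s` that is an instance (`wdt:P31`) of `wd:Q256408` or of any of its subclasses (`wdt:P279*`).

```python
from urllib.parse import unquote, urlsplit

# Query-service link quoted in the comment above; the SPARQL text is the
# part after the '#', percent-encoded.
URL = (
    "https://query.wikidata.org/#select%20%3Fs%20where%20%7B%20%3Ft%20"
    "wdt%3AP279%2a%20wd%3AQ256408%20.%20%3Fs%20wdt%3AP31%20%3Ft%20%7D%0A"
)


def decode_query(url: str) -> str:
    """Return the percent-decoded SPARQL text stored in the URL fragment."""
    return unquote(urlsplit(url).fragment).strip()


print(decode_query(URL))
# select ?s where { ?t wdt:P279* wd:Q256408 . ?s wdt:P31 ?t }
```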

@eroux
Contributor

eroux commented Jul 14, 2021

I can extract the data. What format would be best? (Eras in the DILA database are sometimes associated with different languages.)

@ronaldtse
Contributor Author

It would be best to extract them as YAML so we can import them back into a document for distribution (they will need to be vetted first).

@eroux
Contributor

eroux commented Jul 14, 2021

Sure, but I'm not sure what code we should assign to these. For instance, take DILA era 1: 建武. This is its only label, but unfortunately it is ambiguous, as 4 different eras share that name. Should the code be "dila-1", or just "chinese-era-1"? What about something like:

- code: dila1
  name: 建武
  reign_of: 光武帝
  ref: |
    * Dharma Drum Institute of Liberal Arts Time Database, era id 1
    * https://www.wikidata.org/wiki/Q8012586 

? (Note that the Wikidata Q id needs to be reconciled manually or semi-automatically, and I can't do that for all the DILA eras.)
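As a rough illustration of how records in the shape proposed above could be generated, here is a minimal Python sketch. The function name `era_to_yaml` and its parameters are assumptions made for this example and are not part of the DILA database or of this project. Values are written unquoted, so names containing YAML-special characters such as `:` or `#` would need a proper YAML library instead.

```python
def era_to_yaml(code: str, name: str, reign_of: str, refs: list[str]) -> str:
    """Format one era as a YAML list item shaped like the proposal above."""
    lines = [
        f"- code: {code}",
        f"  name: {name}",
        f"  reign_of: {reign_of}",
        "  ref: |",
    ]
    # Each reference becomes one bullet inside the literal block scalar.
    lines += [f"    * {ref}" for ref in refs]
    return "\n".join(lines) + "\n"


record = era_to_yaml(
    code="dila1",
    name="建武",
    reign_of="光武帝",
    refs=[
        "Dharma Drum Institute of Liberal Arts Time Database, era id 1",
        "https://www.wikidata.org/wiki/Q8012586",
    ],
)
print(record)
```

Keeping the references in a literal block (`|`) leaves them as free text that a reviewer can edit during vetting.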

@ronaldtse
Contributor Author

Can we use a pattern of {transliterated reign}-{transliterated era}?

The important thing is to have the data; the codes can be fixed later.
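A minimal sketch of the `{transliterated reign}-{transliterated era}` pattern, for illustration only. The `PINYIN` table is a hypothetical stand-in for a real transliteration library and covers just the characters of the 光武帝 / 建武 example, and the function names are assumptions.

```python
import re

# Hypothetical stand-in for a real transliterator: a tiny hand-written
# character-to-Pinyin table covering only this example.
PINYIN = {"光": "guang", "武": "wu", "帝": "di", "建": "jian"}


def transliterate(text: str) -> str:
    """Join toneless Pinyin syllables; raises KeyError on unknown characters."""
    return "".join(PINYIN[ch] for ch in text)


def era_code(reign: str, era: str) -> str:
    """Build a '{transliterated reign}-{transliterated era}' code."""
    code = f"{transliterate(reign)}-{transliterate(era)}".lower()
    # Keep only lowercase ASCII letters, digits and the hyphen separator.
    return re.sub(r"[^a-z0-9-]", "", code)


print(era_code("光武帝", "建武"))  # guangwudi-jianwu
```

Since the reign is part of the code, two eras that share a name stay distinct as long as they fall under different reigns.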

@eroux
Contributor

eroux commented Jul 15, 2021

OK, yes, I'll work on that.

@ronaldtse
Contributor Author

Thanks @eroux !

@eroux
Contributor

eroux commented Jul 16, 2021

eras.zip

@ronaldtse what about this file? I used https://github.com/lxneng/xpinyin for the conversion to Pinyin, but it sometimes has issues when a character can be pronounced in different ways. Unfortunately, I don't feel I can do much better. What do you think?
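One possible way to narrow down the manual vetting, sketched under assumptions: flag every name that contains a character with more than one reading, so only those entries need a human check. The `READINGS` table below is a hypothetical hand-written sample, not data from xpinyin or DILA, and the function names are made up for this example.

```python
# Hypothetical reading table: characters mapped to their possible Pinyin
# readings. A real table would have to come from a dictionary source.
READINGS = {
    "長": ["chang", "zhang"],
    "樂": ["le", "yue"],
    "建": ["jian"],
    "武": ["wu"],
}


def ambiguous_chars(name: str, readings: dict[str, list[str]]) -> list[str]:
    """Return the characters in `name` that have more than one known reading."""
    return [ch for ch in name if len(readings.get(ch, [])) > 1]


def needs_review(name: str, readings: dict[str, list[str]]) -> bool:
    """True when a name contains an ambiguous or unknown character."""
    return any(len(readings.get(ch, [])) != 1 for ch in name)


# Two illustrative names: one unambiguous, one with two polyphonic characters.
for name in ["建武", "長樂"]:
    print(name, ambiguous_chars(name, READINGS), needs_review(name, READINGS))
```

Characters missing from the table are also sent to review, which errs on the side of a human check.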
