Support transliteration of Chinese characters (pinyin) #58

Open
rhdunn opened this issue Dec 15, 2013 · 0 comments

rhdunn commented Dec 15, 2013

Chinese character transliteration should be based on the pinyin romanization system. The data for this is in the Unicode Character Database (UCD) files, and the extraction of the pinyin transcriptions should be done by the ucd-tools project.
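
As a rough sketch of the extraction step, assuming the readings come from the Unihan readings file in the UCD (tab-separated code point, field name and value, with `#` comment lines), something like the following could collect the per-character readings. The sample lines and the `parse_readings` helper are illustrative assumptions, not how ucd-tools does it.

```python
import io

# Illustrative sample in the tab-separated layout of Unihan_Readings.txt.
# The real file is much larger; these few lines only stand in for it.
SAMPLE = """\
# comment lines start with '#'
U+4E2D\tkMandarin\tzhōng
U+4E2D\tkCantonese\tzung1 zung3
U+6587\tkMandarin\twén
U+6587\tkDefinition\tliterature, culture, writing
"""

def parse_readings(lines, fields=("kMandarin", "kCantonese")):
    """Hypothetical helper: map each character to its readings per field."""
    readings = {}
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        codepoint, field, value = line.split("\t", 2)
        if field not in fields:
            continue  # ignore fields we are not interested in
        char = chr(int(codepoint[2:], 16))  # "U+4E2D" -> "中"
        # A field may list several readings separated by spaces.
        readings.setdefault(char, {})[field] = value.split()
    return readings

readings = parse_readings(io.StringIO(SAMPLE))
print(readings["中"])
```

Passing a different `fields` tuple (for example one that includes `kJapaneseOn`) would pick up the other languages in the same way.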

Specifically, transcriptions for Mandarin, Cantonese and Japanese pronunciations of the Chinese characters should be supported.

In addition, the pinyin transcriptions should support two pronunciation modes:

  1. phonetic/IPA -- an accurate IPA-based phonetic transcription;
  2. Latin/English -- an English approximation of the Chinese.
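
To make the two modes concrete, here is a minimal sketch of how a mode switch might look. The `Mode` enum, the `TABLE` entries and the `transliterate` function are all hypothetical; the IPA and English spellings are hand-written placeholders for illustration, not data from the project.

```python
from enum import Enum

class Mode(Enum):
    IPA = "ipa"      # accurate phonetic transcription
    LATIN = "latin"  # English approximation

# Placeholder entries keyed by toneless pinyin syllable (assumed values).
TABLE = {
    "ni":  {Mode.IPA: "ni",  Mode.LATIN: "nee"},
    "hao": {Mode.IPA: "xau", Mode.LATIN: "how"},
    "ma":  {Mode.IPA: "ma",  Mode.LATIN: "mah"},
}

def transliterate(syllables, mode):
    """Render a list of pinyin syllables in the requested mode."""
    return " ".join(TABLE[s][mode] for s in syllables)

print(transliterate(["ni", "hao"], Mode.IPA))
print(transliterate(["ni", "hao"], Mode.LATIN))
```

A table per mode keeps the character-to-pinyin lookup shared, so only the final rendering step differs between the two outputs.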

To be complete, this requires improving the phoneme model to support tone markers.
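
One possible way to get at the tone information, sketched under the assumption that the readings use tone-marked pinyin (e.g. `zhōng`): decompose each syllable with Unicode normalization and map the combining diacritic to a tone number. The `split_tone` helper and the choice of 5 for the neutral tone are assumptions made for this example; the Chao tone letters are the conventional IPA rendering of the four Mandarin tones.

```python
import unicodedata

# Combining diacritics used by pinyin tone marks -> tone number.
TONE_MARKS = {
    "\u0304": 1,  # macron, as in ā
    "\u0301": 2,  # acute,  as in á
    "\u030C": 3,  # caron,  as in ǎ
    "\u0300": 4,  # grave,  as in à
}

# Conventional Chao tone letters for the Mandarin tones.
TONE_LETTERS = {1: "˥", 2: "˧˥", 3: "˨˩˦", 4: "˥˩", 5: ""}

def split_tone(syllable):
    """Return (toneless syllable, tone number); 5 means neutral tone."""
    tone = 5
    base = []
    for ch in unicodedata.normalize("NFD", syllable):
        if ch in TONE_MARKS:
            tone = TONE_MARKS[ch]
        else:
            base.append(ch)  # keeps the diaeresis of ü intact
    return unicodedata.normalize("NFC", "".join(base)), tone

for s in ("zhōng", "lǜ", "ma"):
    base, tone = split_tone(s)
    print(base, tone, TONE_LETTERS[tone])
```

Keeping the tone as a separate value, rather than baked into the syllable string, is what would let a phoneme model carry it as a tone marker.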

