Re-import data from ICU4C with starters in contraction-middle position marked in collation data #6181

Open
hsivonen opened this issue Feb 24, 2025 · 0 comments
Labels
2.0-breaking (Changes that are breaking API changes), A-data (Area: Data coverage or quality), C-collator (Component: Collation, normalization)

Comments

@hsivonen
Member

After unicode-org/icu#3411 has landed in ICU4C and before ICU4X 2.0, change datagen to import an icuexportdata zip file that was generated with that ICU4C PR in effect.

This does not require changes to the current ICU4X algorithms but will enable implementing #2712.

Once #2712 is implemented in ICU4X 2.x, ICU4X 2.x will no longer compute correct results with data that predates the above ICU4C PR. I'll leave it to folks who know ICU4X data policies better than I do to decide whether that means we should increment the data version of CollationRootV1 and CollationTailoringV1 for ICU4X 2.0.

hsivonen added the 2.0-breaking, A-data, and C-collator labels on Feb 24, 2025