Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Normalized form changed (こだわる, 拘る) is correct? #39

Open
ghost opened this issue Jan 28, 2022 · 1 comment
Open

Normalized form changed (こだわる, 拘る) is correct? #39

ghost opened this issue Jan 28, 2022 · 1 comment

Comments

@ghost
Copy link

ghost commented Jan 28, 2022

Thank you everyone.
I have a question about the changes in v20211220.

# v20211220
こだわる,1414,1414,12976,こだわる,動詞,一般,*,*,五段-ラ行,終止形-一般,コダワル,こだわる,59504,A,*,*,*,*
拘る,1414,1414,9931,拘る,動詞,一般,*,*,五段-ラ行,終止形-一般,コダワル,拘る,463547,A,*,*,*,*

# v20210802
こだわる,1414,1414,12976,こだわる,動詞,一般,*,*,五段-ラ行,終止形-一般,コダワル,こだわる,59516,A,*,*,*,*
拘る,1414,1414,9931,拘る,動詞,一般,*,*,五段-ラ行,終止形-一般,コダワル,こだわる,463727,A,*,*,*,*

"拘る" normalized form ( 語彙素 ) has been changed as followings.

"こだわる" => "拘る"

Is this change as expected?
I think, rather v20210802 seems to be correct.

@kawahara-n
Copy link

kawahara-n commented Jan 31, 2022

Thank you for your report.

"拘る" are registered as two different words, "コダワル" and "カカワル".
It is difficult to distinguish between them in morphological analysis.
So we changed these normalized form to be same.

# v20210802
拘る,1414,1414,10004,拘る,動詞,一般,*,*,五段-ラ行,終止形-一般,カカワル,関わる,463726,A,*,*,*,010755
拘る,1414,1414,9931,拘る,動詞,一般,*,*,五段-ラ行,終止形-一般,コダワル,こだわる,463727,A,*,*,*,*

# v20211220
拘る,1414,1414,10004,拘る,動詞,一般,*,*,五段-ラ行,終止形-一般,カカワル,拘る,461912,A,*,*,*,*
拘る,1414,1414,9931,拘る,動詞,一般,*,*,五段-ラ行,終止形-一般,コダワル,拘る,461913,A,*,*,*,*

But we left normalized form of "こだわる" (hiragana notation) unchanged.
We will modify it as follows in the next release.

# next version
拘る,1414,1414,9931,拘る,動詞,一般,*,*,五段-ラ行,終止形-一般,コダワル,拘る,461913,A,*,*,*,*
こだわる,1414,1414,12976,こだわる,動詞,一般,*,*,五段-ラ行,終止形-一般,コダワル,拘る,59042,A,*,*,*,*

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant