Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

hfst compilation fails ( #12

Open
albbas opened this issue Oct 19, 2017 · 5 comments
Open

hfst compilation fails ( #12

albbas opened this issue Oct 19, 2017 · 5 comments
Labels
enhancement New feature or request low priority

Comments

@albbas
Copy link
Contributor

albbas commented Oct 19, 2017

This issue was created automatically with bugzilla2github

Bugzilla Bug 2438

Date: 2017-10-19T20:44:53+02:00
From: Børre Gaup <<borre.gaup>>
To: Trond Trosterud <<trond.trosterud>>
CC: sindre.reino.trosterud, sjur.n.moshagen

Last updated: 2017-11-20T12:39:21+01:00

@albbas
Copy link
Contributor Author

albbas commented Oct 19, 2017

Comment 12674

Date: 2017-10-19 20:44:53 +0200
From: Børre Gaup <<borre.gaup>>

Using these configure options:
./configure --with-hfst --without-xfst --enable-tokenisers --enable-reversed-intersect --enable-alignment

Gives this result:
make[2]: Entering directory '/home/boerre/repos/langtech/langs/fkv/src/phonology'
HTWOLC fkv-phon.hfst
Exception: HfstException in file: htwolcpre1-parser.yy on line: 541

@albbas
Copy link
Contributor Author

albbas commented Oct 25, 2017

Comment 12680

Date: 2017-10-25 13:44:48 +0200
From: Trond Trosterud <<trond.trosterud>>

I now repeat this result,w ith the settings:
./configure --with-hfst --enable-oahpa
and the svn version of the twolc file.

Also, nn gemination (sano ==> sannoot) fails for the .xfst analyser.

@albbas
Copy link
Contributor Author

albbas commented Nov 20, 2017

Comment 12713

Date: 2017-11-20 11:43:52 +0100
From: Trond Trosterud <<trond.trosterud>>

I found the error: A % missing in front of a ^HMETA (should be %^HMETA).

Unfortunately the case is not quite closed: The code now compiles:

  • in twolc, without the error message
    *** Warning: make_num_re: Positive integer expected, got 0. ***

  • in hfst, without the error message

But the test form sannoot still fails in xfst, we get no nn gemination:

sannoot
sannoot sannoot +?

sannoot+V+Inf
sannoot+V+Inf sanoot
sannoot+V+Inf sanoa

Interestingly, it succeeds in hfst:

tf4-hsl-m0024:fkv trond$ hufkv
sannoot
sannoot sannoot+V+Inf 0.000000

So this calls for some afterthought.

@albbas
Copy link
Contributor Author

albbas commented Nov 20, 2017

Comment 12714

Date: 2017-11-20 12:38:18 +0100
From: Trond Trosterud <<trond.trosterud>>

New testing. Here is the situation:

  • Everythings work compiled with hfst (so the bug name is now misleading).
  • With the xerox tools, we have this situation:
    -- dfkv generates "sannoot+V+Inf" ok, but it cannot analyse it, instead it analyses "sanoot"
    -- testing in twolc, we get mixes results: lex-test gives non-geminated n (as does the analyser), whereas the pair-test behaves as it should:

twolc> lex-test

Lexical string ('q' = quit): san0o>^Vt
san0o>^Vt
s
a
n
0
o

^V
t

Lexical string ('q' = quit): q

twolc> pair-test

Lexical string ('q' = quit): san0o>^Vt
Surface string ('q' = quit): sanno>ot
s
a
n
0:n
o

^V:o
t
ACCEPTED

Lexical string ('q' = quit): sano>^Vt
Surface string ('q' = quit): sano>ot
s
a
n
o

^V:o
REJECTED: "Gemination Vo" fails in state 113.

@albbas
Copy link
Contributor Author

albbas commented Nov 20, 2017

Comment 12715

Date: 2017-11-20 12:39:21 +0100
From: Trond Trosterud <<trond.trosterud>>

New testing. Here is the situation:

  • Everythings work compiled with hfst (so the bug name is now misleading).
  • With the xerox tools, we have this situation:
    -- dfkv generates "sannoot+V+Inf" ok, but it cannot analyse it, instead it analyses "sanoot"
    -- testing in twolc, we get mixes results: lex-test gives non-geminated n (as does the analyser), whereas the pair-test behaves as it should:

twolc> lex-test

Lexical string ('q' = quit): san0o>^Vt
san0o>^Vt
s
a
n
0
o

^V
t

Lexical string ('q' = quit): q

twolc> pair-test

Lexical string ('q' = quit): san0o>^Vt
Surface string ('q' = quit): sanno>ot
s
a
n
0:n
o

^V:o
t
ACCEPTED

Lexical string ('q' = quit): sano>^Vt
Surface string ('q' = quit): sano>ot
s
a
n
o

^V:o
REJECTED: "Gemination Vo" fails in state 113.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request low priority
Projects
None yet
Development

No branches or pull requests

1 participant