Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

+a/+aar invalid tag #2

Open
TinoDidriksen opened this issue Dec 2, 2020 · 5 comments
Open

+a/+aar invalid tag #2

TinoDidriksen opened this issue Dec 2, 2020 · 5 comments
Labels
bug Something isn't working

Comments

@TinoDidriksen
Copy link
Member

/usr/share/giella-core/scripts/taglist2make_CG_tags_regex.sh disamb-tags.txt > make-disamb-CG-tags.regex
ERROR:
The following tags do not follow the Giella tag conventions
(either +TAG, TAG+ or +TAG+) and can not be converted to the CG
format required for proper tokeniser functionality within the
Giella infrastructure:

+a/+aar
make[4]: *** [Makefile:865: make-disamb-CG-tags.regex] Error 1
@TinoDidriksen TinoDidriksen added the bug Something isn't working label Dec 2, 2020
@Trondtr
Copy link
Contributor

Trondtr commented Dec 2, 2020

The tag is from lang-esu. The fix is to change the tag there (discuss with the main developer first). I will have a look (or you may).

@snomos
Copy link
Member

snomos commented Dec 2, 2020

This is source code contributed and maintained by others than GiellaLT people. Do not change the code without thorough discussions with the author 🙂

@Trondtr
Copy link
Contributor

Trondtr commented Dec 2, 2020

Yes, this is a good advice.

While preparing for that, I glue in the full set of multichar symbols. They are different from ours. Could you have a look and tell what tags (tag types, rather) cannot be converted to CG format?

My general view is that tags should be transparent and groupable. But let step one be the formal aspect. So: Which of these (types) cannot be handled? (see next comment)

@Trondtr
Copy link
Contributor

Trondtr commented Dec 2, 2020

(ar)
(ar*)
(e)
(g)
(g/t)
(ng)
(r)
(r/l)
(s)
(t)
(u)
(u/i)
(ʼa)
+a/+aar
+c/.s
+c/.ss
+ciqe/@⁵ciiqe
+p/.v
+pegun/.vkun
+t/.s
@%:~(ng)ạ
@+(r/l)i
@+–'(g)ar(ar)te
@nga
@²
@³
@³%:(u)cir/@⁵cir
@¹
@ʸ
@ˁ
@ˡ
@⁴
@⁵
@⁵+cete/.vkar
@⁵+p/~v
@–ngr(ar)
@ˡ~+lu/@⁵na
[1DuPoss]
[1Du]
[1PlPoss]
[1Pl]
[1SgPoss]
[1Sg]
[2DuPoss]
[2Du]
[2PlPoss]
[2Pl]
[2SgPoss]
[2Sg]
[3DuPoss]
[3Du]
[3PlPoss]
[3Pl]
[3SgPoss]
[3Sg]
[4DuPoss]
[4Du]
[4PlPoss]
[4Pl]
[4SgPoss]
[4Sg]
[A_1Du]
[A_1Pl]
[A_1Sg]
[A_2Du]
[A_2Pl]
[A_2Sg]
[A_3Du]
[A_3Pl]
[A_3Sg]
[A_4Du]
[A_4Pl]
[A_4Sg]
[Abl]
[Abl_Mod]
[Abs]
[Cardinal]
[Cnsq]
[Conc]
[Cond]
[Cont]
[CtmpII]
[CtmpI]
[DemAdv]
[DemPro]
[DuPosd]
[Du]
[Encl]
[Equ]
[FUT]
[Ignorative]
[Imprs_Agnt]
[Ind]
[Intr]
[Intrg]
[Loc]
[NEG]
[N]
[Num]
[N→N]
[N→V]
[Opt]
[Opt_PRS_A_2Sg]
[Opt_PRS_A_2Sg_P_1Sg]
[Opt_PRS_A_2Sg_P_3Sg]
[Opt_PRS_S_2PlDu]
[Opt_PRS_S_2Sg]
[Ordinal]
[PRS]
[P_1Du]
[P_1Pl]
[P_1Sg]
[P_2Du]
[P_2Pl]
[P_2Sg]
[P_3Du]
[P_3Pl]
[P_3Sg]
[P_4Du]
[P_4Pl]
[P_4Sg]
[Particle]
[PerPro]
[PlPosd]
[Pl]
[Positional]
[Prec]
[Ptcp]
[Quant_Qual]
[Rel]
[STE_2Sg]
[S_1Du]
[S_1Pl]
[S_1Sg]
[S_2Du]
[S_2Pl]
[S_2Sg]
[S_3Du]
[S_3Pl]
[S_3Sg]
[S_4Du]
[S_4Pl]
[S_4Sg]
[Sbrd]
[Sec_Ter]
[SgPosd]
[Sg]
[Symbol]
[Ter]
[Trns]
[Unpd]
[V]
[Via]
[V→N]
[V→V]
[V→XCLM]
[e]
[how.V]
g̲
g̲g̲
k̲
q̲
r̲
r̲r̲
–lria/@⁴+ngur*
–r̲(ur)lur
–r̲pag/@²vag

@ftyers
Copy link
Collaborator

ftyers commented Dec 2, 2020

Ping @lonnystrunk :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

4 participants