What should I do with the data? #20

Open
fairy-of-9 opened this issue Jun 3, 2020 · 4 comments

@fairy-of-9

I downloaded OntoNotes Release 5.0 and followed e2e-coref's "Getting Started" steps.

I created the directories data/train, data/development, and data/test, and placed the data (the output of the Getting Started steps) in them, e.g. data/train/train.english.v4_gold_conll.

Did I miss anything or do something wrong?

Thanks.

@henryhust

This article may be helpful: https://zhuanlan.zhihu.com/p/121786025

@ShadmanRohan

@fairy-of-9 did you manage to train?

@fairy-of-9

@ShadmanRohan Sorry, I couldn't get training to work.

@MarurSrikanta

Hi Shayne,
Congratulations on the great work with the coreference resolution model.

Unfortunately, I do not have the OntoNotes dataset and am using my own .txt file. I have not been able to find any useful resource for converting a .txt file into CoNLL-2012 format. I tried training with CoNLL-U format instead, but did not succeed. It would be great if you could answer the following questions:

1.) Which tool can be used to annotate the text with coreference links?
2.) Can your packages handle training on custom data?

Please reply at your convenience.

Thanks and Regards
Marur Srikanta
