I don't have any immediate plans to implement this (too busy with other work). Depending on what you need to do, the code does work for some cases. For example, it can read in the format and (1) print out one-line or multi-line versions of parses, and/or (2) apply most of the edits, and/or (3) filter by sentence length.
The main things it can't do are:
- Read OntoNotes and print to my split head format
- Read my split head format and print to OntoNotes format
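For anyone who only needs the cases that already work (one-line printing and length filtering), here is a rough sketch of what those operations look like on PTB-style bracketed parses. This is not the repository's code; the function names (`read_trees`, `one_line`, `words`, `filter_by_length`) and the nested-list tree representation are made up for illustration, and it assumes well-formed bracketing.

```python
def read_trees(text):
    """Yield each bracketed parse as a nested list: [label, child, ...]."""
    tokens = text.replace("(", " ( ").replace(")", " ) ").split()
    pos = 0

    def parse():
        nonlocal pos
        pos += 1  # consume "("
        label = ""  # PTB files often wrap each tree in an unlabeled bracket
        if tokens[pos] not in ("(", ")"):
            label = tokens[pos]
            pos += 1
        node = [label]
        while tokens[pos] != ")":
            if tokens[pos] == "(":
                node.append(parse())
            else:
                node.append(tokens[pos])
                pos += 1
        pos += 1  # consume ")"
        return node

    while pos < len(tokens):
        yield parse()


def one_line(tree):
    """Render a nested-list tree as a single-line bracketed string."""
    parts = [one_line(c) if isinstance(c, list) else c for c in tree[1:]]
    return "(" + " ".join([tree[0]] + parts).strip() + ")"


def words(tree):
    """Collect the words of a tree, skipping null elements under -NONE-."""
    out = []
    for child in tree[1:]:
        if isinstance(child, list):
            out.extend(words(child))
        elif tree[0] != "-NONE-":
            out.append(child)
    return out


def filter_by_length(trees, max_len):
    """Keep only trees whose sentence has at most max_len words."""
    return [t for t in trees if len(words(t)) <= max_len]
```

Usage would be along the lines of `for t in filter_by_length(read_trees(open(path).read()), 40): print(one_line(t))`, where `path` points at a file of bracketed parses.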
Yeah, I can successfully read the parse trees in PTB format and convert them to your split head format in most cases for OntoNotes 5.0. It only fails in some rare cases, which I may be able to fix manually. Overall, thanks for your contribution.
Probably the easiest way to do this would be to write something to preprocess the parses, moving the index to the child node where appropriate (and then similar code to reverse that when going the other way). That said, there may be more complex examples than the one I included above.
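To make the preprocessing idea concrete, here is a minimal sketch of a pair of passes that move an index down to a child and back up again. The rule used is purely hypothetical, since the actual annotation is not reproduced in this thread: it assumes the index is a numeric suffix on a label (for example `NP-1`), and that "where appropriate" means a node with exactly one nonterminal child that has no index of its own. The names `split_index`, `push_index_down`, and `pull_index_up` are invented for this example, and trees are nested lists of the form `[label, child, ...]`.

```python
import re

# Assumption: an index is a trailing "-<digits>" on a node label.
INDEX = re.compile(r"^(.*?)-(\d+)$")


def split_index(label):
    """Return (base_label, index) or (label, None) if there is no index."""
    m = INDEX.match(label)
    return (m.group(1), m.group(2)) if m else (label, None)


def push_index_down(tree):
    """Move an index from a unary parent onto its only child (bottom-up)."""
    label = tree[0]
    children = [push_index_down(c) if isinstance(c, list) else c
                for c in tree[1:]]
    base, idx = split_index(label)
    if idx is not None and len(children) == 1 and isinstance(children[0], list):
        child = children[0]
        if split_index(child[0])[1] is None:
            return [base, [child[0] + "-" + idx] + child[1:]]
    return [label] + children


def pull_index_up(tree):
    """Reverse of push_index_down: move an only child's index to its parent."""
    label = tree[0]
    children = list(tree[1:])
    if (len(children) == 1 and isinstance(children[0], list)
            and split_index(label)[1] is None):
        child_base, child_idx = split_index(children[0][0])
        if child_idx is not None:
            label = label + "-" + child_idx
            children = [[child_base] + children[0][1:]]
    return [label] + [pull_index_up(c) if isinstance(c, list) else c
                      for c in children]
```

Under this assumed rule the round trip is only exact when the input never already has an index on the sole child of an unindexed parent, so more complex cases would need extra bookkeeping.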
More recent LDC data has annotated traces like this:
The format converter should be updated to handle this format too.