Skip to content

Latest commit

 

History

History
executable file
·
45 lines (36 loc) · 1.55 KB

README.md

File metadata and controls

executable file
·
45 lines (36 loc) · 1.55 KB

UCCA-Annotated German Corpus: Der kleine Prinz (The Little Prince)

Version 2.0 (April 12, 2020)

This bundle contains 68 passages annotated according to the foundational layer of UCCA. The total number of tokens in this corpus is 18653.

Corpus:

The German corpus used here is the book Der kleine Prinz (Le Petit Prince / The Little Prince), a classic novel written in French by Antoine de Saint-Exupéry (1900--1944), and first published in 1943.

Format and Source Code:

Information about the format of the xml files and source code for reading and manipulating them are available at https://universalconceptualcognitiveannotation.github.io/.

Citation:

The annotation was conducted at the Hebrew University of Jerusalem. If you use this corpus, please cite:

@inproceedings{Oep:Abe:Abz:20,
  author = {Oepen, Stephan and Abend, Omri and Abzianidze, Lasha and
            Bos, Johan and Haji\v{c}, Jan and Hershcovich, Daniel and
            Li, Bin and O'Gorman, Tim and Xue, Nianwen and Zeman, Daniel},
  title = {{MRP}~2020: {T}he {S}econd {S}hared {T}ask on
           {C}ross-Framework and {C}ross-{L}inguistic
           {M}eaning {R}epresentation {P}arsing},
  booktitle = {Proc. of CoNLL Shared Task},
  year = 2020
}

Licensing:

The UCCA annotation is distributed under the "Attribution-ShareAlike 3.0 Unported" license (http://creativecommons.org/licenses/by-sa/3.0/). Please follow the link for exact details.