
Annotations Directory

allArcs

For documentation of the normalizedScores.tsv file in this directory, see Statistics Helper Files.

This folder contains the manual annotation of the 0-scored arcs identified across the different CV runs, in the format described under Annotation Format. The arcs are distributed across the files as follows:

k_all_200.tsv       Arcs that are present in the CV runs for every k-value.
                    Total Counts = 200

k_4not8_100.tsv     Arcs that are present in the k=4 or k=2 CV run, but not in the k=8 CV run.
                    Total Counts = 100

k_2not4_100.tsv     Arcs that are present in the k=2 CV run, but not in the k=4 or k=8 CV run.
                    Total Counts = 100
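
The three files bucket arcs by which CV runs flagged them. Below is a minimal sketch of that bucketing with Python sets, assuming each run's 0-scored arcs are available as a set of (sentence ID, token ID) pairs. The function name and the toy data are hypothetical, `k_4not8` is read here as "flagged at k=4 and not at k=8" (an interpretation of the description above), and the sampling down to 200/100/100 arcs is not modelled.

```python
def bucket_arcs(k2, k4, k8):
    """Split 0-scored arcs into the three buckets described above.

    Each argument is a set of (sentence_id, token_id) pairs flagged
    in the CV run with that k-value. Hypothetical helper, not part of
    the repository.
    """
    return {
        # flagged at every k-value
        "k_all": k2 & k4 & k8,
        # assumed reading: flagged at k=4 but no longer at k=8
        "k_4not8": k4 - k8,
        # flagged only at k=2
        "k_2not4": k2 - k4 - k8,
    }


# Toy data, invented for illustration only.
k2 = {("s1", 3), ("s1", 5), ("s2", 1), ("s3", 2)}
k4 = {("s1", 3), ("s1", 5)}
k8 = {("s1", 3)}
buckets = bucket_arcs(k2, k4, k8)
```

With this reading the three buckets are disjoint, so no arc would be annotated twice.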

testArcs

For documentation of the comparisonStats.tsv file in this directory, see Statistics Helper Files.

This folder contains the manual annotation of the 0-scored arcs identified in the test set of the UDv2.4 hi-HDTB treebank by either the CV or the baseline runs, in the format described under Annotation Format. The arcs are distributed across the files as follows:

base_allZero.tsv    All 0-scored arcs discovered in the baseline run

base_not_k.tsv      0-scored arcs discovered only in the baseline run and not
                    in the CV run

k_not_base.tsv      0-scored arcs discovered only in the CV run and not
                    in the baseline run

test_k{x}.tsv       0-scored arcs discovered in the CV run with k={x}

Annotation Format

The files in both directories are annotated in the following TSV format, with one token per line.

<Sentence-ID>   <Token-ID>  <Error Typology>
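
A minimal sketch of reading this format, assuming three tab-separated fields per line and no header row. The `Annotation` type, the `read_annotations` helper, and the sample IDs and typology labels are hypothetical; token IDs are kept as strings because the exact ID scheme is not specified here.

```python
import csv
import io
from typing import NamedTuple


class Annotation(NamedTuple):
    sentence_id: str
    token_id: str      # kept as text; the ID scheme is not specified
    error_type: str


def read_annotations(handle):
    """Parse <Sentence-ID> <Token-ID> <Error Typology> lines from a file object."""
    rows = []
    for fields in csv.reader(handle, delimiter="\t"):
        # Skip empty or whitespace-only lines.
        if not fields or all(not f.strip() for f in fields):
            continue
        if len(fields) != 3:
            raise ValueError(
                f"expected 3 tab-separated fields, got {len(fields)}: {fields!r}"
            )
        rows.append(Annotation(*(f.strip() for f in fields)))
    return rows


# Invented sample lines; real files would be opened with open(path, newline="").
sample = io.StringIO("sent-12\t4\tlabel-error\nsent-12\t9\tattachment-error\n")
annotations = read_annotations(sample)
```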

Statistics Helper Files

Two files are generated from the annotations in these directories and are used for generating evaluation statistics. Both are eventually used to produce the stats.md file.

allArcs/normalizedScores.tsv
Since the manual annotation in the other files of the allArcs directory covers unequal numbers of instances, this file holds scores normalized over 1000 arcs to allow effective comparison of the different CV runs. The following are the formulae used for calculating normalized scores:

testArcs/comparisonStats.tsv
Based on the manual annotation of the 0-scored arcs found in the test set of the UDv2.4 hi-HDTB treebank, this file compares the frequency of the different error typologies discovered across the baseline and CV runs of LISCA.
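
Both helper files summarise how often each error typology occurs. Below is a rough sketch of that kind of tally, assuming the typology labels have already been read from the annotation files. All function names and labels are hypothetical, and `per_thousand` is only one plausible reading of "normalized over 1000 arcs" (a simple proportional rescaling), not the repository's actual formula.

```python
from collections import Counter


def typology_counts(error_types):
    """Count how often each error typology label occurs."""
    return Counter(error_types)


def per_thousand(counts):
    """Rescale raw counts to a rate per 1000 annotated arcs (assumed scheme)."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {label: 1000 * n / total for label, n in counts.items()}


def compare(base_counts, cv_counts):
    """Return (label, baseline count, CV count) rows over the union of labels."""
    labels = sorted(set(base_counts) | set(cv_counts))
    return [
        (label, base_counts.get(label, 0), cv_counts.get(label, 0))
        for label in labels
    ]


# Invented labels, for illustration only.
base = typology_counts(["label-error", "label-error", "attachment-error"])
cv = typology_counts(["label-error", "ambiguous"])
table = compare(base, cv)
scaled = per_thousand(typology_counts(["label-error"] * 3 + ["ambiguous"]))
```

Rescaling to a common denominator is what makes files with 200 and 100 annotated arcs comparable side by side.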
