Skip to content

Commit

Permalink
Add Thai Discourse Treebank
Browse files Browse the repository at this point in the history
  • Loading branch information
wannaphong authored May 12, 2024
1 parent d9b341b commit be69542
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions docs/tasks/dependency_parser.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,6 +5,7 @@
| ----------------------------- | ------------------------------------------------------------ | ---------------------------------- | ------------ | ------------------------------------------------------------ | ------------------------------------------------------------ |
| UD Thai PUD | This is a part of the Parallel Universal Dependencies (PUD) treebanks created for the CoNLL 2017 shared task on Multilingual Parsing from Raw Text to Universal Dependencies. | 1,000 sentences | CC BY-SA 3.0 | Universal Dependencies | [GitHub](https://github.com/UniversalDependencies/UD_Thai-PUD) |
| Blackboard Treebank | Blackboard Treebank is a Thai dependency corpus based on the LST20 Annotation Guideline. It features dependency structures, constituency structures, word boundaries, named entities, clause boundaries, and sentence boundaries. | 122,851 clauses (38,558 sentences) | CC BY 3.0 | Prachya Boonkwan, NECTEC | [bitbucket](https://bitbucket.org/kaamanita/blackboard-treebank/) or [GitHub](https://github.com/KoichiYasuoka/spaCy-Thai/blob/master/UD_Thai-Corpora/th_blackboard.conllu) |
| Thai Discourse Treebank | The Thai Discourse Treebank (TDTB) is a project at Chulalongkorn University, Bangkok, Thailand. The annotation adopts the sense inventory from PDTB 3.0. | 180 documents | - | Chulalongkorn University | [GitHub](https://github.com/nlp-chula/thai-discourse-treebank/tree/main/data/th-tdtb) |


## Software
Expand Down

0 comments on commit be69542

Please sign in to comment.