Skip to content

udmurtNLP/datasets

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 

Repository files navigation

Datasets

image

Text datasets

Parallel corpora

Monolingual corpora

Classification

POS-tagging

  • Zerpal-pos-tagging (12,392 rows, 17 classes) HuggingFace

NER

  • WikiANN (the transcription is problematic: Latin and Cyrillic are used inconsistently, Wikipedia Markup is parsed incorrectly, but if you want to use it, see wikiann directory)

Instruction

About

No description or website provided.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published