This repository has been archived by the owner on Sep 11, 2024. It is now read-only.
v0.1.11
Fix the following issues with the huggingface Dataset
dump:
languages
property (document languages) was missing- translated and nontranslated of documents were treated as single documents