Skip to content

masakhane-io/afridoc-mt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 

Repository files navigation

AFRIDOC-MT: Document-level MT Corpus for African Languages

This repository contains the data for AFRIDOC-MT. AFRIDOC-MT is a document-level multi-parallel translation dataset covering English and five African languages: Amharic, Hausa, Swahili, Yorùbá, and Zulu. The dataset comprises 334 health and 271 information technology news documents, all human-translated from English to these languages.

The github page is still under construction. Please refer to the huggingface page for the data.

The project was generously funded by Lacuna Fund.

About

A lacuna funded project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published