net-and-kjv

This is a machine learning dataset containing two versions of the Bible. Its purpose is to support experiments in "translating" modern English to 17th-century English and vice versa.

The dataset contains roughly 31K verse pairs, split 80/20 into training and evaluation sets. The files are available as either CSV or JSONL.
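
Since the files come in two formats, a small loader that accepts either can be handy. The following is a minimal sketch, not part of the dataset itself: the function name `load_verse_pairs` is hypothetical, and it assumes the CSV files have a header row and the JSONL files hold one JSON object per line. It makes no assumption about the actual column or field names.

```python
import csv
import json
from pathlib import Path


def load_verse_pairs(path):
    """Load verse pairs from a .jsonl or .csv file as a list of dicts.

    Hypothetical helper: assumes JSONL files hold one JSON object per line
    and CSV files start with a header row naming the columns.
    """
    path = Path(path)
    suffix = path.suffix.lower()
    if suffix == ".jsonl":
        with path.open(encoding="utf-8") as f:
            # Parse one JSON object per non-empty line.
            return [json.loads(line) for line in f if line.strip()]
    if suffix == ".csv":
        with path.open(encoding="utf-8", newline="") as f:
            # DictReader uses the header row as the keys of each record.
            return [dict(row) for row in csv.DictReader(f)]
    raise ValueError(f"unsupported file type: {suffix}")
```

For example, `pairs = load_verse_pairs("train.jsonl")` would return one dict per verse pair, where `train.jsonl` is a placeholder file name. Inspecting `pairs[0].keys()` shows the field names the files actually use.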