Skip to content

This public repository contains code used in: Hill, M. J. & Hengchen, S., 2018. Quantifying the impact of dirty OCR on historical text analysis: Eighteenth Century Collections Online as a case study, (Accepted/In press) In : Digital Scholarship in the Humanities : DSH.

Notifications You must be signed in to change notification settings

COMHIS/ECCO-TCP_ECCO-OCR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 

Repository files navigation

ECCO-TCP_ECCO-OCR

This public repository contains code used in: Hill, M. J. & Hengchen, S., 2018. Quantifying the impact of dirty OCR on historical text analysis: Eighteenth Century Collections Online as a case study, (Accepted/In press) In : Digital Scholarship in the Humanities : DSH.

Acknowledgements

This research is part of the Helsinki Computational History Group’s (COMHIS: https://www.helsinki.fi/en/comhis) larger project on ECCO and ESTC. We would like to thank Gale for providing our group with ECCO data. Special thanks go to Prof. Mäkelä for providing figures as well as drawing our attention to OCR accuracy estimations.

About

This public repository contains code used in: Hill, M. J. & Hengchen, S., 2018. Quantifying the impact of dirty OCR on historical text analysis: Eighteenth Century Collections Online as a case study, (Accepted/In press) In : Digital Scholarship in the Humanities : DSH.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published