-
Notifications
You must be signed in to change notification settings - Fork 206
Open
Description
ideally i want to edit the hocr files like
gimagereader-qt6 001.hocr 001.jpg gimagereader-qt6 002.jpg 002.hocr 003.jpg 003.hocr
this already works with .html
files
but .hocr
files are ignored
gimagereader-qt6 001.hocr.html
gimagereader-qt6 002.hocr.html 003.hocr.html
extra image files are counted as separate pages
but the page images referenced in the hocr files are used
<div class='ocr_page' id='page_1' title='image "001.tiff"; bbox ...'>
Would actually be trivial to also allow the
.hocr
file extension, but I'm not sure that's actually a standardized extension?
sounds like youre waiting for the central committee of file extensions
to allow this use case... ; )see also kba/hocr-spec#115
Metadata
Metadata
Assignees
Labels
No labels