Skip to content

关于训练数据需要ocr结果的思考 #9

@whalefa1I

Description

@whalefa1I

tab_pre.py代码中表述的可能是合并单元格后单元格内部换行,参考:.\pubtabnet\train\PMC1626454_002_00.png
image

通过横向投影直方图确定有几个H_Start,如果不为1才要进行后续处理,所以可能是这个思路

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions