📊 RapidTable4j

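Introduction

The RapidTable library restores the structure of tables in document images. Its table structure models are all sequence-prediction models; combined with RapidOCR, it converts a table in a given image into the corresponding HTML.

This project is the Java port of RapidTable. It uses ONNXRuntime + OpenCV together with RapidOCR4j to perform table recognition.

slanet_plus is an upgraded version of SLANet that ships with paddlex and is substantially more accurate.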
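
To make the sequence-prediction idea from the introduction concrete, here is a minimal sketch of how a predicted sequence of structure tokens could be combined with recognized cell texts to produce HTML. It is not RapidTable4j's implementation: the class name `StructureToHtml`, the method `assemble`, and the simplified token set (`<tr>`, `<td></td>`, `</tr>`) are assumptions made for illustration only.

```java
import java.util.List;

public class StructureToHtml {
    /**
     * Joins structure tokens into an HTML table. Each "<td></td>" token is
     * filled with the next cell text in reading order; all other tokens are
     * copied through unchanged. Cell text is not HTML-escaped in this sketch.
     */
    public static String assemble(List<String> tokens, List<String> cellTexts) {
        StringBuilder html = new StringBuilder("<table>");
        int cell = 0;
        for (String token : tokens) {
            if (token.equals("<td></td>")) {
                // Use an empty cell when there are fewer texts than cells.
                String text = cell < cellTexts.size() ? cellTexts.get(cell) : "";
                html.append("<td>").append(text).append("</td>");
                cell++;
            } else {
                html.append(token);
            }
        }
        return html.append("</table>").toString();
    }

    public static void main(String[] args) {
        // Hypothetical model output for a 2x2 table.
        List<String> tokens = List.of(
                "<tr>", "<td></td>", "<td></td>", "</tr>",
                "<tr>", "<td></td>", "<td></td>", "</tr>");
        List<String> texts = List.of("Name", "Qty", "Apple", "3");
        System.out.println(assemble(tokens, texts));
    }
}
```

In the real pipeline the structure model also predicts cell positions, which are used to decide which OCR text belongs to which cell; the sketch simply assumes the texts already arrive in cell order.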

Demo

Model list and download link

`model_type`	Model name	Inference framework	Model size	Inference time (single 60 KB image)
`ppstructure_en`	`en_ppstructure_mobile_v2_SLANet.onnx`	onnxruntime	7.3M	0.15s
`ppstructure_zh`	`ch_ppstructure_mobile_v2_SLANet.onnx`	onnxruntime	7.4M	0.15s
`slanet_plus`	`slanet-plus.onnx`	onnxruntime	6.8M	0.15s

Model sources
PaddleOCR table recognition
PaddleX-SlaNetPlus table recognition
Model download link: link

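Installation

Because the model is small, the slanet-plus table recognition model (slanet-plus.onnx) is bundled inside the jar. To use any other model, specify its path through the modelPath field of TableConfig when initializing the RapidTable class. Note that only the currently supported TableModelType values can be used.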
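
The fallback rule described above (use the bundled model unless a custom path is given) can be sketched as follows. This is an illustration under stated assumptions, not code from RapidTable4j: the class `ModelPathSketch`, the method `resolve`, and the `classpath:` prefix are hypothetical, and the library's own TableConfig handling may differ.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

public class ModelPathSketch {
    // File name of the model that the README says is bundled in the jar.
    static final String BUNDLED_MODEL = "slanet-plus.onnx";

    /**
     * Returns the model location to load: the bundled default when no path
     * is configured, otherwise the absolute path of the custom model file.
     */
    public static String resolve(String modelPath) {
        if (modelPath == null || modelPath.isBlank()) {
            // Hypothetical marker meaning "load from inside the jar".
            return "classpath:" + BUNDLED_MODEL;
        }
        Path path = Paths.get(modelPath);
        if (!Files.isRegularFile(path)) {
            throw new IllegalArgumentException(
                    "Model file not found: " + path.toAbsolutePath());
        }
        return path.toAbsolutePath().toString();
    }

    public static void main(String[] args) {
        // No custom path configured, so the bundled model is selected.
        System.out.println(resolve(null));
    }
}
```

Failing early on a missing file gives a clearer error than letting the inference runtime fail later while opening the model.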

🛠️ TODO:

Support the newly open-sourced SLANet-wired and SLANet-wireless models (SLANet)

🎉 Quick start

Add the dependency; the CPU version is used by default.

<dependency>
    <groupId>io.github.hzkitty</groupId>
    <artifactId>rapid-table4j</artifactId>
    <version>1.0.0</version>
</dependency>

Usage example

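// Create the table engine and the OCR engine with their default configuration
RapidTable tableEngine = RapidTable.create();
RapidOCR rapidOCR = RapidOCR.create();

// Run OCR on the image, then pass the recognition results to the table engine
File file = new File("src/test/resources/table_01.jpg");
String imgContent = file.getAbsolutePath();
OcrResult ocrResult = rapidOCR.run(imgContent);
TableResult tableResult = tableEngine.run(imgContent, ocrResult.getRecRes());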
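
The usage example passes the OCR recognition results into the table engine, which then has to decide which recognized text belongs to which table cell. One common way to do this is to match boxes by intersection-over-union (IoU). The sketch below shows that idea only; it is an assumption for illustration, not RapidTable4j's matching code, and the names `BoxMatchSketch`, `iou`, and `bestCell` are hypothetical.

```java
public class BoxMatchSketch {
    /** IoU of two axis-aligned boxes, each given as {x1, y1, x2, y2}. */
    public static double iou(double[] a, double[] b) {
        double w = Math.min(a[2], b[2]) - Math.max(a[0], b[0]);
        double h = Math.min(a[3], b[3]) - Math.max(a[1], b[1]);
        if (w <= 0 || h <= 0) {
            return 0.0; // the boxes do not overlap
        }
        double inter = w * h;
        double areaA = (a[2] - a[0]) * (a[3] - a[1]);
        double areaB = (b[2] - b[0]) * (b[3] - b[1]);
        return inter / (areaA + areaB - inter);
    }

    /** Index of the cell with the highest IoU against textBox, or -1 if none overlaps. */
    public static int bestCell(double[] textBox, double[][] cells) {
        int best = -1;
        double bestIou = 0.0;
        for (int i = 0; i < cells.length; i++) {
            double score = iou(textBox, cells[i]);
            if (score > bestIou) {
                bestIou = score;
                best = i;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // Two hypothetical cells side by side and one OCR text box.
        double[][] cells = { {0, 0, 100, 30}, {100, 0, 200, 30} };
        double[] textBox = {110, 5, 180, 25};
        System.out.println(bestCell(textBox, cells));
    }
}
```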

To use the GPU, the matching onnxruntime_gpu version can be found here.

<dependency>
    <groupId>io.github.hzkitty</groupId>
    <artifactId>rapid-table4j</artifactId>
    <version>1.0.0</version>
    <exclusions>
      <exclusion>
        <groupId>com.microsoft.onnxruntime</groupId>
        <artifactId>onnxruntime</artifactId>
      </exclusion>
    </exclusions>
</dependency>

<!-- 1.18.0 supports CUDA 12.x -->
<dependency>
    <groupId>com.microsoft.onnxruntime</groupId>
    <artifactId>onnxruntime_gpu</artifactId>
    <version>1.18.0</version>
</dependency>

Acknowledgements

RapidTable

License

Released under the Apache License 2.0.
