Skip to content

feat: support MinerU official cloud API for PDF parsing #180

@wyuc

Description

@wyuc

Currently the MinerU provider only supports self-hosted instances (POST {baseUrl}/file_parse), which requires a GPU server.

MinerU now has an official cloud API at https://mineru.net/apiManage/docs with free tier (2000 pages/day). It uses a different async task model (POST /api/v4/extract/task → poll → download), so a new mineru-cloud provider is needed alongside the existing self-hosted one.

This would make MinerU's advanced PDF parsing (tables, formulas, layout analysis) accessible without any infrastructure setup.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions