    Repositories list

    • MinerU

      Public
      Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
      Python
      Other
      5.6k67k911 Updated Jun 8, 2026
    • labelU

      Public
      Open-source multimodal data annotation platform with AI auto-annotation support.
      Python
      Apache License 2.0
      1821.6k330 Updated Jun 8, 2026
    • A Python package for interacting with the MinerU Vision-Language Model.
      Python
      Apache License 2.0
      3612811 Updated Jun 5, 2026
    • Data annotation component library, provided as NPM packages
      TypeScript
      Apache License 2.0
      5115211 Updated Jun 2, 2026
    • Standardized tool schemas and SDKs that expose Sciverse Open Platform retrieval capabilities to LLM agents.
      Python
      Other
      22300 Updated May 28, 2026
    • .github

      Public
      3100 Updated May 27, 2026
    • opendatalab-datasets

      Public
      datasets resource
      1714440 Updated May 27, 2026
    • Python
      MIT License
      912520 Updated May 26, 2026
    • CiteVQA

      Public
      Python
      MIT License
      56800 Updated May 20, 2026
    • Python
      Apache License 2.0
      4853590 Updated May 13, 2026
    • Python
      Apache License 2.0
      1111641 Updated May 11, 2026
    • MinerU Training Camp course materials and tutorials
      Other
      41801 Updated May 11, 2026
    • Python
      Apache License 2.0
      1400 Updated May 8, 2026
    • [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation
      Python
      Apache License 2.0
      1761.8k1308 Updated May 6, 2026
    • Agent-native knowledge engine with MCP tools for document indexing, wiki organization, fast retrieval and deep reading across PDF/DOCX/PPTX/Markdown
      TypeScript
      MIT License
      6358140 Updated Apr 26, 2026
    • A diffusion-based framework for document OCR that replaces autoregressive decoding with block-level parallel diffusion decoding.
      Python
      MIT License
      3859671 Updated Apr 20, 2026
    • Vis3

      Public
      Data browser based on S3. A visualization tool for S3-hosted data (json / jsonl / parquet / html / md, etc.). 👇 Try online.
      TypeScript
      Apache License 2.0
      158700 Updated Apr 14, 2026
    • WebMainBench is a high-precision benchmark for evaluating web main content extraction.
      Python
      Apache License 2.0
      111611 Updated Apr 3, 2026
    • [ICLR 2026] The official implementation of the paper “Earth-Agent: Unlocking the Full Landscape of Earth Observation with Agents”
      Python
      MIT License
      21156101 Updated Apr 2, 2026
    • MinerU-HTML: An SLM-powered HTML main content extractor that outputs clean HTML bodies. Perfect for Deep Research Agents, RAG applications, and training data ge…
      Python
      Apache License 2.0
      2525710 Updated Mar 27, 2026
    • HTML
      Apache License 2.0
      21000 Updated Mar 25, 2026
    • VHM

      Public
      VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis
      Python
      Apache License 2.0
      911710 Updated Mar 25, 2026
    • [CVPR 2026] SOTA Chemical Reaction Diagram Parsing Framework
      Python
      Other
      22500 Updated Mar 24, 2026
    • LOKI

      Public
      [ICLR 2025 Spotlight] The official implementation of the paper “LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models”
      Python
      517940 Updated Feb 7, 2026
    • TRivia

      Public
      (CVPR 2026) TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition
      Python
      Apache License 2.0
      53430 Updated Feb 5, 2026
    • HTML
      1100 Updated Feb 2, 2026
    • Python
      1012130 Updated Jan 15, 2026
    • rdkit

      Public
      A forked repo of the official RDKit library
      HTML
      BSD 3-Clause "New" or "Revised" License
      1k000 Updated Jan 7, 2026
    • OHR-Bench

      Public
      (ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
      Python
      1410200 Updated Dec 3, 2025
    • 🕶️ A curated list of awesome things related to MinerU
      Python
      MIT License
      2810 Updated Nov 14, 2025