Skip to content

Add Support for Multiple OCR/LLM Models (Beyond OpenAI) #4

@mubashir-oss

Description

@mubashir-oss

Description:
Currently, the OCR service relies exclusively on a single model provider: OpenAI. To improve flexibility, we should extend support to integrate with other open-source or self-hostable models.


✅ Goals

  • Add support for open-source LLMs like:
    • llama.cpp-based models
    • Mistral and Anthropic models
  • Support Hugging Face-hosted models or local inference with transformers / vLLM
  • Add a configuration switch to select model backend (openai, local, hf, anthropic, etc.)

Priority: Medium to High

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions