Skip to content

Extend Support for Multiple File Types in OCR/Extraction Service #3

@mubashir-oss

Description

@mubashir-oss

📂 Supported File Types (Current)

The service currently supports extracting text from the following file types:

  1. PDF
  2. DOCX
  3. PPTX

🎯 Goals: Extend File Type Support

  • Add support for .DOC
  • Add support for .XLSX and .XLS (Excel formats)
  • Add support for .ODT, .ODS, .ODP (OpenDocument formats)
  • Add support for plain text .TXT and rich text .RTF
  • Add support for .EPUB (ebook format)

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions