pdfix/pdfix_sdk_builds

Getting Started with PDFix SDK

PDFix SDK is a cross-platform PDF processing tool that offers multiple integration options. The Command-Line Interface (PDFix CLI) provides a quick and easy way to add PDF processing functionality to various workflows without writing code.

Developers can leverage the SDK with different programming languages for full control over PDF documents.

Before getting started, check the system requirements to ensure compatibility with your system.

Integration Using the CLI

PDFix offers straightforward, rapid, and automated PDF processing via the command-line interface:

$ ./pdfix_app make-accessible -i test.pdf -o output.pdf

For additional CLI options, refer to the PDFix SDK Command-Line Interface documentation. The CLI application is included in the downloadable package available on the PDFix website.
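The command above can also be launched from a script when PDF processing is one step in a larger workflow. The following is a minimal Python sketch that uses only the `make-accessible` command and the `-i`/`-o` options shown above. The function names are hypothetical, and treating a non-zero exit code as a failure is an assumption about how the CLI reports errors, not documented behavior.

```python
import subprocess


def build_make_accessible_command(app, input_pdf, output_pdf):
    """Return the argument list for the make-accessible command shown above."""
    # Hypothetical helper: only the command name and the -i/-o options
    # come from this README; everything else is illustrative.
    return [str(app), "make-accessible", "-i", str(input_pdf), "-o", str(output_pdf)]


def make_accessible(app, input_pdf, output_pdf):
    """Run the CLI once and return the completed process.

    Assumption: the CLI signals failure with a non-zero exit code, in which
    case subprocess.run raises CalledProcessError because check=True.
    """
    command = build_make_accessible_command(app, input_pdf, output_pdf)
    return subprocess.run(command, check=True, capture_output=True, text=True)
```

Calling `make_accessible("./pdfix_app", "test.pdf", "output.pdf")` would run the same command as the shell example, with the captured output available on the returned object.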

Integrating the SDK Programmatically

To integrate the PDFix SDK programmatically, refer to the code examples on GitHub for your preferred programming language:

  • C++ – For native applications
  • .NET – For .NET Framework, .NET Core, and .NET 5+
  • Java – For Maven or Gradle projects
  • Python – For Python applications
  • JavaScript – For frameworks like Node.js, React.js, Angular, and similar

How to

Fix Accessibility Issues

To fix PDF/UA compliance issues in a PDF document, use the methods available in:

  • PDFix Actions for Accessibility – flexible PDF manipulation without coding
  • SDK API methods to access and edit:
    • Structure Tree and its Elements
    • Page Objects and their Content Marks
    • Annotations
    • Document Metadata

Extract PDF Content

To extract data from a PDF document, convert it to JSON. The data extraction methods can provide:

  • Raw Document Extraction to access
    • Document Metadata, Form Fields, and classification such as tagged, signed, secured
    • Page Size, Rotation, Annotations, Content including text content, images, positions, colors
  • Layout Recognition to access the logical content such as
    • Paragraphs, Headings, Figures, Tables, Headers, Footers
  • Document Structure Tags to access
    • Complete document structure tree with element properties, attributes, position, and content

Multi-Threaded Environments

PDFix SDK operates in a single-threaded manner, allowing only one API method to run at a time within a single process. Any additional method calls from other threads will be queued until the current operation finishes.

For parallel processing, use separate processes rather than threads.
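One way to follow this advice is to start a separate operating-system process for each document. The sketch below is a hedged illustration rather than an official recipe: the function name is hypothetical, and it assumes that several CLI instances can safely run side by side on the same machine, which this document does not confirm. The Python threads in the pool only wait for their child processes; the actual work happens in independent processes.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor


def run_commands_in_processes(commands, max_workers=4):
    """Run each command as its own OS process; return exit codes in input order."""

    def run_one(command):
        # Each call starts an independent child process, so no two
        # commands share a single process.
        return subprocess.run(command, capture_output=True, text=True).returncode

    # The worker threads only block on their child process; they do no
    # PDF processing themselves.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(run_one, commands))
```

Each entry in `commands` would be an argument list such as `["./pdfix_app", "make-accessible", "-i", "a.pdf", "-o", "a_out.pdf"]`, where the file names are placeholders. Limiting `max_workers` keeps the number of simultaneous processes bounded.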

Licensing

Trial/Lite License Limitations

  • Extracted text may have characters randomly replaced with "*".
  • Saved PDFs may contain redacted content with a watermark.

Prerequisites

Support

For any questions, visit our Support Page.