Layout Document AI
is a tool designed to process documents using Google Cloud's Document AI, preserving the original layout. It reads JSON files generated by Document AI, processes the text while maintaining the layout, and outputs the text into .txt
files.
- Process Document AI JSON outputs
- Preserves the original layout of the document.
- Generate formatted text files
- Install dependencies:
pnpm install
-
Place your Document AI JSON files in the
document-ai-json
directory. -
Run the main script to process the documents:
pnpm run dev
- The processed text files will be saved in the
document-ai-text
directory.
- Fork the repository.
- Create a new branch (
git checkout -b feature-branch
). - Make your changes.
- Commit your changes (
git commit -m 'Add some feature'
). - Push to the branch (
git push origin feature-branch
). - Open a pull request.
This project is licensed under the ISC License. See the LICENSE file for details.
Thierry Santos - [email protected]
- Google Cloud Document AI for providing the document processing capabilities.