@tarun-menta tarun-menta commented Jun 27, 2025

Integrates the new foundation model. Major changes:

  • Moving the foundation model into its own reusable class. The prediction loop and cache now use static shapes with our custom sliding-window logic
  • Refactoring layout and recognition to use the foundation predictor
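
For readers unfamiliar with the idea, here is a minimal sketch of what a static-shape cache with a sliding window can look like. It is not the code from this PR: the class name `SlidingWindowCache`, its fields, and the use of NumPy are all assumptions made for illustration. The buffer is allocated once at a fixed size; once a row is full, the oldest entry is dropped and the newest is written to the last slot.

```python
import numpy as np


class SlidingWindowCache:
    """Hypothetical static-shape cache; the buffer is never resized."""

    def __init__(self, batch_size: int, window: int, dim: int):
        self.window = window
        # Allocated once: (batch, window, dim) stays fixed for the whole run.
        self.buf = np.zeros((batch_size, window, dim), dtype=np.float32)
        # Number of valid entries per batch row.
        self.lengths = np.zeros(batch_size, dtype=np.int64)

    def append(self, new: np.ndarray) -> None:
        """Add one new entry per batch row; `new` has shape (batch, dim)."""
        for b in range(self.buf.shape[0]):  # batch-wise loop
            n = int(self.lengths[b])
            if n < self.window:
                # Row not full yet: write into the next free slot.
                self.buf[b, n] = new[b]
                self.lengths[b] = n + 1
            else:
                # Row full: slide left by one and overwrite the last slot.
                self.buf[b, :-1] = self.buf[b, 1:].copy()
                self.buf[b, -1] = new[b]


# Usage: five steps into a window of three keeps only the last three.
cache = SlidingWindowCache(batch_size=2, window=3, dim=1)
for t in range(1, 6):
    cache.append(np.full((2, 1), t, dtype=np.float32))
```

Keeping the shape fixed is what typically lets a prediction loop avoid reallocation and recompilation between steps; the per-row Python loop in `append` is the simple but slow part of this sketch.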

TODOs:

  • Implement multi-token prediction
  • The cache logic is slow because it loops over the batch one row at a time. This needs to be vectorized
  • Surface top-k for layout
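
As a rough illustration of the vectorization TODO, the sketch below updates a fixed-shape sliding-window buffer for all batch rows at once instead of looping over them. It is only one possible approach under stated assumptions: the function name `append_vectorized`, the `(batch, window, dim)` layout, the per-row `lengths` array, and the use of NumPy are hypothetical and not taken from this PR.

```python
import numpy as np


def append_vectorized(buf: np.ndarray, lengths: np.ndarray, new: np.ndarray) -> None:
    """Append one entry per batch row without a Python loop (in place).

    buf:     (batch, window, dim) fixed-size cache
    lengths: (batch,) number of valid entries in each row
    new:     (batch, dim) entry to append to each row
    """
    batch, window = buf.shape[0], buf.shape[1]
    full = lengths >= window
    # Rows that are already full slide left by one slot.
    # Boolean indexing on the right-hand side returns a copy, so the
    # overlapping source and destination do not interfere.
    buf[full, :-1] = buf[full, 1:]
    # Write position: next free slot, or the last slot for full rows.
    pos = np.minimum(lengths, window - 1)
    buf[np.arange(batch), pos] = new
    # Lengths saturate at the window size.
    lengths[:] = np.minimum(lengths + 1, window)


# Usage: row 0 is empty, row 1 is already full.
buf = np.zeros((2, 3, 1), dtype=np.float32)
buf[1, :, 0] = [10.0, 11.0, 12.0]
lengths = np.array([0, 3], dtype=np.int64)
append_vectorized(buf, lengths, np.array([[1.0], [99.0]], dtype=np.float32))
```

The masked shift and the indexed write handle partially filled and full rows in the same call, so the cost per step no longer grows with a Python-level loop over the batch.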

@tarun-menta tarun-menta marked this pull request as draft June 27, 2025 17:25
@github-actions github-actions bot locked and limited conversation to collaborators Jul 29, 2025
@VikParuchuri VikParuchuri reopened this Jul 29, 2025
3 participants