Skip to content

1.3.0

Compare
Choose a tag to compare
@TimKoornstra TimKoornstra released this 14 Nov 12:19
· 384 commits to master since this release
7c1d8ac

Release Notes for Loghi-HTR Version 1.3.0

Date: 2023-11-14

Overview

In version 1.3.0, we've introduced significant improvements, including enhanced normalization features for CER and CER lower, a simplified confidence interval, and various API enhancements. Fixes have been made to the ResidualBlock implementation and freezing mechanism, and models now automatically save in the new .keras file format. Several changes have also been made to the API to improve usability and performance.

New Features

  • Normalization for CER and CER Lower: Added functionality to normalize for Character Error Rate (CER) and its lower case version using the --normalization_file argument. This update also displays the ground truth and prediction in a normalized form.
  • Simplified Confidence Interval: Introduced a more straightforward method for calculating confidence intervals.

Enhancements

  • Model File Format: Models now automatically get saved in the new .keras file format, while still supporting loading of both .pb and .keras files.

Bug Fixes

  • ResidualBlock Implementation Fix: Addressed an issue where saving a model and then continuing training was not working properly.
  • ResidualBlock Freezing Fix: Corrected the freezing of convolutional layers in the residual blocks with --freeze_conv_layers.

API Specific Changes

  • Environment Variable Simplifications: Removed the necessity of LOGHI_INPUT_CHANNELS and LOGHI_CHARLIST_PATH environment variables, which are now read directly from the model's config.json and charlist.txt respectively.
  • Reduced OOM Errors: Enhanced batch processing to split recursively on Out-Of-Memory (OOM) errors, failing only the problematic image instead of the entire batch.
  • Improved Image Padding: Adjusted image padding during processing for better alignment with training, marginally improving confidence and output.
  • Dynamic Model Switching in API: Introduced the ability to switch models during an API call using the "model" field, though it's advised to use caution as it can slow down inference.
  • Error Output for Failed Predictions: Text line images that fail during prediction are now outputted to LOGHI_OUTPUT_PATH/group_id/identifier.error with the specific error message.

Contributors

  • @Thelukepet: Major contributions to normalization for CER and CER lower, and the simplified confidence interval.
  • @TimKoornstra: Significant contributions across various aspects including bug fixes, API enhancements, and overall improvements.

Full changelog: 1.2.10...1.3.0