Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
LeoPits authored Apr 15, 2022
1 parent 1c762df commit 2e11411
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -75,7 +75,7 @@ Pre-process cropped image example:
I use Tesseract OCR engine (https://tesseract-ocr.github.io/) with default page segmentation , the experiments show the LCDDot_FT_500.traineddata performs the best results in this case.
Two methods are used to control the label:
* **SequenceMatcher** is a class available in python module named *difflib*. It can be used for comparing pairs of input sequences. With the function *ratio( )* returns the similarity score ( float in [0,1] ) between input strings. It sums the sizes of all matched sequences returned by function.
* **Regular expression** is a class available in python module named *re*. The function *re.match()* checks for a match only at the beginning of the string
* **Regular expression** is a class available in python module named *re*. The function *re.match()* checks for a match only at the beginning of the string.

Saving all result in json file:
```
Expand Down

0 comments on commit 2e11411

Please sign in to comment.