From 2e11411b71196736ec92611ae9c53ef97711449a Mon Sep 17 00:00:00 2001 From: LeoPits Date: Fri, 15 Apr 2022 12:17:56 +0200 Subject: [PATCH] Update README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 6ef79d8..04e7fa1 100644 --- a/README.md +++ b/README.md @@ -75,7 +75,7 @@ Pre-process cropped image example: I use Tesseract OCR engine (https://tesseract-ocr.github.io/) with default page segmentation , the experiments show the LCDDot_FT_500.traineddata performs the best results in this case. Two methods are used to control the label: * **SequenceMatcher** is a class available in python module named *difflib*. It can be used for comparing pairs of input sequences. With the function *ratio( )* returns the similarity score ( float in [0,1] ) between input strings. It sums the sizes of all matched sequences returned by function. -* **Regular expression** is a class available in python module named *re*. The function *re.match()* checks for a match only at the beginning of the string +* **Regular expression** is a class available in python module named *re*. The function *re.match()* checks for a match only at the beginning of the string. Saving all result in json file: ```