Crash/BSOD on Windows 11 when using Hebrew #39

do2or · 2023-12-11T13:41:46Z

Problem:

dpscreenocr uses Tesseract to recognize text on the screen. However, Tesseract has known issues with recognizing text from images captured by Intel UHD 600 graphics cards (like BSOD).

Solution:

Add support for Tesseract-OCR-DNN to dpscreenocr. Tesseract-OCR-DNN is a deep learning-based OCR engine that is not affected by the same issues as Tesseract.

Benefits:

Improved accuracy for recognizing text from images captured by Intel UHD600 graphics cards.
Increased flexibility for users to choose the best OCR engine for their needs.
Implementation:

Add a new configuration option to dpscreenocr that allows users to choose between using Tesseract or Tesseract-OCR-DNN.
Update the code to use Tesseract-OCR-DNN when the new configuration option is set to "Tesseract-OCR-DNN".

Conclusion:

Adding support for Tesseract-OCR-DNN to dpscreenocr would be a valuable addition to the program. It would provide users with improved accuracy and flexibility, making it a more powerful tool for recognizing text on the screen.

danpla · 2023-12-11T15:15:50Z

Tesseract has known issues with recognizing text from images captured by Intel UHD 600 graphics cards

I believe Tesseract itself doesn't care how an image was captured. Could you please provide more context (e.g., an example image or a link to the ticket on the Tesseract bug tracker)?

Or maybe you mean that dpScreenOCR has issues capturing images on machines with Intel UHD 600?

Tesseract-OCR-DNN is a deep learning-based OCR engine that is not affected by the same issues as Tesseract.

Googling for "Tesseract-OCR-DNN" gives no results. BTW, Tesseract already uses deep learning (LSTM).

do2or · 2023-12-11T19:03:43Z

My problem is that every time I use this software, the computer shuts down or i get BSOD system crash,
and for sure it is caused by the incompatibility of the video card with Tesseract

this is the instructions to build from source:

Clone the Tesseract OCR Engine repository from [GitHub](https://github.com/tesseract-ocr/tesseract) using the following command:
git clone https://github.com/tesseract-ocr/tesseract.git
Install the dependencies required to build Tesseract. A C++ compiler with good C++17 support is required for building Tesseract from source. You can find the list of dependencies and installation instructions in the [README.md](https://github.com/tesseract-ocr/tesseract/blob/master/README.md) file of the repository.
Once you have installed the dependencies, navigate to the root directory of the cloned repository and run the following commands:
./autogen.sh
./configure --enable-build-type=Release --enable-shared --disable-static --with-protoc=/usr/local/bin/protoc
make -j4
After the build process is complete, you can find the tesseract_ocr_dnn binary in the src/api directory.

danpla · 2023-12-11T19:30:44Z

My problem is that every time I use this software, the computer shuts down or i get BSOD system crash,

So you are using Windows? Could you please tell your Windows and dpScreenOCR versions, and provide the steps to reproduce the crash/shutdown?

and for sure it is caused by the incompatibility of the video card with Tesseract

That's unlikely: Tesseract doesn't use GPU acceleration.

Clone the Tesseract OCR Engine repository from [GitHub](https://github.com/tesseract-ocr/tesseract)

The Tesseract library used by dpScreenOCR on Windows is already built from the official source code from https://github.com/tesseract-ocr/tesseract.

do2or · 2023-12-11T19:37:08Z

my Windows version: Windows 11
dpScreenOCR version: 1.4.1
The procedure that causes the problem is when I use the software and capture some image, sometimes it turns off immediately and sometimes after a few minutes or crashes with a blue screen

danpla · 2023-12-11T19:42:45Z

What languages do you use for OCR? Were they installed from the language manager or elsewhere?

do2or · 2023-12-11T19:45:39Z

Language: hebrew
and also this Tessdata
https://gitlab.com/pninim.org/tessdata_heb_rashi/-/tree/main/tesseract_4.1.1?ref_type=heads

danpla · 2023-12-11T19:53:27Z

Does using only English also cause a crash?

do2or · 2023-12-11T20:00:58Z

I didn't try the English language,
but I search for support in Google AI and it says there is a known incompatibility issues between the Intel UHD GPU and Tesseract 5
i try this GImageReader and work perfect and don't cause any bsod

danpla · 2023-12-11T23:02:15Z

I don't currently have access to Windows 11, but I was unable to reproduce the issue with Windows 7 and 10. I have tried both the official Hebrew data and the files from https://gitlab.com/pninim.org/tessdata_heb_rashi/-/tree/main/tesseract_4.1.1.

It would help me if you could:

Check if the crash happens when using only the English language.
Check if the issue is reproducible in dpScreenOCR 1.1.0: https://github.com/danpla/dpscreenocr/releases/download/v1.1.0/dpScreenOCR-1.1.0-win32.zip

do2or · 2023-12-12T18:52:55Z

Apparently version 1.1.0 does not cause any problem, but no I don't know for sure yet,
I don't think that this issue it's related to English or Hebrew languages; I think that this issue is related to Tesseract 5

danpla changed the title ~~Proposal to Add Support for Tesseract-OCR-DNN~~ Crash/BSOD on Windows 11 when using Hebrew Dec 11, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Crash/BSOD on Windows 11 when using Hebrew #39

Crash/BSOD on Windows 11 when using Hebrew #39

do2or commented Dec 11, 2023 •

edited

Loading

danpla commented Dec 11, 2023 •

edited

Loading

do2or commented Dec 11, 2023 •

edited

Loading

danpla commented Dec 11, 2023 •

edited

Loading

do2or commented Dec 11, 2023

danpla commented Dec 11, 2023

do2or commented Dec 11, 2023

danpla commented Dec 11, 2023

do2or commented Dec 11, 2023 •

edited

Loading

danpla commented Dec 11, 2023

do2or commented Dec 12, 2023

Crash/BSOD on Windows 11 when using Hebrew #39

Crash/BSOD on Windows 11 when using Hebrew #39

Comments

do2or commented Dec 11, 2023 • edited Loading

danpla commented Dec 11, 2023 • edited Loading

do2or commented Dec 11, 2023 • edited Loading

danpla commented Dec 11, 2023 • edited Loading

do2or commented Dec 11, 2023

danpla commented Dec 11, 2023

do2or commented Dec 11, 2023

danpla commented Dec 11, 2023

do2or commented Dec 11, 2023 • edited Loading

danpla commented Dec 11, 2023

do2or commented Dec 12, 2023

do2or commented Dec 11, 2023 •

edited

Loading

danpla commented Dec 11, 2023 •

edited

Loading

do2or commented Dec 11, 2023 •

edited

Loading

danpla commented Dec 11, 2023 •

edited

Loading

do2or commented Dec 11, 2023 •

edited

Loading