This Text Detection App uses Python libraries such as Tesseract, OpenCV, and Pillow to extract and process text from images. The app can handle simple text extraction as well as more complex image processing tasks to improve text detection accuracy. Additionally, a Flask web application is included to provide a user-friendly interface for the text detection functionality.
- Extract text from an image using Tesseract OCR.
- Remove irrelevant symbols from extracted text.
- Perform various image preprocessing operations using OpenCV:
- Grayscale conversion
- Noise removal
- Thresholding
- Erosion
- Morphology operations
- Canny edge detection
- Skew correction
- Template matching
- Draw rectangles around detected text.
- Highlight specific words or patterns in the image.
- Python 3.x
- Requests
- Pillow
- pytesseract
- OpenCV
- numpy
- re
- Flask
- Install Tesseract OCR from here. Ensure that Tesseract is added to your system path.
- Install the required Python packages:
pip install requests pip install pillow pip install pytesseract pip install opencv-python pip install numpy pip install flask
- Ensure Tesseract is correctly installed and its path is configured in the script.
- Navigate to the folder containing the app:
cd path/to/your/app
- Run the Flask application:
python app.py
- Open a web browser and go to
http://127.0.0.1:5000/
to access the application.