Sudoku-Solver

This program is designed to solve the Sudoku puzzle, taken from the camera. It uses openCV library for image processing, Keras for image recognition and it uses Backtracking algorithm for saving it.

If the git is not yet installed, check this tutorial how to install it https://www.atlassian.com/git/tutorials/install-git
Open the console and go to the directory, where you want to download files.
Use the command git clone https://github.com/ghribar97/Sudoku-Solver.git.
Files are ready. To run Sudoku-Solver check Quick start.

Download files

Go to the repository page https://github.com/ghribar97/Sudoku-Solver
Click a big green button "Clone or download" and choose "download ZIP" option.
After download you must unzip the files with a WinZip software, or any other program.
Files are ready. To run Sudoku-Solver check Quick start.

Quick start

To start a program go to the directory where the Sudoku-Solver files are located and execute this command:

$ python3 main.py

After the execution the program should start.

How does it work

Here, we will briefly look at how the program is working - or at least the most important things. For any details please see the code or contact one of the authors.

Extracting Sudoku field

The purpose of this step is to extract the outline of Sudoku field.

Lets first take a look at the image we want to process.

Image pre-processing

For smoothing out the noise a bit, lets first blur the image a bit.

cv2.GaussianBlur(image, (11, 11), 0)

Where the image is our original photo.

With the noise smoothed out, we can now threshold the image. The image can have varying illumination levels, so a good choice for a thresholding algorithm would be an adaptive threshold. It calculates a threshold level several small windows in the image. This threshold level is calculated using the mean level in the window. So it keeps things illumination independent.

cv2.adaptiveThreshold(image, 255, cv2.ADAPTIVE_THRESH_MEAN_C, cv2.THRESH_BINARY, 5, 2, dst=image)

It calculates a mean over a 5x5 window and subtracts 2 from the mean. This is the threshold level for every pixel.

Since we're interested in the borders, and they are black, we invert the image outerBox. Then, the borders of the puzzles are white (along with other noise).

cv2.bitwise_not(image)

This thresholding operation can disconnect certain connected parts (like lines). So dilating the image once will fill up any small "cracks" that might have crept in.

kernel = np.array([[0, 1, 0], [1, 1, 1], [0, 1, 0]], dtype=np.uint8)
cv2.dilate(image, kernel)

Read some more about dilation and erosion: https://www.geeksforgeeks.org/erosion-dilation-images-using-opencv-python/

Finding the biggest blob

First, we used the floodfill command. This command returns a bounding rectangle of the pixels it filled. We've assumed the biggest thing in the picture to be the puzzle. So the biggest blob should have be the puzzle.

area = cv2.floodFill(image, None, (x, y), 64)[0]

We iterate through the white pixels (>=128) of the image, which ensure that only the white parts are flooded. Whenever we encounter such a part, we flood it with a dark gray colour (gray level 64). So in the future, we won't be reflooding these blobs. When we encounter a blob, that has bigger area, than the previous maximum, we have to update maximum and get the point of it.

Now, we have several blobs filled with a dark gray colour (level 64). And we also know the point what produces a blob with maximum area. So we floodfill that point with white:

cv2.floodFill(image, None, max_point, (255, 255, 255))

Now, the biggest blob is white. We need to turn the gray color blobs black. We iterate through image again and use this command (if pixel is gray (=64)):

cv2.floodFill(image, None, (x, y), 0)

Which will color the blob to black.

Lets see the result now:

Detecting lines

For detecting lines, we used Hough Lines:

cv2.HoughLines(image, 1, np.pi / 180, 200)

Based on each line's rho and theta, we separated them on Horizontals and Verticals. For each of them we calculated the intersection point with others.

Now, when we have all of the points, we can get the most top-left, top-right, bottom-left and bottom-right point, that is white (it is on the blob).

Result

After we have the corner points, we must calculate the biggest distance between them. If the difference between distances is to big, we probably did not detect the complete Sudoku field (It should be square).

The we transformed the perspective to get straight field.

src = np.array([tl, tr, bl, br], np.float32)
dst = np.array([(0, 0), (distance-1, 0), (0, distance-1), (distance-1, distance-1)], np.float32)
matrix = cv2.getPerspectiveTransform(src, dst)
cv2.warpPerspective(original_image, matrix, (distance, distance))

tl, tr, bl, br are the corners of the biggest blob and the distance is the longest line between points. Original_image is the picture, taken form the camera, which has not been changed.

Lets see the result:

Extracting digits

The purpose of this step is to get the digit from an image.

After Extracting the Sudoku field we have a clear, straight field, so we can easily divide the field on 81 equal squares.

Like that:

We can easily extract the "small square" of the puzzle. Lets take first one for example:

Before continuing with recognition step, we must once again use some image processing.

We use similar technique as in Image pre-processing. Then we used this command:

contours = cv2.findContours(image, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)[1]
x, y, width, height = cv2.boundingRect(sorted(contours, key=cv2.contourArea)[-1])

Upper function returns the data about the biggest contour in the image (we assume this is the number). If the ratio between width and height is not good enough, the field does not contain any number. Otherwise, proceed with processing. Lets see what is the result of the picture now:

Or a little bit more extreme case:

The image still contains a part of the line. So, we had to crop it and extract the contour section.

Last step before recognizing, is to change the image to look more like the images from the training set. One last time we use:

cv2.bitwise_not(image)

to switch white and black pixels. For the end we put this image to a bigger square white image.

The image now, looks like that:

Finally, the image is now ready for recognizing.

Algorithm

How to Maximize accuracy

To achieve good results, make sure that you do the following things:

Image should be flat, not curved, so that the program would recognize field easier.
The Sudoku's field outline should be thick enough.
Try to capture puzzle as close to the camera as possible.
While taking a picture make sure, that there is an appropriate light.

Tests

To run the tests for this project go to Tests directory and run mainTester.py Run it with this command:

$ python3 mainTester.py

Test coverage is:

Authors

To the top!

Name		Name	Last commit message	Last commit date
Latest commit History 94 Commits
Algorithm		Algorithm
Common		Common
DataSet/data		DataSet/data
GUI		GUI
ImageProcessing		ImageProcessing
MachineLearning		MachineLearning
Model		Model
SamplePictures		SamplePictures
Tests		Tests
.gitignore		.gitignore
.travis.yml		.travis.yml
README.md		README.md
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Sudoku-Solver

Table of contents

Getting Started

Prerequisites

Python

TensorFlow

Keras

Pillow

Sudoku Solver code

Using git

Download files

Quick start

How does it work

Extracting Sudoku field

Image pre-processing

Finding the biggest blob

Detecting lines

Result

Extracting digits

Algorithm

How to Maximize accuracy

Tests

Authors

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

ghribar97/Sudoku-Solver

Folders and files

Latest commit

History

Repository files navigation

Sudoku-Solver

Table of contents

Getting Started

Prerequisites

Python

TensorFlow

Keras

Pillow

Sudoku Solver code

Using git

Download files

Quick start

How does it work

Extracting Sudoku field

Image pre-processing

Finding the biggest blob

Detecting lines

Result

Extracting digits

Algorithm

How to Maximize accuracy

Tests

Authors

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages