WonkaVision

Project Overview

This project develops a computer vision model, named "Willy Vision," that detects different types of chocolate based on shape, texture, and flavor. Deployed on the FG company's production line, the system identifies over 60 types of chocolate from a catalog of 150.

What is Computer Vision?

Computer vision is a field within artificial intelligence that enables computers to interpret and understand the visual world. By using digital images and videos, along with deep learning models, the technology mimics human vision to identify, classify, and react to elements within visual data. This capability is pivotal in automating tasks that require visual recognition.

The Magic behind Willy Vision

Bounding Box Prediction:

In YOLO, each grid cell predicts multiple bounding boxes along with a confidence score for each one. The confidence score reflects how likely the box is to contain an object and how accurately the box fits that object.
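To make the per-cell output concrete, here is a minimal sketch of how one cell's raw prediction vector could be split into boxes and class scores. It assumes the layout of the original YOLO design (B boxes of five values each, followed by C class scores shared by the cell); later YOLO versions attach class scores to each box instead, and the layout Willy Vision actually uses is not documented here. The names BoxPrediction and split_cell_output are hypothetical.

```python
from typing import List, NamedTuple, Sequence, Tuple


class BoxPrediction(NamedTuple):
    """One predicted box: center, size, and confidence (hypothetical type)."""
    x: float
    y: float
    width: float
    height: float
    confidence: float


def split_cell_output(
    cell_vector: Sequence[float], boxes_per_cell: int, num_classes: int
) -> Tuple[List[BoxPrediction], List[float]]:
    """Split one cell's flat output into its boxes and its class scores.

    Assumed layout: [x, y, w, h, conf] repeated boxes_per_cell times,
    followed by num_classes class scores.
    """
    expected = boxes_per_cell * 5 + num_classes
    if len(cell_vector) != expected:
        raise ValueError(f"expected {expected} values, got {len(cell_vector)}")
    boxes = [
        BoxPrediction(*cell_vector[i * 5:(i + 1) * 5])
        for i in range(boxes_per_cell)
    ]
    class_scores = list(cell_vector[boxes_per_cell * 5:])
    return boxes, class_scores
```

With two boxes per cell and three classes, each cell vector holds 2 * 5 + 3 = 13 values.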

Grid Division:

YOLO divides an image into a grid (e.g., 13x13 cells). Each grid cell is responsible for detecting objects whose centers fall within its boundaries.
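As an illustration of the grid assignment, the sketch below maps an object's center (in pixels) to the row and column of the cell responsible for it. The function name responsible_cell and the 416x416 image size in the usage note are assumptions made for this example; only the 13x13 grid comes from the text above.

```python
from typing import Tuple


def responsible_cell(
    center_x: float,
    center_y: float,
    image_width: float,
    image_height: float,
    grid_size: int = 13,
) -> Tuple[int, int]:
    """Return (row, col) of the grid cell that contains the object center."""
    # Scale the center to grid units, then truncate to a cell index.
    col = int(center_x / image_width * grid_size)
    row = int(center_y / image_height * grid_size)
    # A center exactly on the right or bottom edge belongs to the last cell.
    col = min(col, grid_size - 1)
    row = min(row, grid_size - 1)
    return row, col
```

For a 416x416 image, a chocolate centered at (208, 104) falls in row 3, column 6 of the 13x13 grid.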

Class Prediction:

Simultaneously, each cell predicts class probabilities for each bounding box. This step uses a softmax function (or, in some YOLO versions, an independent sigmoid per class) to calculate the probability that the object belongs to each class.
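The softmax step can be sketched in a few lines of plain Python. This is a generic, numerically stable softmax rather than code taken from Willy Vision, and the raw scores in the usage note are made up for illustration.

```python
import math
from typing import List, Sequence


def softmax(logits: Sequence[float]) -> List[float]:
    """Turn raw class scores into probabilities that sum to 1."""
    # Subtracting the maximum avoids overflow in exp() without changing the result.
    max_logit = max(logits)
    exps = [math.exp(value - max_logit) for value in logits]
    total = sum(exps)
    return [value / total for value in exps]
```

Calling softmax([2.0, 1.0, 0.1]) gives the highest probability to the first class, and the three probabilities sum to 1.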

Non-max Suppression:

To avoid reporting several overlapping bounding boxes for the same object, YOLO applies a technique called non-max suppression. This step discards low-confidence boxes, then uses the Intersection over Union (IoU) metric to find boxes that overlap heavily and keeps only the highest-scoring box among them.
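A minimal sketch of greedy non-max suppression follows. Boxes are assumed to be (x1, y1, x2, y2) corner tuples, and the two threshold defaults are illustrative values, not settings taken from this project.

```python
from typing import List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2), assumed format


def iou(box_a: Box, box_b: Box) -> float:
    """Intersection over Union of two corner-format boxes."""
    inter_w = max(0.0, min(box_a[2], box_b[2]) - max(box_a[0], box_b[0]))
    inter_h = max(0.0, min(box_a[3], box_b[3]) - max(box_a[1], box_b[1]))
    intersection = inter_w * inter_h
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - intersection
    return intersection / union if union > 0 else 0.0


def non_max_suppression(
    boxes: Sequence[Box],
    scores: Sequence[float],
    iou_threshold: float = 0.5,
    score_threshold: float = 0.25,
) -> List[int]:
    """Return indices of the boxes to keep, highest score first."""
    # Drop low-confidence boxes, then visit the rest from best to worst.
    candidates = sorted(
        (i for i, score in enumerate(scores) if score >= score_threshold),
        key=lambda i: scores[i],
        reverse=True,
    )
    kept: List[int] = []
    for i in candidates:
        # Keep a box only if it does not overlap too much with a better one.
        if all(iou(boxes[i], boxes[k]) <= iou_threshold for k in kept):
            kept.append(i)
    return kept
```

Given two heavily overlapping boxes on the same chocolate, only the one with the higher score survives; boxes on other chocolates are unaffected.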

Combining Results:

The bounding boxes and class predictions are combined to create the final output, which includes the positions, dimensions, and class labels of all detected objects.
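The combination step might look like the sketch below: each surviving box takes its most probable class, and the final score is the box confidence multiplied by that class probability. The Detection type, the combine_predictions function, the min_score default, and the class names in the usage note are all hypothetical; the actual output format of Willy Vision is not described here.

```python
from typing import List, NamedTuple, Sequence, Tuple


class Detection(NamedTuple):
    """Final output for one object (hypothetical structure)."""
    label: str
    score: float
    x: float       # left edge of the box
    y: float       # top edge of the box
    width: float
    height: float


def combine_predictions(
    boxes: Sequence[Tuple[float, float, float, float]],  # (x1, y1, x2, y2)
    confidences: Sequence[float],
    class_probs: Sequence[Sequence[float]],
    class_names: Sequence[str],
    min_score: float = 0.25,
) -> List[Detection]:
    """Merge boxes, confidences, and class probabilities into detections."""
    detections = []
    for (x1, y1, x2, y2), confidence, probs in zip(boxes, confidences, class_probs):
        # Pick the most probable class for this box.
        best = max(range(len(probs)), key=lambda i: probs[i])
        score = confidence * probs[best]
        if score >= min_score:
            detections.append(
                Detection(class_names[best], score, x1, y1, x2 - x1, y2 - y1)
            )
    return detections
```

For example, a box with confidence 0.9 whose best class is "truffle" at probability 0.8 is reported as a "truffle" detection with score 0.72.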

Training Details

Go to the ModelWeitgh folder for detailed graphs and pictures of the training runs.

Usage

To run the "Willy Vision" detection model on your local machine, follow these steps:

Set Up Your Environment:

Ensure Python is installed, along with the required libraries (opencv-python-headless, ultralytics).
Install the dependencies: pip install opencv-python-headless ultralytics

Run the Detection:

Navigate to the project directory.
Execute the script: python3 predict_video.py

View the Output:

The script displays the video with detected chocolates and saves an output file.
