Computer vision Experiments

First experiments using OpenCV and PyTorch.

DISCLAIMER: This is at a very early stage so there are many things to improve. (More on the section Things to improve.)

The idea of this repo is to test out some computer vision techniques to later use them as a tool/controller for the robot arm I am developing.

Robot Arm Repository

The main objectives to solve are:

Given a static camera with a static background, detect new objects that enter it's FOV (field of view).
Quick way to define types and new objects to detect (Be able to train the model only with data gathered within a few minutes of a training prodecure).
Classify objects mentioned in (1) using the gathered data mentioned in (2).

For the first aproach the following workflow was used. This is divided in to main sections.

Training sequence:

Define the background of the inital scene. (Startup the camera without any objects in it's FOV)
Define the objects the model should be able to detect and classify. For each object place it individually in the camere's FOV and take pictures of it in various positions and orientations (using the same cropping and ROI described in "Running the object detection".
Use data augmentation techniques to increase the size of the training data and improve generalization.
Train the model used with this gathered data.

Running the object detection:

Define the background of the inital scene. (Startup the camera without any objects in it's FOV)
For each new frame detect the difference between it and the background. The countours created with this difference will be the detected objects.
Determine bounding boxes for each object detectec and extract a region of interest from it (ROI).
Use a CNN (in this case a modified version of DenseNet-121 implemented using pytorch), to classifie each ROI
Output the bounding boxes and the prediccion to the output frame (screen live camera)

About the CNN model used.

A modified version of DenseNet-121 was implemented using pytorch. This model takes as an input images (RGB) of size 32 (3,32,32) used to classify the several classes defined in "Training sequence"

Sample dataset examples

Some examples.

Object segmentation process.

Various objects in the same frame.

Things to improve

The object detection algorithm (the way countours are found).
The data augmentation for the creation of new datasets.
How the ROI's are processed and sent to the classifier model. (CNN)
Many others ... (The ones just mentioned are the most important at the moment.)

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
notebooks		notebooks
src		src
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Computer vision Experiments

First experiments using OpenCV and PyTorch.

DISCLAIMER: This is at a very early stage so there are many things to improve. (More on the section Things to improve.)

For the first aproach the following workflow was used. This is divided in to main sections.

Training sequence:

Running the object detection:

About the CNN model used.

Sample dataset examples

Some examples.

Object segmentation process.

Various objects in the same frame.

Things to improve

About

Languages

alberto-abarzua/computer_vision_experiments

Folders and files

Latest commit

History

Repository files navigation

Computer vision Experiments

First experiments using OpenCV and PyTorch.

DISCLAIMER: This is at a very early stage so there are many things to improve. (More on the section Things to improve.)

For the first aproach the following workflow was used. This is divided in to main sections.

Training sequence:

Running the object detection:

About the CNN model used.

Sample dataset examples

Some examples.

Object segmentation process.

Various objects in the same frame.

Things to improve

About

Topics

Resources

Stars

Watchers

Forks

Languages