Custom classification model design #185

MinhxNguyen7 · 2024-05-01T19:36:56Z

Description

Design of the custom classification model, second shot in the new pipeline (#179). This model will classify the shape, character, shape color, and character color in one go, given the shape crop.

I'd like help with working through the ideas to design the model.

Design

Considerations

~~The model should be performant, ideally more so than YOLOv8n-det.~~
Something of a similar speed to YOLOv8n-det is acceptable, considering we currently process at ~2 fps and only ~1 fps should be necessary.
Our datasets are not big and not that diverse. My hypothesis for our low real-world generalization performance is that our targets are too similar, perfect, and high-contrast compared to real-world data.
Our target crops will be of somewhat different sizes, and we should think about resizing them in a "good" way.
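
One candidate for a "good" resize is letterboxing: scale the crop so its longer side fits the model input, then pad the shorter side so the aspect ratio is preserved. The sketch below only computes the geometry (resized size and padding), so it works with any image library. The function name and the 224-pixel target are assumptions for illustration, not something decided in this issue.

```python
def letterbox_geometry(crop_w, crop_h, target=224):
    """Compute resize and padding so a crop fits a square input undistorted.

    Returns (new_w, new_h, pad_left, pad_top, pad_right, pad_bottom).
    `target` is an assumed model input size, not a project decision.
    """
    if crop_w <= 0 or crop_h <= 0:
        raise ValueError("crop dimensions must be positive")

    longest = max(crop_w, crop_h)
    # Scale both sides by the same factor so the longer side equals `target`.
    new_w = max(1, round(crop_w * target / longest))
    new_h = max(1, round(crop_h * target / longest))

    # Split the leftover space evenly; any odd pixel goes to the right/bottom.
    pad_w = target - new_w
    pad_h = target - new_h
    pad_left = pad_w // 2
    pad_top = pad_h // 2
    return new_w, new_h, pad_left, pad_top, pad_w - pad_left, pad_h - pad_top
```

For a 100x50 crop this gives a 224x112 resize with 56 pixels of padding above and below. Whether to pad with a constant color, edge replication, or real image context is a separate choice.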

Ideas

YOLOv8n-det should be considered.
Use an autoencoder trained on diverse, unlabeled data (can be pre-trained or in-house). This should help with feature extraction and mitigate some of the problems with our smaller datasets.
- If we train our own, we only need to care about the area inside the mask.
We need to augment our data manually since YOLO isn't helping. Since it's our own model, we should be able to do on-line augmentation with something like albumentations.
Maybe collect more IRL data that we can use in validation. We probably don't have enough to train on it.
Some traditional CV pre-processing.
- Sharpening and contrast enhancement. These could be useful, but, in theory, the model should be able to learn them easily. That said, they are cheap enough to add that they may be worth it if they make training easier.
Force the first shot to return square, slightly-enlarged bounding boxes to maximize information to this model and decrease the chance that we accidentally crop part of it.
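
The last idea (square, slightly enlarged boxes) can be sketched as a post-processing step on the first shot's output. This is a minimal illustration, assuming boxes arrive as pixel `(x1, y1, x2, y2)` corners; the function name and the `margin` parameter are hypothetical.

```python
def square_enlarged_box(x1, y1, x2, y2, img_w, img_h, margin=0.1):
    """Turn a detection box into a square, slightly enlarged box inside the image.

    `margin` is the fractional enlargement of the longer side (assumed value).
    Returns float corners (x1, y1, x2, y2); rounding is left to the caller.
    """
    cx = (x1 + x2) / 2
    cy = (y1 + y2) / 2

    # Side of the square: longer box side plus margin, capped by the image size.
    side = max(x2 - x1, y2 - y1) * (1 + margin)
    side = min(side, img_w, img_h)
    half = side / 2

    # Shift the center instead of shrinking the box when it would leave the image.
    cx = min(max(cx, half), img_w - half)
    cy = min(max(cy, half), img_h - half)
    return (cx - half, cy - half, cx + half, cy + half)
```

Shifting the center near image borders keeps the crop square, at the cost of the target no longer being centered in it.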

Action Items

Merge our datasets from different sources to make them more diverse (Merge YOLO datasets #177).
Maybe collect more IRL data. Even without labeling, we could use it in an unsupervised manner.
Implement ResNet-50 as a baseline.
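
Classifying all four attributes "in one go" suggests one shared backbone feeding four classification heads, trained on the sum of the per-head losses. The sketch below is framework-agnostic NumPy and covers only the head-splitting, loss, and decoding logic, assuming the network emits one flat logit vector per crop. The class counts in `HEAD_SIZES` are hypothetical placeholders; the issue does not specify them.

```python
import numpy as np

# Hypothetical class counts per attribute; not taken from the issue.
HEAD_SIZES = {"shape": 8, "character": 36, "shape_color": 8, "character_color": 8}


def split_heads(logits, head_sizes=HEAD_SIZES):
    """Slice a (batch, sum(sizes)) logit array into one array per attribute."""
    total = sum(head_sizes.values())
    if logits.shape[-1] != total:
        raise ValueError(f"expected {total} logits, got {logits.shape[-1]}")
    heads, start = {}, 0
    for name, size in head_sizes.items():
        heads[name] = logits[..., start:start + size]
        start += size
    return heads


def cross_entropy(logits, targets):
    """Mean softmax cross-entropy for integer class targets."""
    shifted = logits - logits.max(axis=-1, keepdims=True)  # numerical stability
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))
    return -log_probs[np.arange(len(targets)), targets].mean()


def multi_head_loss(logits, targets, head_sizes=HEAD_SIZES):
    """Sum of per-attribute cross-entropies (equal weights, an assumption)."""
    heads = split_heads(logits, head_sizes)
    return sum(cross_entropy(heads[name], targets[name]) for name in head_sizes)


def predict(logits, head_sizes=HEAD_SIZES):
    """Pick the most likely class independently for each attribute."""
    heads = split_heads(logits, head_sizes)
    return {name: head.argmax(axis=-1) for name, head in heads.items()}
```

Equal loss weights are the simplest starting point; if one attribute (for example, the character) turns out to be much harder, per-head weights could be tuned later.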


MinhxNguyen7 · 2024-05-25T07:43:49Z

The path forward has been decided (ResNet).

MinhxNguyen7 added the help wanted and Perception labels May 1, 2024

MinhxNguyen7 self-assigned this May 1, 2024

MinhxNguyen7 closed this as completed May 25, 2024
