-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use a Neural Network for hand recognition & tracking #2
Comments
One easy out-of-the-box solution for feature detection of hands is this. An implementation of this can be found in the branch issue/neuralnet Important Sidenote: |
I managed to train a the deeplabv3 model from pytorch to do semantic hand segmentation using a tutorial . But the result is much slower than hoped for :/ Here is one gif of the detection at full image resolution: Here the resolution was reduced by 30%: Edit: Edit: |
I am currently trying this tutorial wich has a well documented repository on github. It provides several different models as backbones, including very lightweight models like MobileNet. So I am hoping for a good inference time. |
Here is the result using FCN32 and MobilNet from the previously mentioned repository. It is much faster but also very inacurate. The inacuracy might be due to the fact that I only used one third of the training data this time. I will try to train with more data and maybe switch out the models. Update: And here Segnet and Mobilenet (5 epochs): Update2: |
Currently depth and optical flow are not used for the segmentation. However the depth values from the camera are very inacurate, so there is not much that can be done with them. |
Imo hand feature detection (eg with mediapipe) is much more feasible in this project. It has the downside of having animate the detection. |
Opencv provides a builtin class for background removal: We should try it out. Edit: Maybe this can be done with GrabCut. |
Maybe to try a lightweight neural network (like YOLO?) with the colorimage, depthimage & fingertip features. Use current approach to generate training data.
The text was updated successfully, but these errors were encountered: