-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question about input depth image #5
Comments
Since you probably want to use the normal images as the ground truth for a model, you want them to be as high-quality as possible. In general, the denser, aggregated KITTI depth completion ground truth images will work much better for this. The surface normal estimation algorithm estimates a local plane in an nxn window (the default window size is 15 px), so you will have more points in the window for a more accurate estimate and will also have more than the minimal required 3 points within that window. The only situation where the sparser single-scan depth maps are more accurate seems to be in the presence of dynamic objects, where the noise from the imperfect point cloud aggregation in the denser depth map results in some "wobblyness" in the normals for the dynamic objects. Also, the sparse depth images have not been filtered to exclude points that should be occluded, but overlap with closer ones due to being transformed into the camera frame.
If you are asking whether the model trained on cropped images can also process full-size images, then yes, the DeepLidar model is a convolutional model and is not limited to a fixed image size. |
1st question: Thanks for your detailed answer. maybe if I want to supervise surface normal for my output depth map, I will choose to use (sparse+gt) to generate surface normal to get more precise surface normal groundtruth. |
Hi, how does one access these dense depth GT images? I've only been able to find the sparse ones shown first in the above post. Thanks! |
Hello, valgur!
Thanks for sharing your code about computing surface-normal. I have a question about the input depth image. I know it is from KITTI depth completion dataset, but dont know it is a input sparse depth map or dense ground truth depth map. And will the sparisty of depth map effect the quality of surface normal?
Also, the second question is that when we are training our model, we will use cropped depth image, can it compute the right surface normal at 256x512 depth image size but not full scale size.
The text was updated successfully, but these errors were encountered: