-
Notifications
You must be signed in to change notification settings - Fork 0
Description
Hi authors, thanks for this new diverse SLAM dataset! I was downloading some validation sequences with intention to reconstruct a scene point cloud using the depth and camera pose (ie. gt_trajectory.txt) information. However, I found that the depth data from its .h5 files have a different batch dimension than that of the camera pose data, the latter of which is consistent with the number of frames in the user and GT view videos.
For example, for sequence new_indoor_close_14 validation's 000003.tar:
- GT and user videos have 5206 frames, and
gt_trajectory.txthas shape:(5206, 8) depth_frame_0.h5andconfidence_frame_0.h5's data has shape:(4841, 135, 240)
Also for sequence new_scanning_54 validation's 000021.tar:
- GT and user videos have 3480 frames, and
gt_trajectory.txthas shape:(3480, 8) depth_frame_0.h5andconfidence_frame_0.h5's data has shape:(3941, 135, 240)
For this sequence I noticed that the first 460 frames of confidence and depth data are all zeros.
I wasn't able to find information about the mapping between depth data and video frames in the documentation. Could the proper utilization of depth be clarified? Thanks.