- Curate a large data of real-world data and label (target is 200-500hours worth of video data) - hf? large server?