This repo contains the multi-modal data preparation code for Skeleton Aware Multi-modal Sign Language Recognition (SAM-SLR).
List of all six modalities:
- Full-body pose keypoints
- Full-body pose features
- RGB frames
- RGB optical flow
- HHA (depth)
- Depth flow
Use a pretrained whole-body pose estimation model to extract 133 landmarks from the RGB videos and save them as npy files (a small sketch for inspecting the output follows the steps below).
- Go to the wholepose folder and set the input_path and output_npy variables to the paths of the input videos and the output npy files.
- Download the pretrained whole-body pose model: Google Drive
- Run `python demo.py`
- Copy the generated npy files to the corresponding data folders.
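The exact layout of the saved arrays is defined by demo.py; the sketch below only assumes a common convention of one array of shape (num_frames, 133, 3) holding (x, y, confidence) per landmark, and the file name is a placeholder.

```python
import numpy as np

# Minimal sanity check for one extracted keypoint file.
# Assumed layout: (num_frames, 133, 3) with (x, y, confidence) per landmark.
keypoints = np.load("sample_video.npy")  # hypothetical file name

print(f"frames: {keypoints.shape[0]}, array shape: {keypoints.shape}")

# Low-confidence landmarks are common on occluded hands or faces;
# count them to get a feel for detection quality.
if keypoints.ndim == 3 and keypoints.shape[-1] == 3:
    low_conf = (keypoints[..., 2] < 0.3).mean()
    print(f"fraction of landmarks below confidence 0.3: {low_conf:.2%}")
```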
Use feature/wholepose_features_extraction.py to extract skeleton features.
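The actual features are defined by that script. As an illustration only, skeleton pipelines often build joint, bone, and motion streams from the keypoints; the sketch below assumes the (num_frames, 133, 3) layout from above and a made-up parent list, and does not reproduce the real script.

```python
import numpy as np

def joint_bone_motion(keypoints, parents):
    """Illustrative skeleton features: joints, bones (joint minus its parent
    joint), and per-frame motion. `parents` is a hypothetical parent index per
    landmark; the real script defines its own feature set."""
    joints = keypoints[..., :2]                            # (T, 133, 2), x/y only
    bones = joints - joints[:, parents, :]                 # vector to parent joint
    motion = np.diff(joints, axis=0, prepend=joints[:1])   # frame-to-frame displacement
    return joints, bones, motion

# Example with dummy data and a trivial parent list (every joint's parent is joint 0).
kps = np.random.rand(16, 133, 3).astype(np.float32)
parents = np.zeros(133, dtype=np.int64)
j, b, m = joint_bone_motion(kps, parents)
print(j.shape, b.shape, m.shape)  # (16, 133, 2) each
```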
Extract frames from the RGB videos and crop them to 256x256 according to the whole-body pose skeletons extracted above (see the cropping sketch after these steps).
- Change the folder, npy_folder, and out_folder variables accordingly in gen_frames.py.
- Run `python gen_frames.py`
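As a rough illustration of the cropping step, the sketch below takes the bounding box of one frame's keypoints, expands it by a margin, and resizes the crop to 256x256 with OpenCV. The margin factor and function name are assumptions; the actual rule lives in gen_frames.py.

```python
import cv2
import numpy as np

def crop_around_skeleton(frame, keypoints_xy, size=256, margin=1.2):
    """Crop a square region centered on the keypoint bounding box and resize it
    to size x size. The margin factor is an assumption; only the 256x256 target
    follows the description above."""
    h, w = frame.shape[:2]
    x_min, y_min = keypoints_xy.min(axis=0)
    x_max, y_max = keypoints_xy.max(axis=0)
    cx, cy = (x_min + x_max) / 2, (y_min + y_max) / 2
    half = max(x_max - x_min, y_max - y_min) * margin / 2
    x0, x1 = int(max(cx - half, 0)), int(min(cx + half, w))
    y0, y1 = int(max(cy - half, 0)), int(min(cy + half, h))
    crop = frame[y0:y1, x0:x1]
    return cv2.resize(crop, (size, size))

# Usage with dummy data: a gray frame and random keypoints in pixel coordinates.
frame = np.full((720, 1280, 3), 128, dtype=np.uint8)
kps_xy = np.random.uniform([300, 100], [900, 700], size=(133, 2)).astype(np.float32)
patch = crop_around_skeleton(frame, kps_xy)
print(patch.shape)  # (256, 256, 3)
```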
There are two types of flow modality: color flow and depth flow. The raw flow data is first obtained with a pretrained Caffe model; then flow_x and flow_y are combined and the result is cropped using gen_flow.py (a small sketch of this combination follows the steps below).
- Obtain the raw flow data from the videos using Docker as described in optical_flow_guidelines.docx.
- Change the folder, npy_folder, and out_folder variables accordingly in gen_flow.py.
- Run `python gen_flow.py`
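As a rough sketch of the combination step, the snippet below assumes the flow extractor writes separate single-channel flow_x/flow_y images, stacks one pair into a 2-channel array, and crops it. File names and crop coordinates are placeholders; gen_flow.py implements the actual logic.

```python
import cv2
import numpy as np

# Read one pair of flow images; assumed to be single-channel 8-bit images
# written by the Docker flow extractor (file names are hypothetical).
flow_x = cv2.imread("flow_x_00001.jpg", cv2.IMREAD_GRAYSCALE)
flow_y = cv2.imread("flow_y_00001.jpg", cv2.IMREAD_GRAYSCALE)

# Stack the x/y displacement into one 2-channel array, then crop the same
# 256x256 region used for the RGB frames (coordinates here are placeholders).
flow = np.stack([flow_x, flow_y], axis=-1)
x0, y0 = 100, 50
crop = flow[y0:y0 + 256, x0:x0 + 256]
print(crop.shape)  # (256, 256, 2)
```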
Use the MATLAB code in Depth2HHA_master_mat to extract HHA from the depth videos (extracting HHA features takes a long time). Then crop the HHA images and mask out pixels using gen_hha.py; an illustrative crop-and-mask sketch follows the steps below.
- Change the input_folder, output_folder, and hha_root variables accordingly in CVPR21Chal_convert_HHA.m and run the script.
- Change the folder, npy_folder, and out_folder variables accordingly in gen_hha.py.
- Run `python gen_hha.py`
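As an illustration of the final crop-and-mask step, the sketch below crops an HHA image to a box, zeroes pixels outside a binary foreground mask, and resizes to 256x256. The box, the mask semantics, and the function name are assumptions; gen_hha.py defines the actual behavior.

```python
import cv2
import numpy as np

def crop_and_mask_hha(hha, mask, box, size=256):
    """Crop the HHA image to the given box, zero out pixels where the mask is
    off, and resize to size x size. The box and mask meaning are assumptions."""
    x0, y0, x1, y1 = box
    hha_crop = hha[y0:y1, x0:x1]
    mask_crop = mask[y0:y1, x0:x1]
    hha_crop = hha_crop * (mask_crop[..., None] > 0)  # mask out background pixels
    return cv2.resize(hha_crop, (size, size))

# Usage with dummy data: a random HHA image and a circular foreground mask.
hha = np.random.randint(0, 255, (480, 640, 3), dtype=np.uint8)
mask = np.zeros((480, 640), dtype=np.uint8)
cv2.circle(mask, (320, 240), 150, 255, -1)
out = crop_and_mask_hha(hha, mask, (170, 90, 470, 390))
print(out.shape)  # (256, 256, 3)
```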