AutoGesture with 3DCDC

Pytorch code for the TIP paper "Searching Multi-Rate and Multi-Modal Temporal Enhanced Networks for Gesture Recognition"

Welcome to plug and play 3DCDC in your networks

# -------- Vanilla ---------#
nn.Conv3d(3, 64, kernel_size=3, padding=1)

# -------- 3DCDC ---------#
from 3DCDC import CDC_ST, CDC_T, CDC_TR
CDC_ST(3, 64, kernel_size=3, padding=1, theta=0.6)
CDC_T(3, 64, kernel_size=3, padding=1, theta=0.6)
CDC_TR(3, 64, kernel_size=3, padding=1, theta=0.3)

Citation

If you find our project useful in your research, please consider citing:

@article{yu2021searching,
  title={Searching Multi-Rate and Multi-Modal Temporal Enhanced Networks for Gesture Recognition},
  author={Yu, Zitong and Zhou, Benjia and Wan, Jun and Wang, Pichao and Chen, Haoyu and Liu, Xin and Li, Stan Z and Zhao, Guoying},
  journal={IEEE Transactions on Image Processing (TIP)},
  year={2021}
}

Pretrained model on IsoGD

You can download the checkpoints from google drive

Visualization

Figure 1: The searched architecture from (a) the first stage NAS, and (b) the second stage NAS. The three rows in (a) represent the searched cell structure in the low, mid, and high frame branches, respectively.

Figure 2: Features visualization from C3D assembled with varied convolutions on the IsoGD dataset. With (a) RGB and (b) Depth modality inputs, the four rows represent the neural activation with 3D vanilla convolution, 3D-CDC-ST, 3D-CDC-T, and 3D-CDC-TR, respectively.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

AutoGesture with 3DCDC

Welcome to plug and play 3DCDC in your networks

Citation

Pretrained model on IsoGD

Visualization

Files

README.md

Latest commit

History

README.md

File metadata and controls

AutoGesture with 3DCDC

Welcome to plug and play 3DCDC in your networks

Citation

Pretrained model on IsoGD

Visualization