January 2020
tl;dr: Summary of the main idea.
Adds a social pooling layer that pools the hidden stages of the neighbors within a spatial radius.
- Instead of a spatial occupancy grid, replace the occupancy with LSTM embedding.
- Summary of technical details
- talk at CVPR: the animation of predicting a person passing through the gap of a crowd is cool.