Skip to content
/ PGP Public

(CoRL 2023) Preference Learning for Guiding the tree searches in continuous POMDPs

Notifications You must be signed in to change notification settings

iMSquared/PGP

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Preference Learning for Guiding the tree searches in continuous POMDPs

Accepted at CoRL 2023

[Paper] [Project page] [Github] [SlideShare]

We take inspiration from the recent work in learning to guide task and motion planning to propose a framework that learns to guide POMCPOW from past planning experience. Our method uses prefer- ence learning that utilizes both success and failure trajectories, where the preference label is given by the results of the tree search. We demonstrate the efficacy of our framework in several continuous partially observable robotics domains, including real-world manipulation, where our framework explicitly reasons about the uncertainty in off-the-shelf segmentation and pose estimation algorithms.

image

Code details

We implement on two following domains: (1) Light Dark domain and (2) object fetching domain. Check out toy_domain and Simulation folder, respectively.

Cite this work

@inproceedings{ahn2023preference,
  title={Preference learning for guiding the tree search in continuous POMDPs},
  author={Ahn, Jiyong and Son, Sanghyeon and Lee, Dongryung and Han, Jisu and Son, Dongwon and Kim, Beomjoon},
  booktitle={Conference on Robot Learning},
  pages={3929--3948},
  year={2023},
  organization={PMLR}
}