GOMA: Proactive Embodied Cooperative Communication via Goal-Oriented Mental Alignment
IROS 2024 | Paper | Website | Video
The GOMA algorithm casts human-robot communication as a planning problem: the robot selects utterances that maximally improve the efficiency of the joint plan in a partially observable environment.
Reward of the robot sharing information $X$ with the human:

$$R(\text{share } X) = \mathrm{KL}\left(\mathbb{E}[\text{human plan} \mid \text{human mind} + X] \,\middle\|\, \mathbb{E}[\text{human plan} \mid \text{human mind}]\right) - C$$

Reward of the robot requesting information $X$ from the human:

$$R(\text{request } X) = \mathrm{KL}\left(\mathbb{E}[\text{robot plan} \mid \text{robot mind} + X] \,\middle\|\, \mathbb{E}[\text{robot plan} \mid \text{robot mind}]\right) - C$$

where $C$ is the communication cost.
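As a minimal illustration of this scoring rule (not the repo's implementation; the plan distributions, candidate utterance, and cost below are hypothetical), the robot speaks only when the induced change in the partner's expected plan outweighs the communication cost:

```python
import numpy as np

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) between two discrete distributions over candidate plans."""
    p = np.asarray(p, dtype=float) + eps
    q = np.asarray(q, dtype=float) + eps
    p /= p.sum()
    q /= q.sum()
    return float(np.sum(p * np.log(p / q)))

def communication_reward(plan_dist_after, plan_dist_before, cost):
    """R = KL(E[plan | mind + X] || E[plan | mind]) - C."""
    return kl_divergence(plan_dist_after, plan_dist_before) - cost

# Hypothetical numbers: sharing X sharpens the human's plan distribution,
# so the utterance is worth its cost.
before = [0.25, 0.25, 0.25, 0.25]  # E[human plan | human mind]
after = [0.70, 0.10, 0.10, 0.10]   # E[human plan | human mind + X]
print(communication_reward(after, before, cost=0.1))  # ~0.35 > 0, worth saying
```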
Before running the following code, replace `${VH_PATH}` and `${GOMA_PATH}` with paths on your machine, and set `OPENAI_API_KEY` to your own OpenAI API key.
```bash
# 1. prepare virtualhome environment
## code
git clone https://github.com/xavierpuigf/virtualhome ${VH_PATH}
cd ${VH_PATH}
git switch wah
## executable
wget http://virtual-home.org/release/simulator/v2.0/v2.2.4/linux_exec.zip -O v2.2.4.zip
unzip v2.2.4.zip -d ${VH_PATH}/bin/v2.2.4
export VH_BIN="${VH_PATH}/bin/v2.2.4/linux_exec.v2.2.4.x86_64"
export PYPATH_VH="${VH_PATH}/virtualhome:${VH_PATH}/virtualhome/simulation"
```
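Optionally, sanity-check the simulator before moving on. A minimal sketch, assuming `PYPATH_VH` is on your `PYTHONPATH` and using the `UnityCommunication` class from the virtualhome package (argument and method names follow that package; adjust to your checkout):

```python
import os
from unity_simulator.comm_unity import UnityCommunication

# Launch the downloaded executable and connect on port 8088.
comm = UnityCommunication(file_name=os.environ["VH_BIN"], port="8088")
comm.reset(0)  # load the first apartment
success, graph = comm.environment_graph()
print("connected:", success, "| nodes in scene graph:", len(graph["nodes"]))
```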
```bash
# 2. clone this repo
git clone https://github.com/SCAI-JHU/GOMA ${GOMA_PATH}

# 3. setup environment variables
export OPENCV_IO_ENABLE_OPENEXR=1
export OPENAI_API_KEY=...
export OPENAI_MODEL="gpt-4"
export OPENAI_MAX_TOKENS="256"
```
```bash
# 4. run experiment
cd ${GOMA_PATH}/testing_agents
export PYTHONPATH="${PYPATH_VH}:${GOMA_PATH}:$PYTHONPATH"
python test_template_agent_structured.py \
    --base-port=8088 \
    --num-belief-particles=10 \
    --num-proc=10 \
    --model="goma"
```
For VS Code users, we also provide `.vscode/launch.json` for quick debugging.
`train_env_task_set.pik` contains 21 collaborative scenarios in VirtualHome across 4 goal types (`setup_table`, `put_fridge`, `prepare_food`, `put_dishwasher`) and 3 simulated apartments.
Below is an example of the task format:
```json
[
{
"task_id": 5,
"task_name": "setup_table",
"env_id": 0,
"task_goal": {
"0": {
"on_wineglass_231": 3,
"on_plate_231": 3,
"on_cutleryfork_231": 3
},
"1": {}
},
"level": 0,
"init_rooms": ["bedroom", "bathroom"],
"init_graph": {
"nodes": [
{
"id": 11,
"category": "Rooms",
"class_name": "bathroom",
"prefab_name": "PRE_ROO_Bathroom_01",
"obj_transform": {
"position": [-6.385, -0.003, -0.527],
"rotation": [0.0, 0.0, 0.0, 1.0],
"scale": [1.0, 1.0, 1.0]
},
"bounding_box": {
"center": [-5.135, 1.247, 0.723],
"size": [8.0, 3.0, 5.5]
},
"properties": [],
"states": []
}, {}, {}, {}
],
"edges": [
{
"from_id": 12,
"to_id": 11,
"relation_type": "INSIDE"
}, {}, {}, {}
]
}
}, {}, {}, {}
]
```
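To inspect the task set programmatically, a minimal sketch (assuming the file is a pickled list of plain Python dicts in the format above):

```python
import pickle

with open("train_env_task_set.pik", "rb") as f:
    tasks = pickle.load(f)

print(len(tasks), "tasks")
for task in tasks[:3]:
    # task_goal maps an agent id to {goal predicate: count}; e.g.
    # "on_wineglass_231": 3 asks for three wine glasses on object 231.
    print(task["task_id"], task["task_name"], task["env_id"], task["task_goal"]["0"])
```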
If an experiment is interrupted, stale simulator processes may keep the port occupied; free it with (replace 8088 with your `--base-port`):

```bash
lsof -i :8088 -t | xargs -r kill -9
```
Please cite our paper and star this repo if you find it interesting or useful. Thank you!
```bibtex
@inproceedings{ying2024goma,
title={{GOMA}: Proactive Embodied Cooperative Communication via Goal-Oriented Mental Alignment},
author={Ying, Lance and Jha, Kunal and Aarya, Shivam and Tenenbaum, Joshua B and Torralba, Antonio and Shu, Tianmin},
booktitle={2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)},
pages={7099--7106},
year={2024},
organization={IEEE}
}
```
Our code is based on watch_and_help and online_watch_and_help.