This repository contains the code for the project "Action Transcript Prediction from Multi-Modal Environments using Vision-Language Models" from the University of Bonn, Computer Science Institute VI, Center for Robotics.
For the essential packages required by the code, see requirements.txt.
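A typical setup might look like the following (a sketch, assuming Python 3 with the venv module and pip are available, and that the commands are run from the repository root):

```shell
# Create and activate an isolated environment (optional but recommended)
python3 -m venv .venv
. .venv/bin/activate

# Install the dependencies pinned in requirements.txt
pip install -r requirements.txt
```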
To download the ALFRED dataset, follow the instructions at: https://github.com/askforalfred/alfred
If you want to try a lighter backbone such as MobileCLIP, install it from the official repository: https://github.com/apple/ml-mobileclip
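One common way to install a package from a GitHub repository is via pip's VCS support; this is a sketch only, and the canonical installation steps (package layout, extras, checkpoint downloads) are in the apple/ml-mobileclip README:

```shell
# Install MobileCLIP directly from the official repository
# (assumes the repo is pip-installable; see its README for the exact steps)
pip install git+https://github.com/apple/ml-mobileclip.git
```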
pretraining: all pretraining and preprocessing code
model: the end-to-end model for generating action sequences
dataset: the dataset used for end-to-end training