My implementation of the paper "Language-driven Semantic Segmentation" (LSeg) by Boyi Li et al.
A dense prediction transformer (DPT) with a modified head produces pixel-level embeddings, while the CLIP text encoder embeds a set of label words. Both embeddings are combined in a multimodal latent space (the orange tensor in the figure), which is then compared against the ground-truth labels of an annotated image.
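As a rough sketch of this fusion step (shapes and the temperature value are illustrative assumptions, not taken from the paper's code):

```python
import torch
import torch.nn.functional as F

# Illustrative shapes: batch, embedding dim, spatial size, number of label words.
B, C, H, W, N = 2, 512, 60, 80, 5
pixel_emb = torch.randn(B, C, H, W)   # stand-in for DPT per-pixel embeddings
text_emb = torch.randn(N, C)          # stand-in for CLIP text embeddings

# Normalize so the dot product is a cosine similarity.
pixel_emb = F.normalize(pixel_emb, dim=1)
text_emb = F.normalize(text_emb, dim=1)

# Every pixel embedding dotted with every word embedding gives the
# multimodal tensor of shape (B, N, H, W) -- the orange tensor in the figure.
logits = torch.einsum("bchw,nc->bnhw", pixel_emb, text_emb)

# Training compares it against the annotated labels, e.g. with cross-entropy.
labels = torch.randint(0, N, (B, H, W))
loss = F.cross_entropy(logits / 0.07, labels)  # 0.07: CLIP-style temperature (assumed)
```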
We train our model only on the ADE20K and COCO Panoptic datasets, which we download and relabel with MSeg-API. I recommend following its instructions step by step, but with a few modifications:
- `mseg-api` should be cloned into the repo's main directory.
- In the scripts from the `mseg-api/download_scripts` folder, you need to comment out the parts regarding other datasets.
- Place the dataset downloads into `data/` (a directory in the repo's root). This is done when you define `MSEG_DST_DIR` (see the layout sketch after this list).
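Based only on the paths referenced in this README (the exact subfolders produced by the MSeg scripts may differ), `data/` should end up looking roughly like:

```
data/
├── mseg_dataset/
│   └── ADE20K/
│       └── ADEChallengeData2016/
└── COCOPanoptic/
    └── images/
        ├── train2017/
        └── val2017/
```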
Once everything is downloaded, we use the `mseg-semantic` utils to interact with the data and create the dataloader.
- I needed to change `ade20k_images_dir = "data/mseg_dataset/ADE20K/"` to `ade20k_images_dir = "data/mseg_dataset/ADE20K/ADEChallengeData2016/"` in `Lseg/utils/util.py`; otherwise an error shows up.
- Use `test_data_utils.ipynb` to check that we are fetching the images correctly. In my case, for the COCO dataset, neither `train2017` nor `val2017` was inside `data/COCOPanoptic/images/`, so I had to create that folder myself and put both splits inside (see the sanity check after this list).
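A minimal check for the COCO folder issue above (a hypothetical helper, not part of the repo; it only mirrors what `test_data_utils.ipynb` verifies):

```python
import os

# Both COCO image splits must live under data/COCOPanoptic/images/; if one is
# missing, create the folder and move train2017/ and val2017/ into it.
coco_images = "data/COCOPanoptic/images"
for split in ("train2017", "val2017"):
    path = os.path.join(coco_images, split)
    status = "ok" if os.path.isdir(path) else "MISSING"
    print(f"{path}: {status}")
```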
- Thanks to Richard Zhao for answering all my annoying questions.
- Useful repositories: