Skip to content

YiwenShaoStephen/mergeNet

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MergeNet

A novel method to do instance segmentation in a semantic segmentation and speech decoding fashion. It doesn't need any bounding boxes or regions of interest to detect objects. Instead, it uses the pixel-wise "class" and "offset" output to dynamically merge pixels into instances.

Oracle Example

Original Image

This is a sample image from Cityscapes dataset. img1

Classes output

The probability of one pixel belongs to each classes as in semantic segmentation. img2

Offsets output

The probability of one pixel (x, y) is of the same instance with its offset pixel (x+i, y+j). img3

Offset values

We choose offset values (i, j) from a Logarithmic spiral to approximately get the relationship between one pixel to its neighborhood and in turn the shape of each instance. Note that a large offset (e.g. [-40, 0]) is especially necessary and helpful when there is occlusion.

img4

Merging Process

We use a priority queue to greedly merge instances when it can improve the total log-likihood of the image. We initialize the merging state by assigning each pixel with a distinct instance ID and we stop merging when the queue is empty (i.e. a merging operation can no longer improve the log-likihood). img5

About

instance segmentation

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published