
Mosaic Transform #6534

Open

abhi-glitchhg wants to merge 16 commits into main from mosaic
Conversation

abhi-glitchhg
Contributor

Part of #6323

@datumbox
Contributor

@abhi-glitchhg Just checking with you to see if you got stuck anywhere. :) Let me know if you face any issues.

@abhi-glitchhg
Contributor Author

abhi-glitchhg commented Sep 15, 2022

Hey @datumbox, thanks for checking in on me! 🤗 I was a bit busy for some time.

I have gone through the mosaic implementation and have understood it.

I have a basic implementation locally. Hopefully, by this weekend, I will clean up and update this PR.
Thanks,
Abhijit :)

@abhi-glitchhg
Contributor Author

Still WIP

@abhi-glitchhg abhi-glitchhg marked this pull request as ready for review September 20, 2022 08:53
@abhi-glitchhg abhi-glitchhg marked this pull request as draft September 20, 2022 09:22
@abhi-glitchhg
Contributor Author

First of all, I apologize for the inactivity on this PR. I'll be more regular from now on.

I have used the Penn-Fudan Pedestrian dataset to check the implementation. Download the dataset.
I have tested the implementation with the following code. To create an image tensor of shape B*4*C*H*W, I have used a for loop; there might be a more efficient way to do this (a possible alternative is sketched after the code).

import os

import numpy as np
import torch
from PIL import Image

from torchvision import utils
from torchvision.prototype import transforms, datapoints
from torchvision.prototype.transforms import functional as F

from references.detection.transforms import Mosaic


class PennFudanDataset(torch.utils.data.Dataset):
    def __init__(self, root, transforms):
        self.root = root
        self.transforms = transforms
        # load all image files, sorting them to
        # ensure that they are aligned
        self.imgs = list(sorted(os.listdir(os.path.join(root, "PNGImages"))))
        self.masks = list(sorted(os.listdir(os.path.join(root, "PedMasks"))))

    def __getitem__(self, idx):
        # load images and masks
        img_path = os.path.join(self.root, "PNGImages", self.imgs[idx])
        mask_path = os.path.join(self.root, "PedMasks", self.masks[idx])
        img = Image.open(img_path).convert("RGB")
        img = F.pil_to_tensor(img)
        # note that we haven't converted the mask to RGB,
        # because each color corresponds to a different instance
        # with 0 being background
        mask = Image.open(mask_path)
        # convert the PIL Image into a numpy array
        mask = np.array(mask)
        # instances are encoded as different colors
        obj_ids = np.unique(mask)
        # first id is the background, so remove it
        obj_ids = obj_ids[1:]

        # split the color-encoded mask into a set
        # of binary masks
        masks = mask == obj_ids[:, None, None]

        # get bounding box coordinates for each mask
        num_objs = len(obj_ids)
        boxes = []
        for i in range(num_objs):
            pos = np.where(masks[i])
            xmin = np.min(pos[1])
            xmax = np.max(pos[1])
            ymin = np.min(pos[0])
            ymax = np.max(pos[0])
            boxes.append([xmin, ymin, xmax, ymax])

        # convert everything into a torch.Tensor
        boxes = torch.as_tensor(boxes, dtype=torch.float32)
        # there is only one class
        labels = torch.ones((num_objs,), dtype=torch.int64)
        masks = torch.as_tensor(masks, dtype=torch.uint8)

        #image_id = torch.tensor([idx])
        #area = (boxes[:, 3] - boxes[:, 1]) * (boxes[:, 2] - boxes[:, 0])
        # suppose all instances are not crowd
        #iscrowd = torch.zeros((num_objs,), dtype=torch.int64)

        img = datapoints.Image(img)
        boxes = datapoints.BoundingBox(boxes, format=datapoints.BoundingBoxFormat.XYXY, spatial_size=F.get_spatial_size(img))
        labels = datapoints.Label(labels)
        if self.transforms is not None:
            img, boxes, labels = self.transforms(img, boxes, labels)

        return img, boxes, labels

    def __len__(self):
        return len(self.imgs)


def collate_fn(batch):
    return tuple(zip(*batch))

dataset = PennFudanDataset(root="./../PennFudanPed", transforms=transforms.Resize((350, 324)))  # change the root parameter according to your directory structure

data_loader = torch.utils.data.DataLoader(
    dataset, batch_size=4, shuffle=True, num_workers=1, collate_fn=collate_fn)

B = 16  # batch size: number of groups of 4 images
counter = 0

batched_images = []
batched_boxes = []
batched_labels = []
for image, boxes, labels in data_loader:
    image = torch.stack(image)
    boxes = list(boxes)
    labels = [*labels[0], *labels[1], *labels[2], *labels[3]]
    batched_images.append(image)
    batched_boxes.append(boxes)
    batched_labels.append(labels)
    counter += 1
    if counter >= B:
        break

batched_images = torch.stack(batched_images)
mosaic = Mosaic()
output = mosaic(batched_images, batched_boxes, batched_labels)

for i in range(B):
    viz = utils.draw_bounding_boxes(F.to_image_tensor(output[0][i]), boxes=output[1][i])
    F.to_pil_image(viz).show()
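
The "more efficient way" mentioned above might look something like the untested sketch below (not part of this PR; big_loader, stacked, grouped_boxes, and grouped_labels are hypothetical names): draw all 4 * B samples in a single DataLoader batch and regroup them, which avoids the outer Python loop. It assumes every image has the same size, which holds here because of the Resize transform.

# Untested sketch of an alternative to the batching loop above (hypothetical names).
big_loader = torch.utils.data.DataLoader(
    dataset, batch_size=4 * B, shuffle=True, num_workers=1, collate_fn=collate_fn)
images, boxes, labels = next(iter(big_loader))       # tuples of length 4 * B
stacked = torch.stack(images)                        # (4 * B, C, H, W)
stacked = stacked.reshape(B, 4, *stacked.shape[1:])  # (B, 4, C, H, W)
grouped_boxes = [list(boxes[4 * i:4 * (i + 1)]) for i in range(B)]
grouped_labels = [[lbl for lab in labels[4 * i:4 * (i + 1)] for lbl in lab] for i in range(B)]
# output = mosaic(stacked, grouped_boxes, grouped_labels)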

Comment on lines +602 to +604
super().__init__()
self.min_frac = min_frac
self.max_frac = max_frac
Contributor Author


Here we need to check that the min_frac and max_frac arguments are between 0 and 1.
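
A minimal sketch of such a check (illustrative only, not the code in this PR; the exact error message is open for discussion):

# Hypothetical validation for the constructor shown above; a sketch, not the PR's code.
if not (0.0 <= min_frac <= 1.0) or not (0.0 <= max_frac <= 1.0):
    raise ValueError(f"min_frac and max_frac must be in [0, 1], got {min_frac} and {max_frac}.")

We might also want to enforce min_frac <= max_frac here.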

@abhi-glitchhg abhi-glitchhg marked this pull request as ready for review January 26, 2023 10:42
@oke-aditya
Contributor

Aah, we need to review this. Well, I will try my best to find time to review this 😄 as well as understand how this works :)

@abhi-glitchhg
Contributor Author

abhi-glitchhg commented Feb 14, 2023

Aah, we need to review this. Well, I will try my best to find time to review this 😄 as well as understand how this works :)

Yeah, sure! Let me know if something is not clear.

@byronyi

byronyi commented May 31, 2023

Gentle ping for any updates.

@abhi-glitchhg abhi-glitchhg deleted the mosaic branch July 12, 2024 15:12
@abhi-glitchhg abhi-glitchhg restored the mosaic branch July 12, 2024 15:12
@abhi-glitchhg abhi-glitchhg reopened this Jul 12, 2024