Added support for EXIF orientation transform in read_image for JPEG #8279

vfdev-5 · 2024-02-23T17:19:38Z

TODO:

fix failing smoke test
make public API less BC-breaking as new args seems to be required withtout taking default value.

Notes:

How to get exif data with Pillow:

from PIL import Image
pimg = Image.open("image-with-exif.jpg")

# Binary exif data:
print(pimg.info['exif'])
# b'Exif\x00\x00MM\x00*\x00\x00\x00\x08\x00\n\x01\x0f\x00\x02\x00\x00\x00\x06\x00\x00\x00\x86\x01\x10\x00\x02\x00\x00\x00\n\x00\x00\x00\x8c\x01\x12\x00\x03\x00\x00\x00\x01\x00\x06\x00\x00\x01\x1a\x00\x05\x00\x00\x00\x01\x00\x00\x00\x96\x01\x1b\x00\x05\x00\x00\x00\x01\x00\x00\x0
# 0\x9e\x01(\x00\x03\x00\x00\x00\x01\x00\x02\x00\x00\x011\x00\x02\x00\x00\x00\x07\x00\x00\x00\xa6\x012\x00\x02\x00\x00\x00\x14\x00\x00\x00\xae\x02\x13\x00\x03\x00\x00\x00\x01\x00\x01\x00\x00\x87i\x00\x04\x00\x00\x00\x01\x00\x00\x00\xc2\x00\x00\x07\xc8Apple\x00iPhone XR\x00\x00\
# x00\x00H\x00\x00\x00\x01\x00\x00\x00H\x00\x00\x00\x0113.1.2\x00\x002019:10:23


# Parsed exif data:
exif = pimg.getexif()
for tag, value in exif.items():
    if tag == 274:  # orientation tag
        print(tag, value)

cc @gau-nernst @kero-ly

pytorch-bot · 2024-02-23T17:19:42Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/vision/8279

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit af933cb with merge base bca7a49 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

…n in the op registration

vfdev-5 · 2024-02-23T17:49:15Z

torchvision/csrc/io/image/image.cpp

@@ -23,11 +23,11 @@ static auto registry =
    torch::RegisterOperators()
        .op("image::decode_png", &decode_png)
        .op("image::encode_png", &encode_png)
-        .op("image::decode_jpeg", &decode_jpeg)
+        .op("image::decode_jpeg(Tensor data, int mode, bool apply_exif_orientation=False) -> Tensor", &decode_jpeg)


This helps to make the op BC compatible, otherwise apply_exif_orientation becomes required and thus break the old code

NicolasHug

Thanks a ton for working on this @vfdev-5 ! I made a few comments / questions but this looks great

torchvision/csrc/io/image/cpu/decode_image.cpp

torchvision/csrc/io/image/cpu/exif.h

torchvision/csrc/io/image/cpu/decode_jpeg.cpp

torchvision/csrc/io/image/cpu/exif.h

torchvision/csrc/io/image/cpu/decode_jpeg.cpp

NicolasHug · 2024-03-04T18:01:26Z

torchvision/csrc/io/image/cpu/decode_jpeg.cpp

+          size_t num_entry = get_uint16(exif_data_vec, endianness, offset);
+          offset += 2; // go to start of tag fields
+          constexpr size_t tiff_field_size = 12;
+          for (size_t entry = 0; entry < num_entry; entry++) {


Is there any risk that this loop can segfault/overflow if the entries are malformed?

get_uint16 can return -1 in case offset + required size for uint16 go outside the actual buffer size, so we can catch it here and return earlier.
I hope the code is safe but I can't say 100% sure. I have to investigate a bit more.

torchvision/csrc/io/image/cpu/decode_jpeg.cpp

torchvision/csrc/io/image/cpu/exif.h

NicolasHug

Thanks a lot @vfdev-5 , some minor comments but LGTM

torchvision/csrc/io/image/cpu/exif.h

test/test_image.py

torchvision/csrc/io/image/cpu/exif.h

github-actions · 2024-03-05T14:14:36Z

Hey @vfdev-5!

You merged this PR, but no labels were added.
The list of valid labels is available at https://github.com/pytorch/vision/blob/main/.github/process_commit.py

…or JPEG (#8279) Reviewed By: vmoens Differential Revision: D55062774 fbshipit-source-id: 6a75321d8f86630d54cd15ad7433f71d0735659d

Added support for EXIF orientation transform in read_image for JPEG

a40be0a

facebook-github-bot added the cla signed label Feb 23, 2024

vfdev-5 requested a review from NicolasHug February 23, 2024 17:20

Fixed smoke test and set signature with dafault apply_exif_orientatio…

4401c2e

…n in the op registration

vfdev-5 force-pushed the read_image_exif_support branch from a7acc1c to 4401c2e Compare February 23, 2024 17:48

vfdev-5 commented Feb 23, 2024

View reviewed changes

NicolasHug reviewed Mar 4, 2024

View reviewed changes

vfdev-5 force-pushed the read_image_exif_support branch from 855d5de to 5314d43 Compare March 5, 2024 10:53

Addressed PR review

44f4d81

vfdev-5 force-pushed the read_image_exif_support branch from 5314d43 to 44f4d81 Compare March 5, 2024 10:56

vfdev-5 added 2 commits March 5, 2024 11:57

Merge branch 'main' into read_image_exif_support

951edd7

Fixed formatting issue and added exif data note

2b4dcd8

NicolasHug approved these changes Mar 5, 2024

View reviewed changes

NicolasHug reviewed Mar 5, 2024

View reviewed changes

torchvision/csrc/io/image/cpu/exif.h Outdated Show resolved Hide resolved

vfdev-5 added 3 commits March 5, 2024 14:01

More updates

795ff42

Merge branch 'main' into read_image_exif_support

ec0b682

Fixed cpp code formatting

af933cb

vfdev-5 merged commit f3298dc into pytorch:main Mar 5, 2024
77 checks passed

vfdev-5 deleted the read_image_exif_support branch March 5, 2024 14:14

NicolasHug added enhancement module: io labels Mar 5, 2024

vfdev-5 mentioned this pull request Mar 6, 2024

Improved jpeg decoding code with exif #8302

Merged

NicolasHug mentioned this pull request May 31, 2024

Added a flag 'process_exif: bool = False' to 'read_image' #8109

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added support for EXIF orientation transform in read_image for JPEG #8279

Added support for EXIF orientation transform in read_image for JPEG #8279

vfdev-5 commented Feb 23, 2024 •

edited

Loading

pytorch-bot bot commented Feb 23, 2024 •

edited

Loading

vfdev-5 Feb 23, 2024

NicolasHug left a comment

NicolasHug Mar 4, 2024

vfdev-5 Mar 5, 2024

NicolasHug left a comment

github-actions bot commented Mar 5, 2024

Added support for EXIF orientation transform in read_image for JPEG #8279

Added support for EXIF orientation transform in read_image for JPEG #8279

Conversation

vfdev-5 commented Feb 23, 2024 • edited Loading

pytorch-bot bot commented Feb 23, 2024 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/vision/8279

✅ No Failures

vfdev-5 Feb 23, 2024

Choose a reason for hiding this comment

NicolasHug left a comment

Choose a reason for hiding this comment

NicolasHug Mar 4, 2024

Choose a reason for hiding this comment

vfdev-5 Mar 5, 2024

Choose a reason for hiding this comment

NicolasHug left a comment

Choose a reason for hiding this comment

github-actions bot commented Mar 5, 2024

vfdev-5 commented Feb 23, 2024 •

edited

Loading

pytorch-bot bot commented Feb 23, 2024 •

edited

Loading