-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[wip] support reading generic images #154
base: main
Are you sure you want to change the base?
Conversation
crates/kornia-io/src/functional.rs
Outdated
GenericImage::L8(Image::<u8, 1>::new(size, img.into_luma8().to_vec())?) | ||
} | ||
image::ColorType::Rgb8 => { | ||
GenericImage::Rgb8(Image::<u8, 3>::new(size, img.into_rgb8().to_vec())?) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
One more change is into_vec
(instead of to_vec
) as noted in #149 (comment) . Just wanted to remind you in case you missed it. (I haven't tried it myself, so please ignore if it has any issues/negatives).
into_vec()
consumes the image and gives us the inner vector without copy.to_vec
, on the other hand, seems to point toslice::to_vec
(according to rust-analyzer) which copies.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yep, my mistake
let image = read_image_any("../../tests/data/dog.jpeg")?; | ||
// let image: Image<u8, 3> = read_image_any("../../tests/data/dog.jpeg")?; | ||
let image: super::GenericImage = read_image_any("../../tests/data/dog.jpeg")?; | ||
// NOTE: then how to access the size? we need to reimplment the methods ?? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@emilmgeorge @jandremarais do you have any ideas here how to solve this ? Initially I wanted somehow that the read function could return the Image struct so that we can use directly the underlying api
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How about returning a Box<dyn AnyImage>
with
trait AnyImage {
fn size(&self) -> ImageSize;
}
impl<T: SafeTensorType, const C: usize> AnyImage for Image<T, C> {
fn size(&self) -> ImageSize {
self.size()
}
}
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
it's another option, however wouldn't be difficult to scale/maintain this as soon as we increase the functionality in Image ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think it would be more effort than the current enum approach but I can give it more thought. What if the AnyImage trait owns all the image functionality? Then we don't have to implement it twice
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I also think using traits to decouple the functionality may be a good direction, although I haven't fully thought it through.
If considering/implementing the above needs more time, another limited option for now, is to use the enum type only for returning and force the user to convert the enum to a concrete type before further operations. We can provide functions like into_rgb8() -> Result<Image<u8, 3>, XXError>
etc. in impl GenericImage {}
.
Based on use cases:
-
User who does not care about the pixel type at all
Not easily possible with this limited solution. User has to either handle all image types with match (maybe macros?) or convert to one specific type (as below).Trait based solutions may work for this use case.
-
User who cares only about the final image type
let gimg: super::GenericImage = read_image_any("../../tests/data/dog.jpeg")?; let gray = gimg.into_l8()?; // Any operation for Image<u8, 1>
-
User who wants to handle only specific type images
let gimg: super::GenericImage = read_image_any("../../tests/data/dog.jpeg")?; if let GenericImage::Rgb8(image) = gimg { // Any operation for Image<u8, 3> }
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
that sounds a good option. Also in this direction, i want start getting rid of image-rs
and implement directly the decoders we want, which can lead to have a pool of free functions to read from different formats in the format of read_rgb8
, read_l8
, etc. Possibly by curating first png
, jpeg,
exrand maybe
tiff` which are the most common formats used in ml/robotics
No description provided.