Efficient vision foundation models for high-resolution generation and perception.
imagenet segmentation high-resolution vision-transformer efficientvit segment-anything deep-compression-autoencoder efficient-diffusion-model
Updated Apr 3, 2025 - Python