You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
1.in your paper, says "we generate 4 images conditioned on the image prompt for each sample in the dataset, resulting in total 20,000 generated images for each method", did you generate 4 different images by different seed? or different captions?
2.when you got quantitative results, do your generated images have the same size with each validate coco image? (like width=600, height=400, same with coco eval annotations)
Would you be willing to provide the code for validating the quantitative indicators to other researchers?
Thank you for your work. When calculating CLIP-I and CLIP-T, did you use text prompt as an input into the model? Is the image prompt the only input?
The text was updated successfully, but these errors were encountered: