Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

generate images with arbitrary resolutions, #28

Open
Leiii-Cao opened this issue Sep 1, 2024 · 4 comments
Open

generate images with arbitrary resolutions, #28

Leiii-Cao opened this issue Sep 1, 2024 · 4 comments

Comments

@Leiii-Cao
Copy link

is there any way to use a checkpoint from that model to generate images with arbitrary resolutions,

@LTH14
Copy link
Owner

LTH14 commented Sep 1, 2024

You can consider using similar techniques as MaskGIT: first, generate a 256x256 image, and keep expanding it to meet the resolution requirement.

@Leiii-Cao
Copy link
Author

您可以考虑使用与MaskGIT类似的技术:首先,生成一个256x256的图像,并不断扩大它以满足分辨率要求。

I roughly understand your approach, but if I want to apply it to downstream tasks like image editing, wouldn’t the resolution of the edited images be limited, and wouldn’t it be challenging to work with arbitrary resolutions such as 345x566?

@LTH14
Copy link
Owner

LTH14 commented Sep 1, 2024

Since we use a learnable position embedding, it is hard to transfer it to different resolutions. If you want to generalize to different resolutions, you could consider using ROPE as the position embedding during training.

@Leiii-Cao
Copy link
Author

由于我们使用可学习的位置嵌入,因此很难将其迁移到不同的分辨率。如果您想要推广到不同的分辨率,可以考虑在训练期间使用 ROPE 作为位置嵌入。

Thank you for your explanation. Your answer was very helpful to me.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants