Reconstruction loss in ELBO #51

Open
Paulmzr opened this issue Sep 19, 2024 · 2 comments

Comments

Paulmzr commented Sep 19, 2024

Hi, thanks for your great work!

I noticed there is a `discretized_gaussian_log_likelihood` function that estimates the log-likelihood of the reconstruction from $x_1$. Since the VAE has already encoded the images into a continuous latent space, I am confused about why we need this function, which evaluates the log-likelihood of a Gaussian discretized to image ground-truth values. Why not directly use the MSE loss (i.e., $\|x_0 - x_0^{\text{reconstruct}}\|^2$) to optimize the log-likelihood of the reconstructed latent representation?

Looking forward to your reply. Thanks in advance!

LTH14 (Owner) commented Sep 19, 2024

Thanks for your interest! This VLB loss exactly follows the iDDPM and DiT design. However, we also conducted experiments without the VLB loss (reconstruction loss only), and the performance is the same.
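
For readers comparing the two options, here is a minimal scalar sketch of what an iDDPM-style discretized Gaussian log-likelihood computes, next to the continuous Gaussian log-density that an MSE loss corresponds to when the variance is fixed. It is not this repository's code. The assumptions are: values are scaled to [-1, 1] with 256 bins (half-bin width 1/255), the exact normal CDF from `math.erf` stands in for the tanh approximation used in iDDPM, and the helper `gaussian_log_density` is a hypothetical name introduced only for the comparison.

```python
import math


def standard_normal_cdf(z: float) -> float:
    # Exact CDF of N(0, 1); iDDPM uses a fast tanh-based approximation instead.
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def discretized_gaussian_log_likelihood(
    x: float, mean: float, log_scale: float, half_bin: float = 1.0 / 255.0
) -> float:
    # Log-probability that a Gaussian N(mean, exp(log_scale)^2) falls in the
    # bin [x - half_bin, x + half_bin]. The two edge bins extend to infinity.
    centered = x - mean
    inv_std = math.exp(-log_scale)
    cdf_plus = standard_normal_cdf(inv_std * (centered + half_bin))
    cdf_min = standard_normal_cdf(inv_std * (centered - half_bin))
    if x < -0.999:
        prob = cdf_plus            # lowest bin: everything below the upper edge
    elif x > 0.999:
        prob = 1.0 - cdf_min       # highest bin: everything above the lower edge
    else:
        prob = cdf_plus - cdf_min  # interior bin
    return math.log(max(prob, 1e-12))  # clamp to avoid log(0)


def gaussian_log_density(x: float, mean: float, log_scale: float) -> float:
    # Hypothetical helper: continuous Gaussian log-density. With a fixed
    # scale this is -(x - mean)^2 / (2 * sigma^2) plus a constant, i.e. a
    # scaled negative squared error.
    z = (x - mean) * math.exp(-log_scale)
    return -0.5 * math.log(2.0 * math.pi) - log_scale - 0.5 * z * z


if __name__ == "__main__":
    x, mean, log_scale = 0.1, 0.12, math.log(0.5)
    discrete = discretized_gaussian_log_likelihood(x, mean, log_scale)
    continuous = gaussian_log_density(x, mean, log_scale)
    # When the bin is narrow relative to sigma, the bin probability is close
    # to density * bin_width, so the two differ by roughly log(2 / 255).
    print(discrete, continuous + math.log(2.0 / 255.0))
```

In this sketch, for interior bins that are narrow relative to the standard deviation, the discretized term differs from the continuous log-density by an approximately constant offset (the log of the bin width). That is consistent with the two objectives behaving similarly in practice, although the sketch does not reproduce the experiments mentioned above.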

Paulmzr (Author) commented Sep 20, 2024

@LTH14 thanks for your response!
