Hi Tianhong,

I have trained MAR on 1D unordered latents. It works fine for 256 tokens with 64 channels; the loss converges at 0.35. However, when training on 1k or 2k tokens with 64 channels, the loss converges at 0.45 and the results look bad, even though the VAE reconstruction quality is higher than with 256 tokens. Any suggestions? Thanks!
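For reference, a minimal sketch of the two setups (shapes only; the batch size and variable names are illustrative, not taken from my training script):

```python
import torch

# 256 unordered latent tokens, 64 channels each -> loss converges at ~0.35
latents_small = torch.randn(16, 256, 64)   # (batch, seq_len, token_embed_dim)

# 1k (or 2k) tokens, same 64 channels -> loss plateaus at ~0.45, samples look bad
latents_large = torch.randn(16, 1024, 64)  # (batch, seq_len, token_embed_dim)
```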
Hi,
My tokens are not from 2D images, so I don't have a vae_stride, and my token_embed_dim = vae_embed_dim = 64.
When I use token_embed_dim=64 and seq_len=buffer_size=256, training converges fast and generates good results.
So, when I increase self.seq_len, should I also increase buffer_size during training and increase num_iter in sample_tokens accordingly? (See the sketch below.)
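To make the question concrete, here is a rough sketch of the scaling I have in mind (parameter names follow my setup; the proportional rule is my assumption, not something from the MAR code):

```python
# Baseline that works well for me:
seq_len = 256
buffer_size = 256          # I currently set buffer_size = seq_len
num_iter = 64              # autoregressive steps I pass to sample_tokens()

# Proposed scaling when moving to longer sequences (this is the question):
new_seq_len = 1024
scale = new_seq_len // seq_len            # 4x more tokens
new_buffer_size = buffer_size * scale     # keep buffer_size = seq_len?
new_num_iter = num_iter * scale           # keep ~4 tokens decoded per step?
```

In other words, this scaling would keep the average number of tokens decoded per sampling iteration roughly constant.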