-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
train without projection layer #17
Comments
Yes, you can change the configuration in
|
I did so, but got some error. |
Ah sorry, I also forgot about this detail. You also need to add these arguments:
It will stil perform cross-frame-attention, because we set
|
Thank you so much!! It works! And btw, I wonder why the gpu memory usage is the same with or without the projection layer? It's not quite what I expected. |
The memory usage should not be the same. For me, |
Oh, I see. I used the |
That's expected, because
It means, that |
Okay, and also I wonder how many epochs did you train to get a good result. |
We trained for 60K iterations with 2xA100 GPUs using the |
Is there any way to train without the proposed projection layer? like the ablation study did?
The text was updated successfully, but these errors were encountered: