-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
About pretrain time #21
Comments
Hi @1787648106 , V100*8. Tiny:~1.5 days; Small:~1.8 days; Base:~2.2 days Large:~2.5days |
thx for your response ~ |
hello~I pretrained the model on ImageNet1k, but the training time kept about ~5 days. A800 * 8, base model version, amp on. Are there some accelerating skills used ? Thanks again |
@wwqq hello, I'm so sorry to bother you again. About the accelerating in pretrain. |
@1787648106 Hello, try setting the batch size and learning rate to twice their original values. |
Thank you very much for your excellent work. May I ask, what is your experimental hardware configuration? For example, the model of the GPU; In addition, how long did you complete the pre training on imagenet1k?
The text was updated successfully, but these errors were encountered: