Benchmark reliability of torchbenchmarks #2527
Comments
This looks like an issue with the model code. Our expectation is that repo owners own the model code, while our team owns the infrastructure.
I think some run-to-run timing variability is expected when running on a devgpu.
Oh, so is this more of an infrastructure issue?
I feel this might be related to the benchmarking code, since with the exact same setup, machine, etc.
Recently I found that, for the same model, the native benchmark code in torchbenchmarks does not give the expected time: one measurement is consistently slower than the other, sometimes by up to 20%. I'm relying on torchao.utils.benchmark_model for now. Please help take a look to see what the problem might be.
For details please see: #2519
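
To separate normal run-to-run noise from a consistent gap between two benchmark paths, it can help to time both with the same harness and compare medians against the observed spread. The following is a minimal sketch, not the implementation of torchao.utils.benchmark_model or of the torchbenchmarks code; `benchmark`, `relative_gap`, and the `sync` parameter are hypothetical names introduced for illustration. On a GPU, `sync` would typically be `torch.cuda.synchronize`, so that the timer only stops after queued kernels have finished.

```python
import statistics
import time
from typing import Callable, Dict, Optional


def benchmark(
    fn: Callable[[], object],
    warmup: int = 3,
    repeats: int = 20,
    sync: Optional[Callable[[], None]] = None,
) -> Dict[str, float]:
    """Time fn() with warmup and repeated runs; return summary stats in ms.

    sync is a hypothetical hook: pass torch.cuda.synchronize for GPU work so
    asynchronous kernels are finished before the timer is read.
    """
    # Warmup runs absorb one-time costs (compilation, caches, lazy init).
    for _ in range(warmup):
        fn()
    samples = []
    for _ in range(repeats):
        if sync is not None:
            sync()  # make sure earlier work is not counted in this sample
        start = time.perf_counter()
        fn()
        if sync is not None:
            sync()  # wait for the work launched by fn() to complete
        samples.append((time.perf_counter() - start) * 1e3)
    median = statistics.median(samples)
    # Relative spread: how wide the samples are compared with the median.
    spread = (max(samples) - min(samples)) / median if median > 0 else 0.0
    return {
        "median_ms": median,
        "min_ms": min(samples),
        "max_ms": max(samples),
        "rel_spread": spread,
    }


def relative_gap(a_ms: float, b_ms: float) -> float:
    """Gap between two positive timings, relative to the faster one."""
    return abs(a_ms - b_ms) / min(a_ms, b_ms)
```

If the relative gap between the two medians is clearly larger than the relative spread of either run, the difference is unlikely to be explained by noise alone, which would point at the benchmarking path rather than the machine.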