
Benchmark reliability of torchbenchmarks #2527

Open
jerryzh168 opened this issue Oct 28, 2024 · 4 comments

@jerryzh168
Contributor

Recently I found that for the same model, the native benchmark code in torchbenchmarks does not give the expected time: one run is consistently slower than another, sometimes by up to 20%. I'm relying on torchao.utils.benchmark_model for now. Please help take a look to see what the problem might be.

For details please see: #2519

@seemethere
Member

This seems like an issue with the model code. Our expectation is that repo owners own the model code while our team owns the infrastructure.

@kit1980
Member

kit1980 commented Oct 31, 2024

I think the time variability from run to run is expected when running on a devgpu.
TorchBench servers have some special settings to reduce the variability.
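The "special settings" in question are typically GPU clock locking to reduce thermal and power-management jitter. The exact TorchBench server configuration is not shown in this thread; the commands below are an illustrative sketch only, and the clock values are placeholders that must match your device:

```shell
# Illustrative only: pin GPU clocks to reduce run-to-run timing variance.
# Requires root; valid values come from `nvidia-smi -q -d SUPPORTED_CLOCKS`.
sudo nvidia-smi -pm 1                      # enable persistence mode
sudo nvidia-smi --lock-gpu-clocks=1410     # lock SM clock (MHz, device-specific)
sudo nvidia-smi --lock-memory-clocks=1215  # lock memory clock (MHz, device-specific)
```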

@seemethere
Member

> I think the time variability from run to run is expected when running on a devgpu. TorchBench servers have some special settings to reduce the variability.

Oh so is this more of an infrastructure thing?

@jerryzh168
Copy link
Contributor Author

jerryzh168 commented Oct 31, 2024

I suspect this is related to the benchmarking code, since with the exact same setup, machine, etc., torchao.utils.benchmark_model gives stable results.
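For context, the usual way a benchmark harness damps run-to-run variance is to warm up first and then report the median of many timed iterations. The sketch below is a generic, pure-Python illustration of that technique, not the torchao.utils.benchmark_model implementation; for CUDA workloads you would additionally need to synchronize the device (e.g. torch.cuda.synchronize()) around each timed call, since kernel launches are asynchronous:

```python
import time
import statistics

def benchmark(fn, warmup=5, iters=20):
    """Generic timing loop: warm up first (JIT, caches, allocator),
    then report the median of many runs to damp variability."""
    for _ in range(warmup):
        fn()
    times = []
    for _ in range(iters):
        t0 = time.perf_counter()
        fn()
        times.append(time.perf_counter() - t0)
    return statistics.median(times)

# Example: time a small pure-Python workload.
med = benchmark(lambda: sum(i * i for i in range(10_000)))
print(f"median: {med * 1e6:.1f} us")
```

The median is preferred over the mean here because a single preempted or thermally throttled iteration would otherwise skew the result.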
