
Conversation

@crazydemo

This PR

  1. Use a while loop rather than a for loop in the benchmark, so that early stopping can actually take effect. Otherwise, the benchmark ends as soon as min_batches iterations are done, which may produce unstable performance results when min_batches is small (see the sketch after this list).
  2. Add min_batches and min_seconds to benchmark_params.
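
For reference, a minimal, hypothetical sketch of the while-based loop described in point 1; `run_benchmark`, `net`, `backend`, and the parameter defaults are illustrative stand-ins, not the project's actual API:

```python
import time

def run_benchmark(net, backend, sample, min_batches=100, min_seconds=5.0):
    # A plain for loop over range(min_batches) ends as soon as min_batches
    # iterations finish; this while loop keeps running until BOTH floors
    # (batch count and wall time) are satisfied, so a small min_batches no
    # longer cuts the measurement short.
    fw_times = []
    start = time.perf_counter()
    n = 0
    while n < min_batches or time.perf_counter() - start < min_seconds:
        s = time.perf_counter()
        net(backend.to_device(sample))  # the timed forward pass
        fw_times.append(time.perf_counter() - s)
        n += 1
    return fw_times
```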

@crazydemo crazydemo requested a review from ZhennanQin January 26, 2024 10:40
while True:
    s = get_time()
    x = backend.to_device(x)       # previous code: re-upload the loop variable each iteration
    x = backend.to_device(sample)  # this PR: upload the same pre-generated sample instead
Contributor

I think it's wrong that we pass a constant sample instead of part of our dataset; it might be cached in some way.


The main purpose of this change is to avoid the data_loader generating data during the benchmark. Otherwise this is a data_loader benchmark instead of an MLP benchmark. The data_loader won't be used in a production inference environment; instead, it's possible to use the same buffer holding different samples passed to the network. That means in a real production environment the input can be cached in some way, so it's not a problem.
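
A minimal, hypothetical illustration of the buffer-reuse pattern described here; `backend.copy_into` is an assumed in-place copy helper, not a real API in this project:

```python
def run_with_shared_buffer(net, backend, samples):
    # Allocate one device buffer up front; each iteration refills it with a
    # different pre-generated host sample, so no data loader work happens
    # inside the benchmarked loop.
    device_buf = backend.to_device(samples[0])
    for host_sample in samples:
        backend.copy_into(device_buf, host_sample)  # assumed helper
        net(device_buf)
```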

Contributor


There is no data generation during benchmarking. Right now we benchmark moving pre-generated data to the compute device plus the forward pass. The main measurement happens with fw_times, which is limited to net(backend.to_device(x)).

Author


Maybe we could have an alternative change in PR#75.
We could still pass the pre-generated test loader data to the forward pass, but then we would need to drop the first 3 steps' performance data from the benchmarking period, because we noticed the previous warm-up does not work: according to the oneDNN verbose output, the first 3 steps of the benchmarking period show quite poor performance (see the sketch below).
I noticed you have an open issue about moving the warm-up directly into the main pass. If that issue is resolved, maybe we can close this PR.
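
A hedged sketch of the drop-the-first-steps alternative described above; the names are illustrative, and the step count of 3 just matches the oneDNN observation:

```python
WARMUP_STEPS = 3  # matches the "first 3 steps are slow" oneDNN observation

def trimmed_times(fw_times, warmup_steps=WARMUP_STEPS):
    # Drop the initial measurements so the implicit warm-up does not skew
    # the reported statistics.
    return fw_times[warmup_steps:]
```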

@Egor-Krivov
Contributor

This PR

  1. use a while loop rather than a for loop in the benchmark, so that early stopping can actually take effect. Otherwise, the benchmark ends as soon as min_batches iterations are done, which may produce unstable performance results when min_batches is small.
  2. add min_batches and min_seconds to benchmark_params.

I don't mind point 2, but for point 1, can't you just pass a larger min_batches in this case?

@Egor-Krivov
Contributor

Merged an alternative version of this PR.

@Egor-Krivov Egor-Krivov closed this Feb 8, 2024
@Egor-Krivov Egor-Krivov deleted the zhangyan/fix_perf branch February 8, 2024 13:23