Allow Mesh-related queue phase systems to parallelize #11804

james7132 · 2024-02-10T00:58:31Z

Objective

Partially addresses #3548. queue_shadows and queue_material_meshes cannot parallelize because of the ResMut<RenderMeshInstances> parameter for queue_material_meshes.

Solution

Change the material_bind_group field to use atomics instead of needing full mutable access. Change the ResMut to a Res, which should allow both sets of systems to parallelize without issue.
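A minimal sketch of the idea, using only the standard library: an ID stored in an atomic can be written through a shared reference, so a system that only needs `Res` (modeled here as `&[MeshInstance]`) can run alongside other readers. The names `BindGroupId`, `AtomicBindGroupId`, `MeshInstance`, `casts_shadows`, and `run_queue_phase` are hypothetical stand-ins for illustration, not the actual Bevy types.

```rust
use std::num::NonZeroU32;
use std::sync::atomic::{AtomicU32, Ordering};
use std::thread;

/// Hypothetical bind group ID. Zero is reserved to mean "unset".
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct BindGroupId(pub NonZeroU32);

/// Hypothetical atomic cell holding an optional bind group ID.
#[derive(Default)]
pub struct AtomicBindGroupId(AtomicU32);

impl AtomicBindGroupId {
    /// Stores the ID through a shared reference; no `&mut` is needed.
    pub fn set(&self, id: Option<BindGroupId>) {
        let raw = id.map_or(0, |id| id.0.get());
        self.0.store(raw, Ordering::Relaxed);
    }

    /// Loads the ID, mapping the reserved zero value back to `None`.
    pub fn get(&self) -> Option<BindGroupId> {
        NonZeroU32::new(self.0.load(Ordering::Relaxed)).map(BindGroupId)
    }
}

/// Hypothetical per-mesh render data.
#[derive(Default)]
pub struct MeshInstance {
    pub casts_shadows: bool,
    pub material_bind_group: AtomicBindGroupId,
}

/// Stand-in for a system that records the bind group on each instance.
pub fn queue_materials(instances: &[MeshInstance], id: BindGroupId) {
    for instance in instances {
        instance.material_bind_group.set(Some(id));
    }
}

/// Stand-in for a system that only reads instance data.
pub fn queue_shadows(instances: &[MeshInstance]) -> usize {
    instances.iter().filter(|i| i.casts_shadows).count()
}

/// Runs both stand-in systems on separate threads over the same shared slice.
pub fn run_queue_phase(instances: &[MeshInstance], id: BindGroupId) -> usize {
    thread::scope(|s| {
        let materials = s.spawn(|| queue_materials(instances, id));
        let shadows = s.spawn(|| queue_shadows(instances));
        materials.join().unwrap();
        shadows.join().unwrap()
    })
}
```

If `queue_materials` took `&mut [MeshInstance]` instead, the borrow checker would reject `run_queue_phase`. That is roughly the same conflict a scheduler sees between `ResMut` and `Res` access to one resource.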

Performance

Tested against many_foxes, this shows a significant improvement across the entire render schedule. (Yellow is this PR, red is main)

The use of atomics does seem to have a negative effect on queue_material_meshes (roughly an 8.29% increase in time spent in the system).

queue_shadows seems to be ever so slightly slower (1.6% more time spent in the system).

batch_and_prepare_render_phase seems to be a mix, but overall seems to be slightly faster by about 5%.

hymm · 2024-02-19T19:29:17Z

Does this have a significant effect on single threaded perf?

superdump · 2024-02-19T21:29:14Z

Merge conflict. Code looks good. Could you test with a heavier single-material-type no-contention load like many_cubes?

IceSentry

LGTM assuming conflicts are fixed

james7132 · 2024-02-19T23:00:27Z

> Merge conflict. Code looks good. Could you test with a heavier single-material-type no-contention load like many_cubes?

Checked against main and this PR. There seems to be next to zero impact on the average time spent, which makes sense: uncontended atomics on x86 machines are essentially free. There's a bit more variance, but not by much. Here's queue_material_meshes with many_cubes.

james7132 · 2024-02-19T23:19:11Z

> Does this have a significant effect on single threaded perf?

Most modern platforms (with maybe the exception of wasm) treat all load/store operations as atomic, and so long as there is no contention on the same cache line, it's identical to single threaded performance. The only thing you might be missing out on is the compiler reordering your operations to be more optimal, which isn't a particular concern for these systems.

After #11804, the queue_prepass_material_meshes function is executed in parallel with other queue_* systems. This optimization introduced a potential issue: mesh_instance.should_batch() could return false in queue_prepass_material_meshes because material_bind_group_id had not yet been set.
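The ordering hazard described here can be sketched as follows. This assumes, for illustration only, that `should_batch()` simply checks whether the ID has been set and that zero means "unset". The `MeshInstance` type and the two `queue_*` stand-in functions are hypothetical and far simpler than the real systems.

```rust
use std::sync::atomic::{AtomicU32, Ordering};

/// Hypothetical mesh instance; 0 means "ID not yet set" in this sketch.
pub struct MeshInstance {
    material_bind_group_id: AtomicU32,
}

impl MeshInstance {
    pub fn new() -> Self {
        Self { material_bind_group_id: AtomicU32::new(0) }
    }

    /// Assumed rule for this sketch: batching requires a set ID.
    pub fn should_batch(&self) -> bool {
        self.material_bind_group_id.load(Ordering::Relaxed) != 0
    }
}

/// Stand-in for the system that records the material bind group ID.
pub fn queue_material(instance: &MeshInstance) {
    instance.material_bind_group_id.store(42, Ordering::Relaxed);
}

/// Stand-in for the prepass system, which only reads the ID.
pub fn queue_prepass(instance: &MeshInstance) -> bool {
    instance.should_batch()
}
```

Once the two stand-ins are no longer ordered relative to each other, `queue_prepass` may observe either state. If it runs first it sees an unset ID and declines to batch, even though the ID is written moments later.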

Allow Mesh-related queue phase systems to parallelize

aa4685d

james7132 added the A-Rendering (Drawing game state to the screen) and C-Performance (A change motivated by improving speed, memory usage or compile times) labels Feb 10, 2024

pcwalton reviewed Feb 10, 2024

crates/bevy_pbr/src/lightmap/mod.rs (outdated, resolved)

crates/bevy_pbr/src/material.rs (outdated, resolved)

JMS55 added this to the 0.13 milestone Feb 10, 2024

JMS55 approved these changes Feb 10, 2024

james7132 added 2 commits February 9, 2024 23:39

Fix typo

828f75e

Doc comments and use-import cleanup

e66a61b

james7132 requested a review from pcwalton February 10, 2024 07:53

james7132 added 2 commits February 9, 2024 23:53

Formatting

61c5b90

Fix CI

d9b5fc1

alice-i-cecile removed this from the 0.13 milestone Feb 12, 2024

james7132 added this to the 0.14 milestone Feb 19, 2024

superdump approved these changes Feb 19, 2024

IceSentry approved these changes Feb 19, 2024

Merge branch 'main' into paralleize-queue-phase

52c5e60

Formatting

78fab88

Remove the ResMut again

647e70f

james7132 enabled auto-merge February 20, 2024 00:00

james7132 added this pull request to the merge queue Feb 20, 2024

Merged via the queue into bevyengine:main with commit 6d547d7 Feb 20, 2024
23 checks passed

re0312 mentioned this pull request Jun 20, 2024

Fix prepass batch #13943

Merged
