Optimize "duplicate removal" pass #3

Open
zvxryb opened this issue Feb 21, 2019 · 2 comments
zvxryb (Owner) commented Feb 21, 2019

The collision detection process often produces duplicate values; this is expected because each object can produce up to four (2D) or eight (3D) distinct indices, and each one must be tested independently. When two objects collide and at least one has multiple indices, the same potential collision may be emitted more than once.

To avoid unexpected results from the user's perspective (re-running collision handlers unnecessarily), duplicates are removed from the results. This is currently implemented by inserting results into a HashSet before returning them from detect_collisions.
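
A minimal sketch of that pass, assuming a hypothetical CollisionPair type and candidate iterator (not the crate's actual API):

```rust
use std::collections::HashSet;

// Hypothetical stand-in for whatever detect_collisions yields; it only needs
// Eq + Hash for the HashSet-based pass to work.
#[derive(Clone, Copy, PartialEq, Eq, Hash, Debug)]
struct CollisionPair(u32, u32);

// Deduplicate candidate pairs by collecting them into a HashSet; insertion
// order is lost and every insert pays a hashing cost.
fn dedup_via_hashset(candidates: impl IntoIterator<Item = CollisionPair>) -> Vec<CollisionPair> {
    let unique: HashSet<CollisionPair> = candidates.into_iter().collect();
    unique.into_iter().collect()
}
```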

This process is the current limiting factor for performance, taking significantly more time than either index calculation or actual collision detection for the example application.

Alternatives already tested:

  • Using a Vec and calling sort_unstable() followed by dedup(): this is the slowest option.
  • Using a Vec and calling rayon's par_sort_unstable() followed by dedup(): almost as fast as the HashSet, but it uses more total CPU time.
  • Different hashers: I've tested the default std hasher as well as hashers from fnv, rustc_hash, and murmurhash64. Of these, rustc_hash is the fastest (and the current default), followed by fnv, with std being the slowest. (A sketch of the hasher swap follows this list.)
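
Swapping the hasher doesn't change the structure of the pass, only the hashing algorithm. A minimal sketch, again assuming a hypothetical CollisionPair type, using the type aliases exported by rustc_hash and fnv:

```rust
use fnv::FnvHashSet;       // fnv = "1"
use rustc_hash::FxHashSet; // rustc_hash = "1"

// Hypothetical pair type, as in the earlier sketch.
#[derive(Clone, Copy, PartialEq, Eq, Hash, Debug)]
struct CollisionPair(u32, u32);

// Same HashSet-based pass, but using rustc_hash's FxHash (fastest in the tests above).
fn dedup_fx(candidates: impl IntoIterator<Item = CollisionPair>) -> FxHashSet<CollisionPair> {
    candidates.into_iter().collect()
}

// ...and using FNV, which came in second.
fn dedup_fnv(candidates: impl IntoIterator<Item = CollisionPair>) -> FnvHashSet<CollisionPair> {
    candidates.into_iter().collect()
}
```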

Either continue optimizing this pass, or provide an option for the user to obtain the potential-collision results with duplicates included (so they can implement an application-specific solution).

zvxryb (Owner, Author) commented Feb 21, 2019

Additional note: this may be less of an issue in real applications; the example is meant to be a sort of "stress test" and has 1500 dynamic entities all in constant collision.

In real applications, there are likely to be fewer collisions (per object) at any given time and the duplicate removal pass is likely to operate on a much smaller set of data.

zvxryb (Owner, Author) commented Feb 28, 2019

Dropped HashSet entirely:

  1. It doesn't parallelize well (multiple concurrent writers aren't allowed, and there's no efficient way to merge existing HashSets).
  2. Its performance advantage over sort_unstable()/dedup() in the single-threaded case has become insignificant.
  3. Sorting gives a more sensible output order, and potentially better cache usage, since IDs end up sorted in the output.

This is still the most expensive part of scan()/par_scan().
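
For reference, a minimal sketch of the sort-then-dedup path in both the single-threaded and the rayon-parallel form; the CollisionPair type and function names are hypothetical, not the crate's API:

```rust
use rayon::prelude::*; // rayon = "1", for par_sort_unstable()

// Hypothetical pair type; deriving Ord makes it sortable, which is what dedup() relies on.
#[derive(Clone, Copy, PartialEq, Eq, PartialOrd, Ord, Debug)]
struct CollisionPair(u32, u32);

// Single-threaded path (scan()): sort, then drop adjacent duplicates.
fn dedup_sorted(mut pairs: Vec<CollisionPair>) -> Vec<CollisionPair> {
    pairs.sort_unstable();
    pairs.dedup();
    pairs
}

// Parallel path (par_scan()): rayon parallelizes the sort; dedup() itself is
// sequential, but it only scans data that is already sorted.
fn dedup_sorted_par(mut pairs: Vec<CollisionPair>) -> Vec<CollisionPair> {
    pairs.par_sort_unstable();
    pairs.dedup();
    pairs
}
```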
