Add script to run benchmarks #117

ivanpauno · 2023-02-13T20:57:31Z

Related to #35.

Summary

Adds a script that replays a rosbag repeatedly, configuring amcl with a different number of particles on each run.
For each run it records another bagfile with the result, the timem results, and optionally perf events data.
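
As a rough illustration of the sweep described above (not the script added in this PR), the per-run outputs could be planned like this. The function name `plan_runs`, the particle counts, and the `timem`/`perf.data` names are hypothetical; only the `benchmark_<N>_particles_output/rosbag` pattern mirrors the path used in the `evo_ape` command later in this thread.

```python
from pathlib import Path


def plan_runs(particle_counts, output_root, record_perf=False):
    """Describe one benchmark run per particle count (hypothetical layout)."""
    runs = []
    for count in particle_counts:
        run_dir = Path(output_root) / f"benchmark_{count}_particles_output"
        run = {
            "particles": count,
            "rosbag_dir": run_dir / "rosbag",  # recorded output bagfile
            "timem_dir": run_dir / "timem",    # assumed directory name
        }
        if record_perf:
            run["perf_file"] = run_dir / "perf.data"  # assumed file name
        runs.append(run)
    return runs


# Example sweep; the particle counts are illustrative only.
for run in plan_runs([250, 500, 1000], "results"):
    print(run["particles"], run["rosbag_dir"])
```

Keeping one directory per particle count makes it straightforward to point a postprocessing tool at each run separately.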

What is still missing is the postprocessing steps.

qq: does the current rosbag provide a groundtruth topic?
We will need that to use a tool like evo.

Checklist

Read the contributing guidelines.
Configured pre-commit and ran colcon test locally.
Signed all commits for DCO.
Added tests (regression tests for bugs, coverage of new code for features).
Updated documentation (as needed).
Checked that CI is passing.

nahueespinosa

@ivanpauno Left some comments!

does the current rosbag provide a groundtruth topic?

I didn't record an extra topic for ground truth since odom-to-base_link is the ground truth in the perfect_odometry bagfile, but +1 to having ground truth in bagfiles. Note that for this case, in evo you can compare odom-to-base_link and map-to-base_link.

scripts/benchmarking/parameterized_run.sh

ivanpauno · 2023-02-14T13:57:50Z

in the perfect_odometry bagfile

Lol, I didn't pay attention to the name of the bag haha.
Ok, that's enough to get evo to work.

ivanpauno · 2023-02-16T21:08:21Z

Note that for this case, in evo you can compare odom-to-base_link and map-to-base_link.

I tried to make some progress on this.
The problem is that evo doesn't support tf topics for ROS 2 bags, only for ROS 1 bags.
I tried converting the bag with rosbags, which doesn't require ROS 1 to be installed.
However, evo does require ROS 1 when using its rosbag reader.

It seems that the easy solution is to use a bag with a ground truth topic (or check how to add tf support to evo).

olmerg · 2023-02-17T15:03:52Z

Maybe we can use the Odometry message that the flatland plugin publishes on its ground truth topic to obtain the ground truth. Odometry messages are supported by evo.

ivanpauno · 2023-02-17T18:14:37Z

Maybe we can use the Odometry message that the flatland plugin publishes on its ground truth topic to obtain the ground truth. Odometry messages are supported by evo.

Yes, I think that's the easiest thing to do. I will record a new bag that includes the odometry topic.

ivanpauno · 2023-02-23T19:57:17Z

Results:
benchmark.zip

The results in the folder benchmark were generated with a Release build.
The results in the folder benchmark with profiling were generated with a RelWithDebInfo build using -fno-omit-frame-pointer.

I'm failing to generate results for nav2_amcl; for some reason it's segfaulting now...

To see the timem results, use:

/ws/src/scripts/benchmarking/timem_metrics_results /path/to/folder/with/timem/json/output

To get APE metrics:

evo_ape bag2 benchmark_1000_particles_output/rosbag/ /odometry/ground_truth /pose
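
For readers unfamiliar with the metric, here is a minimal sketch of a translation-only absolute pose error (APE) over 2D positions. It is not evo's implementation: evo also handles full poses, alignment, and other options. The function names, the `(t, x, y)` sample format, and the `max_dt` tolerance are assumptions made for illustration, and the reference samples are assumed to be sorted by time.

```python
import bisect
import math


def associate(reference, estimate, max_dt=0.01):
    """Pair each estimate sample with the nearest-in-time reference sample."""
    ref_times = [t for t, _, _ in reference]
    pairs = []
    for sample in estimate:
        t = sample[0]
        i = bisect.bisect_left(ref_times, t)
        # Only the neighbours around the insertion point can be the nearest.
        candidates = [j for j in (i - 1, i) if 0 <= j < len(reference)]
        if not candidates:
            continue
        j = min(candidates, key=lambda k: abs(ref_times[k] - t))
        if abs(ref_times[j] - t) <= max_dt:
            pairs.append((reference[j], sample))
    return pairs


def translation_ape(reference, estimate, max_dt=0.01):
    """Return RMSE, mean, and max of the position error over matched samples."""
    pairs = associate(reference, estimate, max_dt)
    if not pairs:
        raise ValueError("no samples could be associated")
    errors = [math.hypot(ex - rx, ey - ry) for (_, rx, ry), (_, ex, ey) in pairs]
    return {
        "rmse": math.sqrt(sum(e * e for e in errors) / len(errors)),
        "mean": sum(errors) / len(errors),
        "max": max(errors),
    }


ground_truth = [(0.0, 0.0, 0.0), (1.0, 1.0, 0.0), (2.0, 2.0, 0.0)]
estimated = [(0.0, 0.0, 0.0), (1.0, 1.0, 0.3), (2.0, 2.0, 0.4)]
print(translation_ape(ground_truth, estimated))
```

The nearest-timestamp association with a tolerance is one simple choice; samples without a close enough reference are dropped rather than interpolated.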

ivanpauno · 2023-02-23T20:11:20Z

The issue I'm running into is ros-navigation/navigation2#3311.
I think it was fixed for rolling (see here), but it seems it wasn't backported to humble.

ivanpauno · 2023-02-23T20:52:22Z

I was finally able to record the same cases for nav2 (it doesn't seem to fail when recovery_alpha_fast/recovery_alpha_slow are commented out, so I did that).

benchmark_nav2.zip

(edit: uploaded new zip, as I forgot to record the /amcl_pose topic)

ivanpauno · 2023-02-24T17:00:54Z

I ran both again with some changes:

beluga
- Used covariance 0 for the initial pose, i.e. all initial particles in the same state.
  That's what nav2 does if you use the parameters.
amcl
- Changed recovery_alpha_fast/recovery_alpha_slow to the default, i.e. disabled.
  That avoids the crash mentioned above. I cannot disable this in beluga, but I think it shouldn't affect the comparison much.
- Ran the tests again without using Google Meet at the same time 😄
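
To make the "covariance 0" point above concrete, here is a small sketch of drawing initial particles from a Gaussian around the initial pose. It is not the beluga or nav2 code: `sample_initial_particles` is a hypothetical helper, and it assumes a diagonal covariance expressed as per-dimension standard deviations for `(x, y, yaw)`.

```python
import random


def sample_initial_particles(mean, std_devs, count, seed=None):
    """Draw `count` states, sampling each dimension independently."""
    rng = random.Random(seed)
    return [
        tuple(rng.gauss(m, s) for m, s in zip(mean, std_devs))
        for _ in range(count)
    ]


initial_pose = (1.0, 2.0, 0.5)  # illustrative x, y, yaw

# Zero spread: every particle starts in exactly the same state.
identical = sample_initial_particles(initial_pose, (0.0, 0.0, 0.0), 5)
print(set(identical))

# Non-zero spread: particles are scattered around the initial pose.
scattered = sample_initial_particles(initial_pose, (0.1, 0.1, 0.05), 5, seed=0)
print(len(set(scattered)))
```

With zero spread, any difference between the two localizers at startup cannot come from how the initial particle cloud was sampled.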

benchmark.zip

I'm working on a script to easily compare both results.

hidmic · 2023-02-27T19:27:21Z

I don't think it's useful for the comparison

Not for a comparison, but I wonder why the APE maximum is consistently higher for beluga. Is it a transient that is quickly corrected? Is it a larger variance in the distribution (i.e. beluga being less consistent in its performance than nav2_amcl)?

ivanpauno · 2023-02-27T19:31:22Z

Not for a comparison, but I wonder why the APE maximum is consistently higher for beluga. Is it a transient that is quickly corrected? Is it a larger variance in the distribution (i.e. beluga being less consistent in its performance than nav2_amcl)?

Yes, I guessed it was that.
I will probably add that in another PR to limit the scope of this one, but I think it's a good idea to have that available as well.

ivanpauno · 2023-02-28T13:38:30Z

I will probably add that in another PR to limit the scope of this one, but I think it's a good idea to have that available as well.

I see that's already provided by evo, see 1a50027.

The maximum APE error seems to happen just before the bagfile ends (i.e. before shutdown).

It happens pretty consistently for beluga, and not for nav2_amcl.
It doesn't seem to be something to worry much about ...
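
One way to check where the maximum error falls within a run is to look at the error as a time series. The sketch below is only an illustration under assumptions: `locate_peak` is a hypothetical helper, and the input is assumed to be a time-ordered list of `(timestamp, error)` pairs, for example exported from an APE computation.

```python
def locate_peak(timed_errors):
    """Return (time, value, progress) of the largest error.

    `progress` is the fraction of the run elapsed when the peak occurs,
    from 0.0 at the first sample to 1.0 at the last one.
    """
    if not timed_errors:
        raise ValueError("empty error series")
    t_start = timed_errors[0][0]
    t_end = timed_errors[-1][0]
    t_peak, e_peak = max(timed_errors, key=lambda sample: sample[1])
    duration = t_end - t_start
    progress = (t_peak - t_start) / duration if duration > 0 else 0.0
    return t_peak, e_peak, progress


# Illustrative series: the error spikes just before the end of the run.
series = [(0.0, 0.10), (5.0, 0.20), (9.5, 0.90), (10.0, 0.30)]
t_peak, e_peak, progress = locate_peak(series)
print(f"peak {e_peak:.2f} at t={t_peak:.1f} ({progress:.0%} of the run)")
```

A peak that sits in the last few percent of the run points to an end-of-bag effect rather than to a generally wider error distribution.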

nahueespinosa

@ivanpauno I have only minor comments.

beluga_benchmark/CMakeLists.txt

beluga_benchmark/docs/BENCHMARKING.md

docker/images/humble/Dockerfile

nahueespinosa

@ivanpauno LGTM!

Related to #35. Adds a script that allows running a rosbag repeatedly, configuring amcl with different number of particles. It records another bagfile with the result, the results of timem, and optionally perf events data. It also adds postprocessing scripts, and instructions of how to run them. Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

ivanpauno added the enhancement New feature or request label Feb 13, 2023

ivanpauno requested a review from nahueespinosa February 13, 2023 20:57

ivanpauno self-assigned this Feb 13, 2023

nahueespinosa reviewed Feb 14, 2023

ivanpauno added 15 commits February 23, 2023 16:49

Automatically fix perf executable if needed

0f8d2d3

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Configure bagfile output

24d6052

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Add script to run benchmark with different amount of particles

729445c

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Add postprocessing dependencies

7418ec0

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Add copyright notice to bash scripts

88012c8

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Add new script to postprocess timem results

e1f53a3

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Also record /odom

394ca9b

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Fix parameter name

9e900fe

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Add evo to docker image

b7867a4

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Remove unused param

d8de082

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Record ground truth topic in bagfile

0cb260a

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Fix pose timestamp

7c5f573

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Replace perfect_odometry bag for one with ground truth topic

69d5de3

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Address peer review comments

d5dfa2a

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Fix

7343ed3

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

ivanpauno force-pushed the ivanpauno/script-to-benchmark branch from 1183461 to 7343ed3 Compare February 23, 2023 19:49

Add /amcl_pose to topics to record

0103034

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@gmail.com>

* Move benchmark/profiling scripts and docs to new package.

bc24c85

* Add script to plot results comparision. Signed-off-by: Ivan Santiago Paunovic <ivanpauno@gmail.com>

Make both arguments in compare_results script required

91f253e

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@gmail.com>

ivanpauno added 6 commits February 27, 2023 17:42

Move beluga_benchmark dependencies from dockerfile to package.xml

cc3f11c

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

* Update PROFILING.md

96a56bc

* Remove .sh extension from executable scripts. Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Remove .sh suffix from parameterized_run

bf69d8e

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Install scripts correctly so they can be found by ros2run

3c78679

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Remove unnecessary __init__.py file

27129e2

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Add benchmarking documentation

1a50027

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

ivanpauno added 2 commits February 28, 2023 10:39

fix

b7fc17b

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Rename scripts and python modules to improve consistency

5bbd671

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

ivanpauno requested a review from nahueespinosa February 28, 2023 13:49

Fix references

4173a1d

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

nahueespinosa reviewed Feb 28, 2023

nahueespinosa added the python Related to Python code label Feb 28, 2023

ivanpauno and others added 7 commits February 28, 2023 15:13

Remove unused line

cc6b7d4

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Reword list of recorded metrics/events

a85e6d0

Co-authored-by: Nahuel Espinosa <nespinosa@ekumenlabs.com> Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Reword sentence

098f660

Co-authored-by: Nahuel Espinosa <nespinosa@ekumenlabs.com> Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Reword sentence

38e7744

Co-authored-by: Nahuel Espinosa <nespinosa@ekumenlabs.com> Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Fix nit

5910649

Co-authored-by: Nahuel Espinosa <nespinosa@ekumenlabs.com> Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Use "ALL CAPS" for acronyms

b4aa404

Co-authored-by: Nahuel Espinosa <nespinosa@ekumenlabs.com> Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

Alpha order + pin evo version

09b65ad

Signed-off-by: Ivan Santiago Paunovic <ivanpauno@ekumenlabs.com>

ivanpauno requested a review from nahueespinosa February 28, 2023 18:22

nahueespinosa approved these changes Feb 28, 2023

ivanpauno merged commit f93602e into main Feb 28, 2023

ivanpauno deleted the ivanpauno/script-to-benchmark branch February 28, 2023 21:26

hidmic mentioned this pull request Apr 18, 2023

Use LAMBKIN for performance benchmarking #163

Closed

3 tasks

nahueespinosa mentioned this pull request Jul 10, 2023

Fix perfect_odometry rosbag #238

Merged

7 tasks
