intro tutorial: Analysing model reporters #1955

EwoutH · 2024-01-11T08:40:42Z

Some enhancements to the Mesa Introductory Tutorial:

Visualization Library Update: Updated the docs to mention Seaborn instead of matplotlib for data visualization.
Model Efficiency Improvement: Reduced the number of agents in batch runs, resulting in quicker model executions and generating more insightful data without compromising the educational value.
New Agent Reporter: Introduced a new agent reporter steps_not_given, which tracks the number of consecutive steps an agent hasn't transacted. This addition enriches the tutorial's analytical depth, demonstrating how to handle multiple reporters.
Improved Tutorial Layout and Structure: Enhanced the text layout and structure of the batch run section for better readability and understanding.
General Steps for Analysis: Added a section outlining general steps for analyzing model results, providing a structured approach for new users to follow and apply in their modeling endeavors.

It also fixes Readthedocs not having enough runtime to finish running the notebook, by focussing the batch_run on runs with fewer agents (which run faster).

Tested and renders correctly in Readthedocs. Please review and merge. Feel free to squash while merging.

codecov · 2024-01-11T08:41:55Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (a15ce9f) 79.45% compared to head (07ce5b9) 79.45%.
Report is 5 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #1955   +/-   ##
=======================================
  Coverage   79.45%   79.45%           
=======================================
  Files          15       15           
  Lines        1285     1285           
  Branches      285      285           
=======================================
  Hits         1021     1021           
  Misses        225      225           
  Partials       39       39

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

rht · 2024-01-11T09:28:20Z

I set the PR to ready for review so that I can read the rendered tutorial on the RTD CI.

EwoutH · 2024-01-11T09:43:56Z

@rht There's a KeyboardInterrupt in the batch_run. Was already there on the current docs, and I can't reproduce it locally. Do you know what's happening there? Maybe something with the multithreading?

EwoutH · 2024-01-11T10:05:10Z

Maybe the runtime is just too long for Read the Docs

Corvince · 2024-01-11T10:37:54Z

While you are at it. Can you also include a super().__init__() call in the model? Otherwise we will trigger the warning.

Also I find it suboptimal that another warning is raised, a FutureWarning regarding AgentSet. I think its a bit overwhelming for first-timers, especially since the model doesn't make any direct use the AgentSet feature (only indirectly in the schedulers)

Corvince · 2024-01-11T10:40:24Z

Also also: Can you fix the reference to the matplotlib installation in the beginning? The tutorial uses seaborn. And we have to install seaborn manually for the colab version, since its not a requirement of mesa

rht · 2024-01-11T13:03:24Z

docs/tutorials/intro_tutorial.ipynb

+   "outputs": [],
+   "source": [
+    "# Create a point plot with error bars\n",
+    "g = sns.pointplot(data=results_filtered, x=\"N\", y=\"Gini\", linestyle='none')\n",


This seems to be slightly duplicated from the previous paragraph

g = sns.scatterplot(data=results_filtered, x="N", y="Gini") g.set( xlabel="Number of agents", ylabel="Gini coefficient", title="Gini coefficient vs. number of agents", );

Describing both seems to be an exercise in showcasing the feature of Seaborn, because they are essentially showing the same info. But if I have to choose which one to keep, the sns.pointplot with error bars seems to be more informative.

rht · 2024-01-11T13:10:48Z

docs/tutorials/intro_tutorial.ipynb

+  {
+   "cell_type": "markdown",
+   "source": [
+    "In this case it looks like the Gini coefficient increases slower for smaller populations. This can be because of different things, either because the Gini coefficient is a measure of inequality and the smaller the population, the more likely it is that the agents are all in the same wealth class, or because there are less interactions between agents in smaller populations, which means that the wealth of an agent is less likely to change."


The Gini coefficient being the way it is, is due to preferential attachment. The inequality scales faster than linearly against the number of nodes. I don't think this paragraph has incorporated preferential attachment in the explanation.

Bit in doubt. Sometimes it's useful to show the same thing in two different ways to engrain the idea (in this case that it's just a dataframe that can be plotted in different ways).

Did you mean to reply to #1955 (comment) ? If you say so, you have to explicitly say it's a different way to visualize the same data.

rht · 2024-01-11T13:22:58Z

Maybe the runtime is just too long for Read the Docs

I assume it's because AgentSet is slightly slower than the previous version without it?

With less agents the model runs are quicker and so the batch_run method is finished earlier. The resulting data is also more interesting.

Add a steps_not_given agent reporter to the model that's used in the batch_run() function. This way we agent plots can be discussed.

EwoutH · 2024-01-11T14:56:34Z

Okay fixed all the stuff (see commit messages), ready for another round of review!

quaquel · 2024-01-12T09:21:37Z

Maybe the runtime is just too long for Read the Docs

I assume it's because AgentSet is slightly slower than the previous version without it?

I quickly ran 2.2 against 2.1.5 for a few example models yesterday. We indeed sacrificed a bit of performance for the convenience of AgentSet.

EwoutH · 2024-01-12T14:58:29Z

I updated the PR description. It renders correctly in Readthedocs.

Please review and merge. Feel free to squash while merging!

tpike3

Very nice additions on the data analysis thanks!

EwoutH · 2024-01-16T23:31:15Z

Note to self: Need to talk about nested / multidimensional aggregation somewhere. Like how to aggregate over multiple agents, over multiple iterations (and possibly over multiple timesteps). When to do what and how to do it properly.

(after having seen students just throwing all there data on a big pile and drawing a single average out of it. And then reporting a confidence interval which means god knows what)

Ideally maybe a full stack “how to perform experiments and report results with agent-based models”. Including:

Experimental setup (full factorial, scenario based, alternatives)
Metrics (KPIs) and the datacollector
Data aggregation and visualization

And the theory and how to do it in Mesa sandwiched.

intro tutorial: Analysing model reporters

2d7a6cd

rht marked this pull request as ready for review January 11, 2024 09:27

rht reviewed Jan 11, 2024

View reviewed changes

EwoutH added 5 commits January 11, 2024 15:19

intro tutorial: Use less agents in batch_run

63f9ec6

With less agents the model runs are quicker and so the batch_run method is finished earlier. The resulting data is also more interesting.

intro tutorial: Remove matplotlib, we use seaborn now

4e39e86

intro tutorial: Update batch run model with agent reporter

59818a0

Add a steps_not_given agent reporter to the model that's used in the batch_run() function. This way we agent plots can be discussed.

intro tutorial: Improve layout of batch run text

424be31

Intro tutorial: Add general steps for analysing results

07ce5b9

EwoutH force-pushed the tutorial_improvement_2024 branch from 428c25a to 07ce5b9 Compare January 11, 2024 14:55

tpike3 added the docs Release notes label label Jan 13, 2024

tpike3 approved these changes Jan 15, 2024

View reviewed changes

tpike3 merged commit f21d242 into projectmesa:main Jan 15, 2024
12 checks passed

EwoutH mentioned this pull request Jan 16, 2024

Tutorial improvement #1717

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

intro tutorial: Analysing model reporters #1955

intro tutorial: Analysing model reporters #1955

EwoutH commented Jan 11, 2024 •

edited

Loading

codecov bot commented Jan 11, 2024 •

edited

Loading

rht commented Jan 11, 2024

EwoutH commented Jan 11, 2024

EwoutH commented Jan 11, 2024

Corvince commented Jan 11, 2024

Corvince commented Jan 11, 2024 •

edited

Loading

rht Jan 11, 2024

rht Jan 11, 2024

EwoutH Jan 11, 2024

rht Jan 11, 2024

rht commented Jan 11, 2024

EwoutH commented Jan 11, 2024

quaquel commented Jan 12, 2024

EwoutH commented Jan 12, 2024

tpike3 left a comment

EwoutH commented Jan 16, 2024 •

edited

Loading

intro tutorial: Analysing model reporters #1955

intro tutorial: Analysing model reporters #1955

Conversation

EwoutH commented Jan 11, 2024 • edited Loading

codecov bot commented Jan 11, 2024 • edited Loading

Codecov Report

rht commented Jan 11, 2024

EwoutH commented Jan 11, 2024

EwoutH commented Jan 11, 2024

Corvince commented Jan 11, 2024

Corvince commented Jan 11, 2024 • edited Loading

rht Jan 11, 2024

Choose a reason for hiding this comment

rht Jan 11, 2024

Choose a reason for hiding this comment

EwoutH Jan 11, 2024

Choose a reason for hiding this comment

rht Jan 11, 2024

Choose a reason for hiding this comment

rht commented Jan 11, 2024

EwoutH commented Jan 11, 2024

quaquel commented Jan 12, 2024

EwoutH commented Jan 12, 2024

tpike3 left a comment

Choose a reason for hiding this comment

EwoutH commented Jan 16, 2024 • edited Loading

EwoutH commented Jan 11, 2024 •

edited

Loading

codecov bot commented Jan 11, 2024 •

edited

Loading

Corvince commented Jan 11, 2024 •

edited

Loading

EwoutH commented Jan 16, 2024 •

edited

Loading