Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Performance-notebooks #4787

Open
wants to merge 19 commits into
base: develop
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
12 changes: 12 additions & 0 deletions docs/source/examples/index.rst
Original file line number Diff line number Diff line change
Expand Up @@ -33,6 +33,18 @@ The notebooks are organised into subfolders, and can be viewed in the galleries
notebooks/creating_models/6-a-simple-SEI-model.ipynb
notebooks/creating_models/7-creating-a-submodel.ipynb

.. nbgallery::
:caption: performance
:glob:

notebooks/performance/01-simulation-pipeline.ipynb
notebooks/performance/02-input-parameters.ipynb
notebooks/performance/03-pybamm-solvers.ipynb
notebooks/performance/04-interpolation-points.ipynb
notebooks/performance/05-solver-tolerances.ipynb
notebooks/performance/06-output-variables.ipynb
notebooks/performance/07-multithreading.ipynb

.. nbgallery::
:caption: Expression Tree
:glob:
Expand Down

Large diffs are not rendered by default.

347 changes: 347 additions & 0 deletions docs/source/examples/notebooks/performance/02-input-parameters.ipynb

Large diffs are not rendered by default.

159 changes: 159 additions & 0 deletions docs/source/examples/notebooks/performance/03-pybamm-solvers.ipynb

Large diffs are not rendered by default.

Original file line number Diff line number Diff line change
@@ -0,0 +1,267 @@
{
"cells": [
{
"cell_type": "markdown",
"metadata": {},
"source": [
"# Interpolation and evaluation points\n",
"\n",
"In this notebook we will discuss the use of interpolation and evaluation points in PyBaMM. These points are used to control the solver and the output of the simulation, and can be used to improve the performance of the solver, or if they are used incorrectly, can lead to decreased performance."
]
},
{
"cell_type": "code",
"execution_count": 1,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Note: you may need to restart the kernel to use updated packages.\n"
]
}
],
"source": [
"%pip install \"pybamm[plot,cite]\" -q # install PyBaMM if it is not installed\n",
"import pybamm\n",
"import numpy as np\n",
"import time"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"\n",
"## Solver timestepping\n",
"\n",
"Every PyBaMM solver solves the equations by starting from a set of initial conditions and then \"stepping\" forward in time. At each step, the solver solves the equations, including any algebraic conditions, and evaluates the local error to determine how long each time-step should be in order to satisfy the tolerances supplied by the user. Therefore, the progress of the solver is defined by a sequence of internal time-points at which the solver solved the equations. For example if we wanted to solve the equations from t=0 to t=10s, the solver might choose to solve the equations at the following time-points:\n",
"\n",
"```\n",
"|-------|-------------------------------|---------------------------|----------|------|-----|\n",
"0 0.9 4.9 7.3 8.5 9.1 10\n",
"```\n",
"\n",
"At each time point $i$, the solver will store the solution to the equations $\\mathbf{y}_i$ and perhaps some other relvent information about the timestep. At the end of the solve, the solver will return the solution comprised of the $n$ solutions $\\mathbf{y}_i$ at each time-point $t_i$. In our example above the solution would consist of $n=7$ solutions. This data can be later used to either plot the solution, or to evaluate any output variables that the user has requested. "
]
},
{
"cell_type": "code",
"execution_count": 2,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"solution was generated at times [ 0. 0.01 0.02 0.04 0.08 0.16 0.32 0.64 1.28 2.56 3.84 5.12\n",
" 7.68 10. ]\n"
]
}
],
"source": [
"sim = pybamm.Simulation(pybamm.lithium_ion.SPM(), solver=pybamm.IDAKLUSolver())\n",
"sol = sim.solve([0, 10])\n",
"print(\"solution was generated at times\", sol.t)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"You can see that the initial timestep used by the solver starts out quite small, but then exponentially increases as the solver becomes more confident in the solution. The method in this case is a multi-step method with variable order, so the solver is able to take longer steps with an increased order of accuracy as it builds up information over multiple steps.\n",
"\n",
"## Evaluation points\n",
"\n",
"Thus far, we have described how the solver chooses its own internal timepoints in order to solve the equations at. In some cases, however, these timepoints need to be supplied by the user to ensure an accurate solution. A trivial example of this is the start and end timepoints that you wish to perform the solve. Other points include the points at which the solution is discontinuous, which the solver cannot know in advance or detect. For this reason PyBaMM also allows the user to supply a set of evaluation points to the solver using the [`t_eval`](https://docs.pybamm.org/en/stable/source/api/solvers/base_solver.html#pybamm.BaseSolver.solve) argument to the `solve` function. The solver will then make sure to stop at each evaluation point and return the solution at these points. For example, say if `t_eval = [0, 5, 10]`, the solution returned by the solver might be:\n",
"\n",
"```\n",
"|-------|-----------------------------|--|--------|-----------------|-----------------|-----|\n",
"0 0.9 4.9 5 5.9 7.3 9 10\n",
"```\n",
"In this case the solver has returned not only the evaluation points requested, but also all the internal time points that it used.\n",
"\n",
"Normally, a PyBaMM user would not be required to supply any evaluation points other than the start and end points of the simulation. PyBaMM itself detects any discontinuous events in the model and adds these to the evaluation points before passing them to the solver. However, in some cases the user may know in advance that the solution is discontinuous at a certain point, and it is more accurate and numerically stable to supply this point to the solver. Note, however, that every additional evaluation point will increase the time taken to solve the equations as the solver will be reqired to take more steps to solve the equations, so only add additional evaluation points if required.\n",
"\n",
"Below we add an additional evaluation point at $t=5s$ to the example above."
]
},
{
"cell_type": "code",
"execution_count": 3,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"solution was generated at times [ 0. 0.01 0.02 0.04 0.08 0.16 0.32 0.64 1.28 2.56\n",
" 3.84 5. 5.005 5.01 5.02 5.04 5.08 5.16 5.32 5.64\n",
" 6.28 7.56 10. ]\n"
]
}
],
"source": [
"sim = pybamm.Simulation(pybamm.lithium_ion.SPM(), solver=pybamm.IDAKLUSolver())\n",
"sol = sim.solve(t_eval=np.array([0, 5, 10]))\n",
"print(\"solution was generated at times\", sol.t)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"Adding the evaluation point at $t=5s$ will force the solver to stop and restart integration at this point. You can see after this point the solver takes smaller steps as it restarts, and then gradually increases the step size again as it becomes more confident in the solution.\n",
"\n",
"## Interpolation points\n",
"\n",
"When evaluating output variables using the solution, if no interpolation points are provided PyBaMM will interpolate between the internal time-points to get the solution at the time-points requested by the user. For example, if the user requested the solution at $t=0.1s$, PyBaMM would interpolate the solution between $t=0$ and $t=0.9$ to get the solution at $t=0.1s$. However, often a user will know in advance the time-points at which they want to evaluate the solution, and it can be more efficient to simply provide these time-points to the solver in advance so that it can do this interpolation during the solve. \n",
"\n",
"The IDAKLU solver allows the user to provide a set of interpolation points to the solver using the [`t_interp`](https://docs.pybamm.org/en/stable/source/api/solvers/base_solver.html#pybamm.BaseSolver.solve) argument to the `solve` function. The solver will then interpolate the solution on-the-fly during the solve, and return the solution at the requested time-points. For example, say if `t_interp = [2, 4, 6, 8]`, and the solver takes the same internal time-points as in the example above, the solution returned by the solver would be:\n",
"\n",
"```\n",
"|-------|---------*----------------*----|--------------------*------|------*---|------|-----|\n",
"0 2 4 6 8 10\n",
"```\n",
"\n",
"where the `*` represent the solution at the requested time-points. The solver is still stepping to the same internal time-points in order to solve the equations, but it is also interpolating the solution at the requested time-points `*` and storing only these interpolated solutions. Therefore, only use interpolation points if you know in advance the time-points at which you want to evaluate the solution, as interpolating to any other time-point post-solve will be much less accurate.\n"
]
},
{
"cell_type": "code",
"execution_count": 4,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"solution was generated at times [ 0. 2. 4. 6. 10.]\n"
]
}
],
"source": [
"import pybamm\n",
"import numpy as np\n",
"\n",
"sim = pybamm.Simulation(pybamm.lithium_ion.SPM(), solver=pybamm.IDAKLUSolver())\n",
"sol = sim.solve(t_eval=[0, 10], t_interp=[2, 4, 6])\n",
"print(\"solution was generated at times\", sol.t)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"You can see that once we provide interpolation points to the solver, the solution will no longer store the solution at every internal time-point, but only at the interpolation and evaluation points provided.\n",
"\n",
"## Performance considerations\n",
"\n",
"We have already mentioned that adding additional evaluation points will increase the time taken to solve the equations, as the solver will be required to take more steps and will restart itself at each evaluation point, which can be computationally expensive. \n",
"\n",
"Using interpolation points can help to decrease the time taken by the solver, but only if the number of internal timesteps is much greater than the number of interpolation points. For example, the simulation below uses >3000 internal timesteps to compute the whole solution, but we know in advance that we only want to evaluate the solution at 1000 known time-points. In this case, providing interpolation points to the solver will be more efficient than storing all the internal time-points and then interpolating the solution post-solve. "
]
},
{
"cell_type": "code",
"execution_count": 5,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Number of internal time steps: 3601\n",
"Time to solve (no t_interp): 0.07621171302162111s\n",
"Time to solve (with t_interp): 0.030624893959611654s\n"
]
}
],
"source": [
"parameter_values = pybamm.ParameterValues(\"Chen2020\")\n",
"parameter_values.set_initial_stoichiometries(1)\n",
"experiment = pybamm.step.CRate(0.1, period=10, duration=36000)\n",
"sim = pybamm.Simulation(\n",
" pybamm.lithium_ion.DFN(),\n",
" solver=pybamm.IDAKLUSolver(),\n",
" parameter_values=parameter_values,\n",
" experiment=experiment,\n",
")\n",
"sol = sim.solve()\n",
"print(f\"Number of internal time steps: {len(sol.t)}\")\n",
"t_final = sol[\"Time [h]\"].entries[-1]\n",
"t_data = np.linspace(0, t_final, 1000)\n",
"\n",
"start_time = time.perf_counter()\n",
"sol = sim.solve()\n",
"voltage = sol[\"Terminal voltage [V]\"](t_data)\n",
"end_time = time.perf_counter()\n",
"print(f\"Time to solve (no t_interp): {end_time - start_time}s\")\n",
"\n",
"start_time = time.perf_counter()\n",
"sol = sim.solve(t_interp=t_data)\n",
"voltage = sol[\"Terminal voltage [V]\"].data\n",
"end_time = time.perf_counter()\n",
"print(f\"Time to solve (with t_interp): {end_time - start_time}s\")"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"In the opposite case, lets increase the number of interpolation points so that they are greater than the number of internal time-points. In this case, more work will be required in both cases due to the cost of interpolating at more points. However, storing these points instead of only the internal time-points will be costly, so using interpolation points is slower in this case."
]
},
{
"cell_type": "code",
"execution_count": 6,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Time to solve (no t_interp): 0.06396568496711552s\n",
"Time to solve (with t_interp): 0.12752164108678699s\n"
]
}
],
"source": [
"t_data = np.linspace(0, t_final, 10000)\n",
"start_time = time.perf_counter()\n",
"sol = sim.solve()\n",
"voltage = sol[\"Terminal voltage [V]\"](t_data)\n",
"end_time = time.perf_counter()\n",
"print(f\"Time to solve (no t_interp): {end_time - start_time}s\")\n",
"\n",
"start_time = time.perf_counter()\n",
"sol = sim.solve(t_interp=t_data)\n",
"voltage = sol[\"Terminal voltage [V]\"].data\n",
"end_time = time.perf_counter()\n",
"print(f\"Time to solve (with t_interp): {end_time - start_time}s\")"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "env",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.12"
}
},
"nbformat": 4,
"nbformat_minor": 2
}

Large diffs are not rendered by default.

Loading