Performance Enhancements #151

Grochocinski · 2025-03-17T18:31:22Z

SimOpt Core

Fixed return typing for objectives and stoch_constraint stats
Fixed factor_settings_filename not being an optional argument for a DataFarmingExperiment
Changed multiprocessing from map_async to imap_unordered to reduce busy waiting and overall memory usage
Changed multiprocessing pool to use the min of (# CPU cores, # macroreps) to prevent spawning processes that will never be used or spawning more processes than can be handled at once
Fixed directory type hints to indicate subclasses of Problems/Solvers/Models are being returned rather than instances of the parent class
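
The two multiprocessing changes above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the SimOpt implementation: `choose_pool_size`, `run_macrorep`, and `run_all_macroreps` are hypothetical names, and the worker body is a stand-in for a real macroreplication.

```python
import os
from multiprocessing import Pool
from typing import Optional


def choose_pool_size(n_macroreps: int, cpu_count: Optional[int] = None) -> int:
    """Return min(# CPU cores, # macroreps), never less than 1."""
    if cpu_count is None:
        cpu_count = os.cpu_count() or 1  # os.cpu_count() may return None
    return max(1, min(cpu_count, n_macroreps))


def run_macrorep(mrep: int) -> tuple:
    """Hypothetical stand-in for one macroreplication; returns (index, result)."""
    return mrep, mrep * mrep


def run_all_macroreps(n_macroreps: int) -> list:
    """Run every macrorep in a pool and collect results as they finish."""
    results = [None] * n_macroreps
    with Pool(processes=choose_pool_size(n_macroreps)) as pool:
        # imap_unordered yields each result as soon as its worker finishes,
        # so the parent consumes results lazily instead of holding one big batch.
        for mrep, value in pool.imap_unordered(run_macrorep, range(n_macroreps)):
            results[mrep] = value  # completion order is arbitrary, so index explicitly
    return results
```

On platforms that start workers with spawn, `run_all_macroreps(10)` should be called from under an `if __name__ == "__main__":` guard. Returning the macrorep index with each result is one way to restore ordering, since `imap_unordered` does not preserve it.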

Problems/Models/Solvers

Optimized every Model's replicate() function and every Solver's solve() function to reduce unnecessary calculations, optimize the remaining calculations, and improve the general clarity of the code (see comment below for performance changes)
Discovered two issues during the rewrite: "Off By One Indexing in FixedSAN" (#153) and "IRONORE Ignoring mean_price Value" (#154)

Testing

Refactored testing scripts with child/parent class layout to vastly reduce duplicate code and prevent large future refactorings
Introduced script to dynamically build test classes based on existing YAML files, allowing VS Code and unittest to automatically find the most up-to-date testing files
Updated expected results to align with prerelease mrg32k3a V1.1
Modified testing logic and base classes to prevent numpy's binary values from being dumped to expected YAMLs
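
The parent/child layout and dynamic class generation described above might look something like the sketch below. Everything named here is an assumption for illustration: `ExperimentTestBase`, `build_test_classes`, the file-name pattern, and the `expected_results` directory are hypothetical, and the single test method only checks the file suffix rather than running an experiment.

```python
import re
import unittest
from pathlib import Path


class ExperimentTestBase(unittest.TestCase):
    """Hypothetical parent class holding the logic shared by every experiment test."""

    expected_file = None  # each generated child class overrides this with a Path

    def test_expected_file_is_yaml(self):
        if self.expected_file is None:
            self.skipTest("parent class has no expected-results file")
        self.assertEqual(self.expected_file.suffix, ".yaml")


def build_test_classes(yaml_files):
    """Create one TestCase subclass per expected-results YAML file."""
    classes = {}
    for path in sorted(yaml_files):
        # Turn e.g. "CNTNEWS-1_ADAM.yaml" into the identifier "Test_CNTNEWS_1_ADAM".
        class_name = "Test_" + re.sub(r"\W", "_", path.stem)
        classes[class_name] = type(
            class_name, (ExperimentTestBase,), {"expected_file": path}
        )
    return classes


# In a real test module the directory would be globbed and the classes exposed
# at module level so unittest discovery (and VS Code) can find them, e.g.:
#     globals().update(build_test_classes(Path("expected_results").glob("*.yaml")))
```

Because the classes are built from whatever YAML files exist at import time, adding or removing an expected-results file changes the discovered tests without editing any test code.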

Dev Utils

Generalized create_profiles.py into run_experiment.py to allow for easier runtime analysis with tools like VizTracer

Demo Files

Made simopt directory adding logic more robust
Formatted imports
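
One way to make the directory-adding logic robust is to search upward for the package with `pathlib` instead of assuming a fixed relative path. The sketch below is an assumption about the general approach, not the code in the demo files; `add_package_parent_to_path` is a hypothetical helper.

```python
import sys
from pathlib import Path


def add_package_parent_to_path(start: Path, package_name: str = "simopt") -> Path:
    """Search `start` and its ancestors for a directory containing `package_name`."""
    start = start.resolve()
    for candidate in (start, *start.parents):
        if (candidate / package_name).is_dir():
            if str(candidate) not in sys.path:
                sys.path.insert(0, str(candidate))  # avoid duplicate entries
            return candidate
    raise FileNotFoundError(f"no '{package_name}' directory found above {start}")


# Typical use at the top of a demo script (kept as a comment so the block
# runs anywhere): add_package_parent_to_path(Path(__file__).parent)
```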

Project Settings

Added imports flag ("I") to Ruff's linting
Added "simopt" and "mrg32k3a" to Ruff's known-first-party list to ensure import consistency regardless of mrg32k3a import method

Grochocinski · 2025-03-17T20:51:24Z

Early testing with VizTracer seems to confirm that mrg32k3a random number generation is taking up a significant portion of processor time. It might be worth looking into further optimization such as JIT compilation or straight up reimplementing it in C/C++/Rust and creating Python bindings for that with PyO3.

EDIT: I made quite a few optimizations to mrg32k3a and opened a PR. Testing shows a 30-40% speed increase, which is probably the best we can get without JIT compilation or a reimplementation. At what point do we hit diminishing returns on that?

Grochocinski · 2025-03-27T04:24:21Z

Current Performance Comparison vs Development

% Change in Runtime vs Development (Avg. -27.35%)

Problem	ADAM	ALOE	ASTRODF	NELDMD	RNDSRCH	SPSA	STRONG
AMUSEMENTPARK-1					-19.64%
CHESS-1					-16.62%
CNTNEWS-1	-30.31%	-33.55%	-31.38%	-32.50%	-33.88%	-32.20%	-32.28%
CONTAM-1					-30.57%
CONTAM-2					-31.63%
DUALSOURCING-1					-79.42%
DYNAMNEWS-1	-32.59%	-33.94%	-25.60%	-23.12%	-35.35%	-33.65%	-35.39%
EXAMPLE-1	-32.07%	-32.59%	-35.52%	-34.04%	-34.45%	-33.70%	-34.36%
FACSIZE-1					-33.95%
FACSIZE-2					-33.81%
FIXEDSAN-1	17.51%	23.81%	67.20%	12.03%	2.50%	1.75%	14.64%
HOTEL-1					-8.25%
IRONORE-1					-32.28%
IRONORECONT-1	-30.79%	-32.47%	-27.72%	-30.89%	-31.54%	-29.96%	-30.71%
MM1-1	-6.95%	-13.30%	-10.97%	-9.80%	-19.98%	-1.61%	-14.95%
NETWORK-1					-21.19%
PARAMESTI-1	-33.43%	-36.48%	-36.11%	-35.77%	-34.62%	-33.83%	-35.78%
RMITD-1					-35.84%
SAN-1	-33.54%	-36.58%	-28.14%	-37.39%	-38.31%	-38.10%	-35.47%
SSCONT-1	-49.26%	-49.04%	-49.24%	-46.56%	-49.23%	-50.00%	-50.48%
TABLEALLOCATION-1					-35.68%

Testing Notes

Calculated as (new-old)/old (lower is better)
Values are the total time to run, post-replicate, and post-normalize an experiment with 10 macroreps and 100 postreps

Notes on FIXEDSAN regression

As can be seen in the commit for the FIXEDSAN-1 refactor, the code was changed from a mess of calculations to a much simpler series of function calls and helper functions. This adds overhead, but in return makes the code much clearer. That clarity even made it easy to spot a bug that had been present since the code's inception (see #153).

If we ignore this regression, the average difference in speed is -32.21%
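
As a sanity check, the two averages quoted above can be reproduced from the table. The values below are transcribed from the table; `percent_change` is a hypothetical helper that just restates the (new-old)/old formula from the testing notes.

```python
from statistics import mean

# Percent change in runtime per problem, transcribed from the table above.
changes = {
    "AMUSEMENTPARK-1": [-19.64],
    "CHESS-1": [-16.62],
    "CNTNEWS-1": [-30.31, -33.55, -31.38, -32.50, -33.88, -32.20, -32.28],
    "CONTAM-1": [-30.57],
    "CONTAM-2": [-31.63],
    "DUALSOURCING-1": [-79.42],
    "DYNAMNEWS-1": [-32.59, -33.94, -25.60, -23.12, -35.35, -33.65, -35.39],
    "EXAMPLE-1": [-32.07, -32.59, -35.52, -34.04, -34.45, -33.70, -34.36],
    "FACSIZE-1": [-33.95],
    "FACSIZE-2": [-33.81],
    "FIXEDSAN-1": [17.51, 23.81, 67.20, 12.03, 2.50, 1.75, 14.64],
    "HOTEL-1": [-8.25],
    "IRONORE-1": [-32.28],
    "IRONORECONT-1": [-30.79, -32.47, -27.72, -30.89, -31.54, -29.96, -30.71],
    "MM1-1": [-6.95, -13.30, -10.97, -9.80, -19.98, -1.61, -14.95],
    "NETWORK-1": [-21.19],
    "PARAMESTI-1": [-33.43, -36.48, -36.11, -35.77, -34.62, -33.83, -35.78],
    "RMITD-1": [-35.84],
    "SAN-1": [-33.54, -36.58, -28.14, -37.39, -38.31, -38.10, -35.47],
    "SSCONT-1": [-49.26, -49.04, -49.24, -46.56, -49.23, -50.00, -50.48],
    "TABLEALLOCATION-1": [-35.68],
}


def percent_change(old: float, new: float) -> float:
    """(new - old) / old as a percentage; negative means the new code is faster."""
    return 100.0 * (new - old) / old


all_values = [v for row in changes.values() for v in row]
without_fixedsan = [
    v for name, row in changes.items() if name != "FIXEDSAN-1" for v in row
]

print(f"all pairs:          {mean(all_values):.2f}%")        # -27.35%
print(f"excluding FIXEDSAN: {mean(without_fixedsan):.2f}%")  # -32.21%
```

Both figures are unweighted means over the 75 problem-solver pairs (68 without FIXEDSAN-1), so problems tested with all seven solvers count seven times.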

Copilot

Pull Request Overview

This PR introduces performance enhancements across core experiment functionality, testing scripts, demo files, and developer utilities. Key changes include improved return typing for objectives and constraints, refined multiprocessing pool usage to reduce overhead, and extensive refactoring of testing and demo code using pathlib for cleaner imports and path handling.

Reviewed Changes

Copilot reviewed 208 out of 208 changed files in this pull request and generated no comments.

Show a summary per file

File	Description
pyproject.toml	Reformatted dependency and linting configuration for consistency.
docs/source/conf.py	Minor formatting adjustment to support Sphinx documentation build.
dev_tools/run_experiment.py	Added a new CLI experiment runner with improved compatibility checks.
dev_tools/profiling/create_profiles.py	Removed legacy profiling script, likely replaced by run_experiment.py.
dev_tools/generate_experiment_results.py	Refactored test generation using pathlib and streamlined file handling.
demo/*	Updated import paths and object instantiation for clarity and robustness.

Comments suppressed due to low confidence (1)

demo/demo_problem.py:100

[nitpick] Consider adding an informative message to this assert statement (e.g., 'stoch_constraints should not be None') to clarify the failure condition.

assert mysolution.stoch_constraints is not None
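
A minimal sketch of what the suggested change could look like, assuming a stand-in object in place of the demo's real solution (the `SimpleNamespace` here is hypothetical and only exists to make the snippet runnable):

```python
from types import SimpleNamespace

# Hypothetical stand-in for the demo's solution object.
mysolution = SimpleNamespace(stoch_constraints=(0.0,))

# The text after the comma becomes the AssertionError message if the check fails.
assert mysolution.stoch_constraints is not None, "stoch_constraints should not be None"
```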

Copilot

Pull Request Overview

This pull request introduces performance enhancements, refactors several modules for clarity and efficiency, and updates various demo and testing scripts. Key changes include:

Updates to dependency and linting configurations in pyproject.toml.
Reworking of experimental and testing scripts (run_experiment, generate_experiment_results) to improve runtime performance and maintainability.
Migration of demo files from os.path to pathlib for improved readability and consistency.

Reviewed Changes

Copilot reviewed 208 out of 208 changed files in this pull request and generated no comments.

Show a summary per file

File	Description
pyproject.toml	Reformatted dependencies and updated linting configuration.
docs/source/conf.py	Minor formatting adjustment for Sphinx configuration.
dev_tools/testing/__init__.py	Removed docstring; init file now empty.
dev_tools/run_experiment.py	New script to run experiments with multiprocessing improvements.
dev_tools/profiling/create_profiles.py	Removed outdated profiling code.
dev_tools/generate_experiment_results.py	Refactored test generation using pathlib and simplified compatibility checks.
demo/* (multiple demo files)	Updated import paths using pathlib; small code adjustments and minor fixes.

Grochocinski · 2025-03-29T07:16:22Z

All the refactoring is done and I've implemented fixes for the two issues I discovered during the rewrites, so I think we're good to review and merge.

Grochocinski and others added 18 commits March 10, 2025 15:30

modified ADAM solve() to use np's vectorization

adc6003

refactored finite_diff helper to take advantage of np vectorization

b46aaac

changed ALOE's solve() to use np's vectorized operations

fd9dbcf

vectorized ALOE's finite_diff helper

bd082ea

added finite_diff docstring, changed solution simulation loop to be more efficient

1f6d870

removed inline comments causing statements to run over 80 characters

9841859

refactored NelderMead's helper functions for better vectorization and readability

f9ae0cc

refactored neldmd's solve() function

85b1fe9

removed repeated casting of bounds to/from np arrays

5c1e005

fixed type hints, removed unnecessary tuple casts, simplified minimization loop

4bdd2eb

refactored helper functions and solve function for improved vectorization and lower overhead

d0c2d77

partial STRONG refactor to avoid recomputation of values

f9bcec2

refactored to reduce repeated code and improve clarity

169e985

refactored finite diff to remove large if block

183e5b4

fixed bug in STRONG hessian calculation where x6 solution was based off x5 data

ecd2df5

refactored finite_diff to reduce duplicate code

c8d2b5e

replaced create_profiles script with improved run_experiment for use with viztracer

7b30f6a

moved valid pair generation out of main method

4e96948

Grochocinski self-assigned this Mar 17, 2025

moved method checks outside of loop to avoid repetitive checks

de9d940

Grochocinski added 9 commits March 18, 2025 12:06

replaced map_async with imap_unordered to reduce busy waiting

bb92768

refactored solve function to reduce repeat code and improve clarity

97e194f

rolled tests back to using mrg32k3a v1.0.2 due to issues with v1.1 prerelease version, updated functions to improve typing consistency

274e616

regenerated tests with new mrg32k3a v1.1 prerelease (mostly minor rounding differences)

9ad3444

fixed formatting

faddf8d

updated run_experiment script with some documentation and correct handling post_reps and post_norms

66a39b7

updated test results (again)

51f16db

refactored experiment testing files into a single dynamic-generation module and helper file

736f27b

move experiment result generator script and gave it a more descriptive name, cleaned up dev_tools

bead9be

Grochocinski added 16 commits March 25, 2025 23:46

refactored replicate() to be streamlined and easier to follow

8d44ab6

refactored replicate() to make contam calculation clearer

d537e72

refactored replicate to remove repeated numpy array creation/destruction, improved clarity

4623deb

refactored replicate() for clearer gumbel and utility inits, clearer customer loop

74fa818

refactored replicate() with vectorized stockout calcs

614ad8c

greatly simplified node-building logic in replicate()

77636ac

refactored replicate to clarify logic and simplify operations

d95b063

streamlined replicate logic

8f18801

partial refactor of replicate()

e4fe9ad

refactored replicate for improved readability

59744a3

vectorized replicate and added enum for matrix indexing

6bc8e32

made current message indexing clearer

5c99228

minor refactoring of replicate

4bbaf04

refactored replicate for slight performance/clarity improvements

e8a9015

simplified replicate logic, added fast_weighted_choice

9299e82

optimized replicate loop core + slicing logic

22632dc

Grochocinski requested a review from Copilot March 28, 2025 17:53

Copilot AI reviewed Mar 28, 2025

Grochocinski added 7 commits March 29, 2025 01:14

added timing printout after running methods

039ce1c

fixed #153, regenerated affected test results

e5f7656

fixed #154, regenerated affected test results

59c5914

split ironore profit calculation into seperate costs/profits for future gradient use

85c1fcf

vectorized net profic calc, regen-ed affected tests

509e165

replaced producing array with checks to prod_costs array

b86fca4

tweaked make_nonzero to use copysign, added message for assert in demo

fb576c3

Grochocinski requested a review from Copilot March 29, 2025 07:04

Copilot AI reviewed Mar 29, 2025

Grochocinski marked this pull request as ready for review March 29, 2025 07:15
