Feature/3299 improve ess mcse #3305

mitzimorris · 2024-07-31T04:20:25Z

Submission Checklist

Run unit tests: ./runTests.py src/test/unit
Run cpplint: make cpplint
Declare copyright holder and open-source license: see below

Summary

Add split-rank-folded ESS .

Intended Effect

Expose new split-Rhat and split-ESS for CmdStan

How to Verify

unit tests

Side Effects

N/A

Documentation

Copyright and Licensing

Please list the copyright holder for the work you are submitting (this will be you or your assignee, such as a university or company): Columbia University

By submitting this pull request, the copyright holder is agreeing to license the submitted work under the following licenses:

Code: BSD 3-clause (https://opensource.org/licenses/BSD-3-Clause)
Documentation: CC-BY 4.0 (https://creativecommons.org/licenses/by/4.0/)

…hub.com/stan-dev/stan into feature/3299-improve-ESS-MCSE

…re/3299-improve-ESS-MCSE

mitzimorris · 2024-07-31T04:25:13Z

@SteveBronder - I would really appreciate your eyeballs on the ESS calculations - a preliminary review?

@WardBrian - does this API look OK?

also @avehtari - I implemented tail ESS - current logic is that if tail ESS returns NaN, set if to bulk ESS, as this seems to be what the posterior package does. is this correct?

the current set of unit tests test split R-hat and split-ESS against what's in posterior for a run of 2 chains on the eight_schools model. suggestions for further tests welcome.

…an-dev/stan into feature/3299-improve-ESS-MCSE

stan-buildbot · 2024-08-05T00:59:41Z

Name	Old Result	New Result	Ratio	Performance change( 1 - new / old )
arma/arma.stan	0.34	0.32	1.07	6.31% faster
low_dim_corr_gauss/low_dim_corr_gauss.stan	0.01	0.01	0.91	-10.28% slower
gp_regr/gen_gp_data.stan	0.02	0.02	1.03	3.15% faster
gp_regr/gp_regr.stan	0.09	0.1	0.99	-1.12% slower
sir/sir.stan	70.04	68.31	1.03	2.47% faster
irt_2pl/irt_2pl.stan	4.2	4.31	0.97	-2.7% slower
eight_schools/eight_schools.stan	0.06	0.06	1.03	2.94% faster
pkpd/sim_one_comp_mm_elim_abs.stan	0.25	0.25	0.98	-2.06% slower
pkpd/one_comp_mm_elim_abs.stan	19.51	18.79	1.04	3.71% faster
garch/garch.stan	0.45	0.41	1.09	8.47% faster
low_dim_gauss_mix/low_dim_gauss_mix.stan	2.72	2.6	1.05	4.4% faster
arK/arK.stan	1.82	1.72	1.06	5.58% faster
gp_pois_regr/gp_pois_regr.stan	2.87	2.72	1.06	5.25% faster
low_dim_gauss_mix_collapse/low_dim_gauss_mix_collapse.stan	8.78	8.44	1.04	3.88% faster
performance.compilation	181.33	181.18	1.0	0.08% faster
Mean result: 1.0224876057703807

Jenkins Console Log
Blue Ocean
Commit hash: 2cd95be39b0482d0fedb7fe9a1e8c77b7084592a

Machine information

No LSB modules are available. Distributor ID: Ubuntu Description: Ubuntu 20.04.3 LTS Release: 20.04 Codename: focal

CPU:
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Byte Order: Little Endian
Address sizes: 46 bits physical, 48 bits virtual
CPU(s): 80
On-line CPU(s) list: 0-79
Thread(s) per core: 2
Core(s) per socket: 20
Socket(s): 2
NUMA node(s): 2
Vendor ID: GenuineIntel
CPU family: 6
Model: 85
Model name: Intel(R) Xeon(R) Gold 6148 CPU @ 2.40GHz
Stepping: 4
CPU MHz: 2400.000
CPU max MHz: 3700.0000
CPU min MHz: 1000.0000
BogoMIPS: 4800.00
Virtualization: VT-x
L1d cache: 1.3 MiB
L1i cache: 1.3 MiB
L2 cache: 40 MiB
L3 cache: 55 MiB
NUMA node0 CPU(s): 0,2,4,6,8,10,12,14,16,18,20,22,24,26,28,30,32,34,36,38,40,42,44,46,48,50,52,54,56,58,60,62,64,66,68,70,72,74,76,78
NUMA node1 CPU(s): 1,3,5,7,9,11,13,15,17,19,21,23,25,27,29,31,33,35,37,39,41,43,45,47,49,51,53,55,57,59,61,63,65,67,69,71,73,75,77,79
Vulnerability Gather data sampling: Mitigation; Microcode
Vulnerability Itlb multihit: KVM: Mitigation: VMX disabled
Vulnerability L1tf: Mitigation; PTE Inversion; VMX conditional cache flushes, SMT vulnerable
Vulnerability Mds: Mitigation; Clear CPU buffers; SMT vulnerable
Vulnerability Meltdown: Mitigation; PTI
Vulnerability Mmio stale data: Mitigation; Clear CPU buffers; SMT vulnerable
Vulnerability Retbleed: Mitigation; IBRS
Vulnerability Spec rstack overflow: Not affected
Vulnerability Spec store bypass: Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization
Vulnerability Spectre v2: Mitigation; IBRS, IBPB conditional, STIBP conditional, RSB filling, PBRSB-eIBRS Not affected
Vulnerability Srbds: Not affected
Vulnerability Tsx async abort: Mitigation; Clear CPU buffers; SMT vulnerable
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cdp_l3 invpcid_single pti intel_ppin ssbd mba ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm mpx rdt_a avx512f avx512dq rdseed adx smap clflushopt clwb intel_pt avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local dtherm ida arat pln pts hwp hwp_act_window hwp_epp hwp_pkg_req pku ospke md_clear flush_l1d arch_capabilities

G++:
g++ (Ubuntu 9.4.0-1ubuntu1~20.04) 9.4.0
Copyright (C) 2019 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

Clang:
clang version 10.0.0-4ubuntu1
Target: x86_64-pc-linux-gnu
Thread model: posix
InstalledDir: /usr/bin

…an-dev/stan into feature/3299-improve-ESS-MCSE

stan-buildbot · 2024-08-05T06:14:08Z

Name	Old Result	New Result	Ratio	Performance change( 1 - new / old )
arma/arma.stan	0.36	0.32	1.14	12.57% faster
low_dim_corr_gauss/low_dim_corr_gauss.stan	0.01	0.01	0.99	-0.72% slower
gp_regr/gen_gp_data.stan	0.03	0.02	1.2	16.83% faster
gp_regr/gp_regr.stan	0.1	0.09	1.04	3.42% faster
sir/sir.stan	69.38	70.3	0.99	-1.31% slower
irt_2pl/irt_2pl.stan	4.25	3.96	1.07	6.73% faster
eight_schools/eight_schools.stan	0.06	0.05	1.07	6.22% faster
pkpd/sim_one_comp_mm_elim_abs.stan	0.25	0.25	0.98	-2.37% slower
pkpd/one_comp_mm_elim_abs.stan	19.72	18.95	1.04	3.92% faster
garch/garch.stan	0.44	0.41	1.06	6.07% faster
low_dim_gauss_mix/low_dim_gauss_mix.stan	2.71	2.6	1.04	3.77% faster
arK/arK.stan	1.76	1.72	1.02	2.31% faster
gp_pois_regr/gp_pois_regr.stan	2.84	2.8	1.01	1.22% faster
low_dim_gauss_mix_collapse/low_dim_gauss_mix_collapse.stan	8.83	8.4	1.05	4.89% faster
performance.compilation	180.95	180.82	1.0	0.07% faster
Mean result: 1.0473007260170133

Jenkins Console Log
Blue Ocean
Commit hash: 2cd95be39b0482d0fedb7fe9a1e8c77b7084592a

Machine information

No LSB modules are available. Distributor ID: Ubuntu Description: Ubuntu 20.04.3 LTS Release: 20.04 Codename: focal

CPU:
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Byte Order: Little Endian
Address sizes: 46 bits physical, 48 bits virtual
CPU(s): 80
On-line CPU(s) list: 0-79
Thread(s) per core: 2
Core(s) per socket: 20
Socket(s): 2
NUMA node(s): 2
Vendor ID: GenuineIntel
CPU family: 6
Model: 85
Model name: Intel(R) Xeon(R) Gold 6148 CPU @ 2.40GHz
Stepping: 4
CPU MHz: 2400.000
CPU max MHz: 3700.0000
CPU min MHz: 1000.0000
BogoMIPS: 4800.00
Virtualization: VT-x
L1d cache: 1.3 MiB
L1i cache: 1.3 MiB
L2 cache: 40 MiB
L3 cache: 55 MiB
NUMA node0 CPU(s): 0,2,4,6,8,10,12,14,16,18,20,22,24,26,28,30,32,34,36,38,40,42,44,46,48,50,52,54,56,58,60,62,64,66,68,70,72,74,76,78
NUMA node1 CPU(s): 1,3,5,7,9,11,13,15,17,19,21,23,25,27,29,31,33,35,37,39,41,43,45,47,49,51,53,55,57,59,61,63,65,67,69,71,73,75,77,79
Vulnerability Gather data sampling: Mitigation; Microcode
Vulnerability Itlb multihit: KVM: Mitigation: VMX disabled
Vulnerability L1tf: Mitigation; PTE Inversion; VMX conditional cache flushes, SMT vulnerable
Vulnerability Mds: Mitigation; Clear CPU buffers; SMT vulnerable
Vulnerability Meltdown: Mitigation; PTI
Vulnerability Mmio stale data: Mitigation; Clear CPU buffers; SMT vulnerable
Vulnerability Retbleed: Mitigation; IBRS
Vulnerability Spec rstack overflow: Not affected
Vulnerability Spec store bypass: Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization
Vulnerability Spectre v2: Mitigation; IBRS, IBPB conditional, STIBP conditional, RSB filling, PBRSB-eIBRS Not affected
Vulnerability Srbds: Not affected
Vulnerability Tsx async abort: Mitigation; Clear CPU buffers; SMT vulnerable
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cdp_l3 invpcid_single pti intel_ppin ssbd mba ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm mpx rdt_a avx512f avx512dq rdseed adx smap clflushopt clwb intel_pt avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local dtherm ida arat pln pts hwp hwp_act_window hwp_epp hwp_pkg_req pku ospke md_clear flush_l1d arch_capabilities

G++:
g++ (Ubuntu 9.4.0-1ubuntu1~20.04) 9.4.0
Copyright (C) 2019 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

Clang:
clang version 10.0.0-4ubuntu1
Target: x86_64-pc-linux-gnu
Thread model: posix
InstalledDir: /usr/bin

WardBrian

Thanks for tackling this

lib/stan_math

src/stan/analyze/mcmc/check_chains.hpp

src/stan/analyze/mcmc/compute_potential_scale_reduction.hpp

src/stan/analyze/mcmc/split_rank_normalized_ess.hpp

src/test/unit/analyze/mcmc/compute_potential_scale_reduction_test.cpp

src/stan/mcmc/chainset.hpp

src/stan/io/stan_csv_reader.hpp

…an-dev/stan into feature/3299-improve-ESS-MCSE

…re/3299-improve-ESS-MCSE

SteveBronder

Few comments about chainset

src/stan/mcmc/chainset.hpp

SteveBronder · 2024-09-24T21:22:00Z

src/stan/mcmc/chainset.hpp

+  explicit chainset(const std::vector<stan::io::stan_csv>& stan_csv) {
+    if (stan_csv.empty())
+      return;
+    init_from_stan_csv(stan_csv[0]);
+    for (size_t i = 1; i < stan_csv.size(); ++i) {
+      add(stan_csv[i]);
+    }
+  }


Can we write a thinned_samples function with a signature thinned_samples(const std::vector<stan::io::stan_csv>&)? Also add a function like the below so that we can use a member initialization list

std::vector<Eigen::MatrixXd> extract_samples(const std::vector<stan::io::stan_csv>&)`

Suggested change

explicit chainset(const std::vector<stan::io::stan_csv>& stan_csv) {

if (stan_csv.empty())

return;

init_from_stan_csv(stan_csv[0]);

for (size_t i = 1; i < stan_csv.size(); ++i) {

add(stan_csv[i]);

}

}

explicit chainset(const std::vector<stan::io::stan_csv>& stan_csvs) :

param_names_(stan_csvs[0].header),

num_samples_(thinned_samples(stan_csvs)),

chains_(extract_samples(stan_csvs)) {}

SteveBronder · 2024-09-24T21:27:07Z

src/stan/mcmc/chainset.hpp

+  inline int num_chains() const { return chains_.size(); }
+
+  /**
+   * Report number of parameters per chain.
+   * @return size of parameter names vector.
+   */
+  inline int num_params() const { return param_names_.size(); }
+
+  /**
+   * Report number of samples (draws) per chain.
+   * @return rows per chain
+   */
+  inline int num_samples() const { return num_samples_; }


std::vector returns size_t which is a unsigned int. I like using Eigen::Index instead of int since it's a long int and it's a reasonably safe int to convert to

Suggested change

inline int num_chains() const { return chains_.size(); }

/**

* Report number of parameters per chain.

* @return size of parameter names vector.

*/

inline int num_params() const { return param_names_.size(); }

/**

* Report number of samples (draws) per chain.

* @return rows per chain

*/

inline int num_samples() const { return num_samples_; }

inline int num_chains() const { return chains_.size(); }

/**

* Report number of parameters per chain.

* @return size of parameter names vector.

*/

inline int num_params() const { return param_names_.size(); }

/**

* Report number of samples (draws) per chain.

* @return rows per chain

*/

inline int num_samples() const { return num_samples_; }

src/stan/mcmc/chainset.hpp

…an-dev/stan into feature/3299-improve-ESS-MCSE

mitzimorris · 2024-10-04T00:45:20Z

closing this PR - see comment #3310 (comment)

mitzimorris and others added 26 commits July 11, 2024 10:16

Merge branch 'develop' of https://github.com/stan-dev/stan into develop

b2e2a34

Merge branch 'develop' of https://github.com/stan-dev/stan into develop

9d60cc7

code readibility

986fcfc

code readibility

487e4ab

code cleanup

90e0b1f

code cleanup

2ae8f73

code cleanup

f1af73d

checkpointing

447c809

checkpointing

f299899

Merge branch 'develop' of https://github.com/stan-dev/stan into develop

1dfa589

unit tests for added logic

ce88185

checks and unit tests

eef026d

lint fix

a201612

[Jenkins] auto-formatting by clang-format version 10.0.0-4ubuntu1

9730bdc

Merge branch 'bugfix/3301-stan-csv-reader-skip-warmup' of https://git…

2226ae3

…hub.com/stan-dev/stan into feature/3299-improve-ESS-MCSE

adding test file

d7aeddd

fix divide by zero error

c33e954

[Jenkins] auto-formatting by clang-format version 10.0.0-4ubuntu1

d7c0f05

Merge branch 'bugfix/3301-stan-csv-reader-skip-warmup' of https://git…

b1d815f

…hub.com/stan-dev/stan into feature/3299-improve-ESS-MCSE

refactoring, checkpointing

82aa58d

Merge branch 'develop' of https://github.com/stan-dev/stan into featu…

d732c02

…re/3299-improve-ESS-MCSE

checkpointing

9b91b62

checkpointing

7a30adc

checkpointing; unit tests pass

5caba03

checkpointing; unit tests pass

09170ed

checkpointing

988e2b1

mitzimorris requested a review from SteveBronder July 31, 2024 04:20

mitzimorris marked this pull request as draft July 31, 2024 04:20

checkpointing

203dcea

mitzimorris and others added 5 commits August 4, 2024 18:38

stan_csv_reader - logic to parse fixed_param output

9d3f80b

[Jenkins] auto-formatting by clang-format version 10.0.0-4ubuntu1

246cb52

Merge branch 'feature/3299-improve-ESS-MCSE' of https://github.com/st…

db6eb32

…an-dev/stan into feature/3299-improve-ESS-MCSE

Merge branch 'feature/3299-improve-ESS-MCSE' of https://github.com/st…

7993837

…an-dev/stan into feature/3299-improve-ESS-MCSE

stan_csv_reader - doc comments

6ab986e

mitzimorris and others added 4 commits August 5, 2024 00:04

adding autocovariance function, used by stansummary

3d94edc

[Jenkins] auto-formatting by clang-format version 10.0.0-4ubuntu1

1c6804d

Merge branch 'feature/3299-improve-ESS-MCSE' of https://github.com/st…

f746b43

…an-dev/stan into feature/3299-improve-ESS-MCSE

Merge branch 'feature/3299-improve-ESS-MCSE' of https://github.com/st…

bf03afd

…an-dev/stan into feature/3299-improve-ESS-MCSE

WardBrian reviewed Aug 5, 2024

View reviewed changes

This was referenced Aug 5, 2024

remove print utility stan-dev/cmdstan#1289

Open

Feature/1263 update stansummary to report rank-normalized ESS tail, ESS bulk, max abs deviation(MAD), and Rhat stan-dev/cmdstan#1290

Open

mitzimorris added 5 commits September 23, 2024 12:30

remove empty template parameter for chainset obj

e515fd1

Merge branch 'feature/3299-improve-ESS-MCSE' of https://github.com/st…

300809c

…an-dev/stan into feature/3299-improve-ESS-MCSE

stan_math fix?

94a7e3b

Merge branch 'develop' of https://github.com/stan-dev/stan into featu…

52c7bbf

…re/3299-improve-ESS-MCSE

changes per code review

4af0913

SteveBronder requested changes Sep 24, 2024

View reviewed changes

mitzimorris and others added 5 commits September 24, 2024 20:59

Merge branch 'feature/3299-improve-ESS-MCSE' of https://github.com/st…

e6d53c9

…an-dev/stan into feature/3299-improve-ESS-MCSE

harmonize get_stats from chainset and parsing of csv files

6f4f661

[Jenkins] auto-formatting by clang-format version 10.0.0-4ubuntu1

097883f

Merge branch 'feature/3299-improve-ESS-MCSE' of https://github.com/st…

c3178b9

…an-dev/stan into feature/3299-improve-ESS-MCSE

unit test fix

60495e7

mitzimorris mentioned this pull request Sep 25, 2024

Feature/3299 diagnostics chainset #3310

Closed

3 tasks

mitzimorris and others added 2 commits September 26, 2024 14:54

changes per code review

340f0d9

[Jenkins] auto-formatting by clang-format version 10.0.0-4ubuntu1

fbf45ab

mitzimorris closed this Oct 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/3299 improve ess mcse #3305

Feature/3299 improve ess mcse #3305

mitzimorris commented Jul 31, 2024

mitzimorris commented Jul 31, 2024

stan-buildbot commented Aug 5, 2024

stan-buildbot commented Aug 5, 2024

WardBrian left a comment

SteveBronder left a comment

SteveBronder Sep 24, 2024

SteveBronder Sep 24, 2024

mitzimorris commented Oct 4, 2024

Feature/3299 improve ess mcse #3305

Feature/3299 improve ess mcse #3305

Conversation

mitzimorris commented Jul 31, 2024

Submission Checklist

Summary

Intended Effect

How to Verify

Side Effects

Documentation

Copyright and Licensing

mitzimorris commented Jul 31, 2024

stan-buildbot commented Aug 5, 2024

stan-buildbot commented Aug 5, 2024

WardBrian left a comment

Choose a reason for hiding this comment

SteveBronder left a comment

Choose a reason for hiding this comment

SteveBronder Sep 24, 2024

Choose a reason for hiding this comment

SteveBronder Sep 24, 2024

Choose a reason for hiding this comment

mitzimorris commented Oct 4, 2024