Semi-concrete IR interpreter #44803

ianatol · 2022-03-30T22:54:05Z

Original PR description:

Background

in #43852, the compiler learned how to make use of the codegen'd version of a function in order to speed up constant propagation by 1000x, assuming two conditions are fulfilled:

All arguments are known to be constant at inference time
The evaluation is legal according to appropriate effect conditions

This mechanism works well, but leaves some very sharp performance edges in inference. For example, adding an unused argument to an otherwise consteval-eligible function will fall back to inference-based constant propagation and thus incur the 1000x penalty (even though the argument is entirely unused).

This PR attempts to address this by adding a new middle-of-the-road fallback that sits between the super fast constant, concrete evaluation from #43852 and the original inference based constant propagation.

This PR

The core idea is to use inference's existing abstract interpretation lattice, but rather than performing abstract interpretation over untyped source, we perform it over the typed/optimized julia IR that the non-constant inference pass produced.

Now, this is not legal to do in general, but the :consistent and :terminates_globally effects from #43852, do ensure legality (in the currently implementation we additionally require :effect_free for consistency with consteval, but that could be easily relaxed).

Why is this a good idea

There are several benefits to performing abstract evaluation over optimized IR rather than untyped IR:

We do not need to perform any method lookups and thus can also avoid inference's relatively complicated tracking of potential recursions. Method lookups were performed during the non-constant inference pass and the :termiantes_globally effect ensures the absence of recursion (at least when proven by the compiler - the property I documented for manual annotation is slightly too weak than what we need, so we may need to split out these effects).
Optimized IR is likely to be smaller (in terms of total number of instructions that need to be visited for a given set of input arguments) because some optimizations/simplification/DCE have been performed.
Optimized IR does not have any slots in it, so we avoid Very WIP: Refactor core inference loops to use less memory #43999-like problems around inference memory usage.
The same optimized IR can be re-used many times for different sets of constants.

Threats to validity

Now, there are some challenges also. For one, it is not guaranteed that this will have the same precision as the inference-based constant propagation. I've done some testing here and precision is decent, but there are some cases where precision is worse than the inference-based constant propagation:

In the current implementation, we never perform any method matching, instead relying on methods annotated in :invoke statements. This mostly works, but there are certain situations (e.g. the cases that optimizer: inline abstract union-split callsite #44512 is supposed to fix), where we currently cannot produce :invoke, even though inference was able to compute a full method match. My preferred solution to this is to extend the semantics of :invoke to always make it possible to generate an :invoke node when the full method match set is known (even if codegen does not make use of it). However, I haven't really seen this be much of a problem in my testing.
There are certain branching patterns that would require the insertion of additional Pi nodes in order to avoid overapproximation of the lattice elements. This is one of the reasons that we currently don't do regular inference on SSA IR (the other being debuggability of lowered code). We've been thinking about some solutions to this, but this particular case is somewhat simpler than inference because 1) the branching patterns that inference can prove :consistent are limited anyway 2) regular inference may have already inserted the requisite PiNodes in which case there isn't a problem. In particular, in my testing I haven't actually seen any case of precision regressions due to this issue.

Benchmark results

The benchmarks for this are looking very attractive. On my ongoing inference torture test (https://gist.github.com/Keno/5587fbfe89bff96c863c8eeb1477defa), this gives an additional 2x improvement in inference time on top of the improvements already merged on master and those pending in #44447 and #44494. I should note that this is total inference time. By the nature of the benchmark, it basically performs inference and constant propagation once for each specialized call site. The 2x improvement here essentially comes down to the constant-propagation portion of that time dropping down to noise level.

The performance is even better on the real world benchmark that motivated https://gist.github.com/Keno/5587fbfe89bff96c863c8eeb1477defa. There constant propagation is performed multiple times with different constants for each non-constant inference, so the performance improvement is proportionally better, to the point where inference time is no longer dominating in that benchmark (down to about 30s from several hours in early January before all the recent compile-time perf improvements).

Current status

This passes tests for me locally, but I'm convinced that issues will show up during PkgEval. The re-use of the abstract interpreter pieces is also quite janky and should be factored better.

ianatol · 2022-03-31T00:09:24Z

@nanosoldier runtests(["ADI", "AMLPipelineBase", "ANOVA", "ARules", "Agents", "AlgebraOfGraphics", "Arrow", "AutoMLPipeline", "BGEN", "BHAtp", "BallroomSkatingSystem", "BangBang", "BedgraphFiles", "BenchmarkConfigSweeps", "Binscatters", "BioFetch", "BipolarSphericalHarmonics", "BoltzmannMachinesPlots", "Bootstrap", "CMAEvolutionStrategy", "CSVReader", "Cassette", "ChainLadder", "CoinbasePro", "CombinatorialSpaces", "ConstraintSolver", "CovarianceMatrices", "CropRootBox", "CrystalInfoFramework", "Cthulhu", "DLMReader", "DPClustering", "Dagger", "DarkCurves", "DataFrameMacros", "DataFrameTools", "DataFramesMeta", "DataKnots", "DataSkimmer", "DecisionProgramming", "Diagonalizations", "DimensionalData", "Distances", "Diversity", "DrelTools", "DrillHoles", "EchelleCCFs", "EffectSizes", "Effects", "Equate", "FHIRClient", "FastAI", "Feather", "FeatureTransforms", "FeynmanDiagram", "FileTrees", "FinEtoolsFlexBeams", "FixedEffectModels", "FixedPointAcceleration", "FlowAtlas", "FlowWorkspace", "Folds", "FrameFun", "FunSQL", "Futbolista", "GBIF", "GLM", "GPLinearODEMaker", "GeoClustering", "GeoDatasets", "GeoLearning", "GeoStatsBase", "GlobalSearchRegression", "GlobalSensitivityAnalysis", "Graph500", "GraphDataFrameBridge", "GridapDistributed", "GroupedTemporalTerms", "GslibIO", "HITRAN", "Hadleyverse", "HighFrequencyCovariance", "HurdleDMR", "INMET", "ImageComponentAnalysis", "ImageGeoms", "InMemoryDatasets", "IncrementalPruning", "IndexedTables", "InformationGeometry", "InteractionWeightedDIDs", "InteractiveErrors", "IntervalTrees", "InvariantCausal", "IterTools", "JMcDM", "JSONLines", "JSONTables", "JWAS", "JlrsReflect", "JudiLing", "JuliaCon", "KCenters", "KeyedFrames", "LITS", "Lasso", "LazyGrids", "Legolas", "LegolasFlux", "LifeTable", "Lighthouse", "LinRegOutliers", "LocalAnisotropies", "LockandKeyLookups", "LoopVectorization", "LowLevelParticleFilters", "LsqFit", "MCMCChains", "MIPVerify", "MLJAbstractGPsGlue", "MLJEnsembles", "MLJLinearModels", "MLJModels", "MLJMultivariateStatsInterface", "MLJScientificTypes", "MatrixLMnet", "Meshes", "Metida", "MimiPAGE2020", "MixedModels", "MixedModelsExtras", "MixedModelsMakie", "MixedModelsPermutations", "MixedModelsSim", "ModelParameters", "Modia", "ModiaPlot_CairoMakie", "ModiaPlot_PyPlot", "ModiaResult", "MultiModalMuSig", "MusicManipulations", "MutualInformationImageRegistration", "NCBITaxonomy", "NNlib", "NaiveGAflux", "OMOPCommonDataModel", "ObservationDims", "Onda", "OndaEDF", "Optim", "OptimKit", "OrdinalGWAS", "OutlierDetection", "OutlierDetectionData", "OutlierDetectionNeighbors", "PNGFiles", "POMDPSimulators", "PackageCompiler", "ParameterSpacePartitions", "Persa", "PersistenceDiagramsBase", "Phylo", "PhyloNetworks", "PhyloPlots", "Pitchjx", "PkgUtility", "PlantBiophysics", "PlotMesh", "PlotlyBase", "PopGenCore", "PopGenSims", "PoreMatMod", "PorousMaterials", "PowerModelsAnnex", "PowerPlots", "PowerSimulations", "PowerSystems", "ProbabilisticCircuits", "ProgenyTestingTools", "ProgressiveHedging", "PyRhodium", "QuantumLattices", "QuantumTomography", "QuerySQLite", "Queryverse", "Qwind", "RData", "RDatasets", "RELOG", "ReadWriteDlm2", "Recommenders", "RegressionDiscontinuity", "RegressionTables", "Remark", "ReplicateBE", "ReportMetrics", "RigidBodyTools", "RipQP", "Ripserer", "RobustModels", "Run", "RvSpectMLPlots", "SQLite", "Santiago", "ScenTrees", "ScientificTypes", "ShapML", "Shapley", "SimpleChains", "SlackThreads", "SnoopCompile", "SolverBenchmark", "SortMark", "SpatialDependence", "SphericalHarmonicModes", "SpineBasedRecordLinkage", "SpmSpectroscopy", "StataDTAFiles", "StaticArrays", "StatsModels", "StochasticIntegrals", "Stonks", "Strapping", "StructArrays", "SugarKelp", "SunAsAStar", "SyntheticDatasets", "SystemBenchmark", "TMLE", "TableOperations", "TextAnalysis", "TidyStanza", "TimeSeries", "Tracking", "Transducers", "TypedTables", "UncertaintyQuantification", "UnitfulAssets", "UpROOT", "VectorSphericalHarmonics", "VisualSearchACTR", "WRDSMerger", "WiSER", "Wordlegames", "XLSX", "YAAD"], vs = ":master")

nanosoldier · 2022-03-31T04:41:56Z

Something went wrong when running your job:

NanosoldierError: failed to run tests: Build for julia on x86_64-linux-gnu did not complete successfully

Logs and partial data can be found here
cc @maleadt

aviatesk · 2022-03-31T05:11:29Z

@nanosoldier runtests(["ADI", "AMLPipelineBase", "ANOVA", "ARules", "Agents", "AlgebraOfGraphics", "Arrow", "AutoMLPipeline", "BGEN", "BHAtp", "BallroomSkatingSystem", "BangBang", "BedgraphFiles", "BenchmarkConfigSweeps", "Binscatters", "BioFetch", "BipolarSphericalHarmonics", "BoltzmannMachinesPlots", "Bootstrap", "CMAEvolutionStrategy", "CSVReader", "Cassette", "ChainLadder", "CoinbasePro", "CombinatorialSpaces", "ConstraintSolver", "CovarianceMatrices", "CropRootBox", "CrystalInfoFramework", "Cthulhu", "DLMReader", "DPClustering", "Dagger", "DarkCurves", "DataFrameMacros", "DataFrameTools", "DataFramesMeta", "DataKnots", "DataSkimmer", "DecisionProgramming", "Diagonalizations", "DimensionalData", "Distances", "Diversity", "DrelTools", "DrillHoles", "EchelleCCFs", "EffectSizes", "Effects", "Equate", "FHIRClient", "FastAI", "Feather", "FeatureTransforms", "FeynmanDiagram", "FileTrees", "FinEtoolsFlexBeams", "FixedEffectModels", "FixedPointAcceleration", "FlowAtlas", "FlowWorkspace", "Folds", "FrameFun", "FunSQL", "Futbolista", "GBIF", "GLM", "GPLinearODEMaker", "GeoClustering", "GeoDatasets", "GeoLearning", "GeoStatsBase", "GlobalSearchRegression", "GlobalSensitivityAnalysis", "Graph500", "GraphDataFrameBridge", "GridapDistributed", "GroupedTemporalTerms", "GslibIO", "HITRAN", "Hadleyverse", "HighFrequencyCovariance", "HurdleDMR", "INMET", "ImageComponentAnalysis", "ImageGeoms", "InMemoryDatasets", "IncrementalPruning", "IndexedTables", "InformationGeometry", "InteractionWeightedDIDs", "InteractiveErrors", "IntervalTrees", "InvariantCausal", "IterTools", "JMcDM", "JSONLines", "JSONTables", "JWAS", "JlrsReflect", "JudiLing", "JuliaCon", "KCenters", "KeyedFrames", "LITS", "Lasso", "LazyGrids", "Legolas", "LegolasFlux", "LifeTable", "Lighthouse", "LinRegOutliers", "LocalAnisotropies", "LockandKeyLookups", "LoopVectorization", "LowLevelParticleFilters", "LsqFit", "MCMCChains", "MIPVerify", "MLJAbstractGPsGlue", "MLJEnsembles", "MLJLinearModels", "MLJModels", "MLJMultivariateStatsInterface", "MLJScientificTypes", "MatrixLMnet", "Meshes", "Metida", "MimiPAGE2020", "MixedModels", "MixedModelsExtras", "MixedModelsMakie", "MixedModelsPermutations", "MixedModelsSim", "ModelParameters", "Modia", "ModiaPlot_CairoMakie", "ModiaPlot_PyPlot", "ModiaResult", "MultiModalMuSig", "MusicManipulations", "MutualInformationImageRegistration", "NCBITaxonomy", "NNlib", "NaiveGAflux", "OMOPCommonDataModel", "ObservationDims", "Onda", "OndaEDF", "Optim", "OptimKit", "OrdinalGWAS", "OutlierDetection", "OutlierDetectionData", "OutlierDetectionNeighbors", "PNGFiles", "POMDPSimulators", "PackageCompiler", "ParameterSpacePartitions", "Persa", "PersistenceDiagramsBase", "Phylo", "PhyloNetworks", "PhyloPlots", "Pitchjx", "PkgUtility", "PlantBiophysics", "PlotMesh", "PlotlyBase", "PopGenCore", "PopGenSims", "PoreMatMod", "PorousMaterials", "PowerModelsAnnex", "PowerPlots", "PowerSimulations", "PowerSystems", "ProbabilisticCircuits", "ProgenyTestingTools", "ProgressiveHedging", "PyRhodium", "QuantumLattices", "QuantumTomography", "QuerySQLite", "Queryverse", "Qwind", "RData", "RDatasets", "RELOG", "ReadWriteDlm2", "Recommenders", "RegressionDiscontinuity", "RegressionTables", "Remark", "ReplicateBE", "ReportMetrics", "RigidBodyTools", "RipQP", "Ripserer", "RobustModels", "Run", "RvSpectMLPlots", "SQLite", "Santiago", "ScenTrees", "ScientificTypes", "ShapML", "Shapley", "SimpleChains", "SlackThreads", "SnoopCompile", "SolverBenchmark", "SortMark", "SpatialDependence", "SphericalHarmonicModes", "SpineBasedRecordLinkage", "SpmSpectroscopy", "StataDTAFiles", "StaticArrays", "StatsModels", "StochasticIntegrals", "Stonks", "Strapping", "StructArrays", "SugarKelp", "SunAsAStar", "SyntheticDatasets", "SystemBenchmark", "TMLE", "TableOperations", "TextAnalysis", "TidyStanza", "TimeSeries", "Tracking", "Transducers", "TypedTables", "UncertaintyQuantification", "UnitfulAssets", "UpROOT", "VectorSphericalHarmonics", "VisualSearchACTR", "WRDSMerger", "WiSER", "Wordlegames", "XLSX", "YAAD"], vs = ":master")

nanosoldier · 2022-03-31T12:03:00Z

Something went wrong when running your job:

NanosoldierError: failed to run tests: Build for julia on x86_64-linux-gnu did not complete successfully

Logs and partial data can be found here
cc @maleadt

maleadt · 2022-03-31T12:52:52Z

The build failed -- re-requesting PkgEval won't fix that.

unknown function (ip: 0x7ff09679396c)
_jl_invoke at /workspace/srcdir/src/gf.c:2367 [inlined]
ijl_apply_generic at /workspace/srcdir/src/gf.c:2549
unknown function (ip: 0x7ff09678cea5)
_jl_invoke at /workspace/srcdir/src/gf.c:2367 [inlined]
ijl_apply_generic at /workspace/srcdir/src/gf.c:2549
unknown function (ip: 0x7ff096789d0d)
_jl_invoke at /workspace/srcdir/src/gf.c:2367 [inlined]
ijl_apply_generic at /workspace/srcdir/src/gf.c:2549
unknown function (ip: 0x7ff096788b77)
_jl_invoke at /workspace/srcdir/src/gf.c:2367 [inlined]
ijl_apply_generic at /workspace/srcdir/src/gf.c:2549
unknown function (ip: 0x7ff0967c28ce)
_jl_invoke at /workspace/srcdir/src/gf.c:2367 [inlined]
ijl_apply_generic at /workspace/srcdir/src/gf.c:2549
unknown function (ip: 0x7ff0967bfc12)
_jl_invoke at /workspace/srcdir/src/gf.c:2367 [inlined]
ijl_apply_generic at /workspace/srcdir/src/gf.c:2549
unknown function (ip: 0x7ff0967b372e)
_jl_invoke at /workspace/srcdir/src/gf.c:2367 [inlined]
ijl_apply_generic at /workspace/srcdir/src/gf.c:2549
unknown function (ip: 0x7ff0967a7a8e)
_jl_invoke at /workspace/srcdir/src/gf.c:2367 [inlined]
ijl_apply_generic at /workspace/srcdir/src/gf.c:2549
unknown function (ip: 0x7ff0967a3a3c)
_jl_invoke at /workspace/srcdir/src/gf.c:2367 [inlined]
ijl_apply_generic at /workspace/srcdir/src/gf.c:2549
unknown function (ip: 0x7ff0967a259e)
_jl_invoke at /workspace/srcdir/src/gf.c:2367 [inlined]
ijl_apply_generic at /workspace/srcdir/src/gf.c:2549
unknown function (ip: 0x7ff09679951a)
_jl_invoke at /workspace/srcdir/src/gf.c:2367 [inlined]
ijl_apply_generic at /workspace/srcdir/src/gf.c:2549
unknown function (ip: 0x7ff096796032)
_jl_invoke at /workspace/srcdir/src/gf.c:2367 [inlined]
ijl_apply_generic at /workspace/srcdir/src/gf.c:2549
unknown function (ip: 0x7ff09679396c)
_jl_invoke at /workspace/srcdir/src/gf.c:2367 [inlined]
ijl_apply_generic at /workspace/srcdir/src/gf.c:2549
unknown function (ip: 0x7ff09678cea5)
_jl_invoke at /workspace/srcdir/src/gf.c:2367 [inlined]
ijl_apply_generic at /workspace/srcdir/src/gf.c:2549
unknown function (ip: 0x7ff096789d0d)
_jl_invoke at /workspace/srcdir/src/gf.c:2367 [inlined]
ijl_apply_generic at /workspace/srcdir/src/gf.c:2549
unknown function (ip: 0x7ff096788b77)
_jl_invoke at /workspace/srcdir/src/gf.c:2367 [inlined]
ijl_apply_generic at /workspace/srcdir/src/gf.c:2549
unknown function (ip: 0x7ff09676c07c)
_jl_invoke at /workspace/srcdir/src/gf.c:2367 [inlined]
ijl_apply_generic at /workspace/srcdir/src/gf.c:2549
unknown function (ip: 0x7ff09675f597)
jl_toplevel_eval_flex at /workspace/srcdir/src/toplevel.c:900
jl_parse_eval_all at /workspace/srcdir/src/toplevel.c:1043
ijl_load_ at /workspace/srcdir/src/toplevel.c:1090
unknown function (ip: 0x7ff09672e61d)
_jl_invoke at /workspace/srcdir/src/gf.c:2367 [inlined]
ijl_apply_generic at /workspace/srcdir/src/gf.c:2549
unknown function (ip: 0x7ff09672e564)
_jl_invoke at /workspace/srcdir/src/gf.c:2367 [inlined]
ijl_apply_generic at /workspace/srcdir/src/gf.c:2549
jl_apply at /workspace/srcdir/src/julia.h:1830 [inlined]
do_call at /workspace/srcdir/src/interpreter.c:126
eval_value at /workspace/srcdir/src/interpreter.c:215
eval_stmt_value at /workspace/srcdir/src/interpreter.c:166 [inlined]
eval_body at /workspace/srcdir/src/interpreter.c:612
jl_interpret_toplevel_thunk at /workspace/srcdir/src/interpreter.c:750
top-level scope at compiler/compiler.jl:161
jl_toplevel_eval_flex at /workspace/srcdir/src/toplevel.c:909
jl_eval_module_expr at /workspace/srcdir/src/toplevel.c:203 [inlined]
jl_toplevel_eval_flex at /workspace/srcdir/src/toplevel.c:712
ijl_toplevel_eval_in at /workspace/srcdir/src/toplevel.c:968
unknown function (ip: 0x7ff09672da0d)
_jl_invoke at /workspace/srcdir/src/gf.c:2367 [inlined]
ijl_apply_generic at /workspace/srcdir/src/gf.c:2549
jl_apply at /workspace/srcdir/src/julia.h:1830 [inlined]
do_call at /workspace/srcdir/src/interpreter.c:126
eval_value at /workspace/srcdir/src/interpreter.c:215
eval_stmt_value at /workspace/srcdir/src/interpreter.c:166 [inlined]
eval_body at /workspace/srcdir/src/interpreter.c:612
jl_interpret_toplevel_thunk at /workspace/srcdir/src/interpreter.c:750
top-level scope at compiler/compiler.jl:3
jl_toplevel_eval_flex at /workspace/srcdir/src/toplevel.c:909
jl_parse_eval_all at /workspace/srcdir/src/toplevel.c:1043
ijl_load_ at /workspace/srcdir/src/toplevel.c:1090
ijl_load at /workspace/srcdir/src/toplevel.c:1103
exec_program at /workspace/srcdir/src/jlapi.c:517
true_main at /workspace/srcdir/src/jlapi.c:570
jl_repl_entrypoint at /workspace/srcdir/src/jlapi.c:702
main at /workspace/srcdir/cli/loader_exe.c:59
__libc_start_main at /workspace/srcdir/glibc-2.29/csu/../csu/libc-start.c:308
unknown function (ip: 0x400808)


make[1]: *** [sysimage.mk:61: /workspace/srcdir/usr/lib/julia/corecompiler.ji] Error 1
make[1]: Leaving directory '/workspace/srcdir'
make: *** [Makefile:82: julia-sysimg-ji] Error 2
Previous command exited with 2

ianatol · 2022-03-31T15:47:21Z

Ah yeah, glanced at it late last night and thought it might've been a CI thing so asked Shuhei to re-run. Thanks for the logs

aviatesk · 2022-04-07T10:15:33Z

The failing test case will be fixed by #44896.

ianatol · 2022-04-07T15:30:29Z

@nanosoldier runtests(ALL, vs = ":master")

nanosoldier · 2022-04-07T23:16:55Z

Your package evaluation job has completed - possible new issues were detected. A full report can be found here.

ianatol · 2022-04-07T23:20:15Z

Let's do a re-run of failed tests to double check for real ones:

@nanosoldier runtests(["BGEN", "BackgroundSubtraction", "BayesianQuadrature", "Cassette", "CombinatorialSpaces", "DeepQLearning", "DelayDiffEq", "DimensionalData", "Distances", "EffectSizes", "ElasticArrays", "Evolutionary", "FluxTraining", "ForestBiometrics", "FunctionWrappers", "GaussianMixtureAlignment", "GigaSOM", "GridapDistributed", "ImageGeoms", "IterTools", "LokiLogger", "LoopVectorization", "LsqFit", "MLJMultivariateStatsInterface", "MatrixLMnet", "MinimallyDisruptiveCurves", "MixedModelsExtras", "Modia", "MutualInformationImageRegistration", "NaiveGAflux", "NiLang", "Optim", "PNGFiles", "QSimulator", "QuadEig", "QuantumLattices", "Quiqbox", "Reactive", "RigidBodyTools", "SimpleChains", "SlackThreads", "SnoopCompile", "SphericalHarmonicModes", "StochasticDelayDiffEq", "Sundials", "TableOperations", "TypedTables", "WavePropBase"], vs = ":master")

ianatol · 2022-04-08T00:08:47Z

Not definitive, but just wanted to make this list of test failures that look real at first glance

Inference accuracy regressions:
IterTools
DimensionalData
TypedTables

Allocation regressions:
SimpleChains
MutualInformationImageRegistration

Possibly need upstream changes:
Cassette
SnoopCompile
SlackThreads (This seems to need JET.jl upstream change to account for ret::CallMeta here)

Other:
FunctionWrappers (error message is clearly related to internals)

The unreachable instruction ones are also real as they turned up in the PkgEval on the original PR, but hopefully are all from the same issue...

nanosoldier · 2022-04-08T00:33:54Z

Your package evaluation job has completed - possible new issues were detected. A full report can be found here.

aviatesk · 2022-04-11T03:53:55Z

@nanosoldier runtests(["BGEN", "BackgroundSubtraction", "BayesianQuadrature", "Cassette", "CombinatorialSpaces", "DeepQLearning", "DelayDiffEq", "DimensionalData", "Distances", "EffectSizes", "ElasticArrays", "Evolutionary", "FluxTraining", "ForestBiometrics", "FunctionWrappers", "GaussianMixtureAlignment", "GigaSOM", "GridapDistributed", "ImageGeoms", "IterTools", "LokiLogger", "LoopVectorization", "LsqFit", "MLJMultivariateStatsInterface", "MatrixLMnet", "MinimallyDisruptiveCurves", "MixedModelsExtras", "Modia", "MutualInformationImageRegistration", "NaiveGAflux", "NiLang", "Optim", "PNGFiles", "QSimulator", "QuadEig", "QuantumLattices", "Quiqbox", "Reactive", "RigidBodyTools", "SimpleChains", "SlackThreads", "SnoopCompile", "SphericalHarmonicModes", "StochasticDelayDiffEq", "Sundials", "TableOperations", "TypedTables", "WavePropBase"], vs = ":master")

nanosoldier · 2022-04-11T04:50:02Z

Your package evaluation job has completed - possible new issues were detected. A full report can be found here.

ianatol · 2022-04-28T00:10:34Z

This will fail some abstract unionsplit tests due to 4874755

ianatol · 2022-04-28T00:20:07Z

@nanosoldier runtests(["BGEN", "BackgroundSubtraction", "BayesianQuadrature", "Cassette", "CombinatorialSpaces", "DeepQLearning", "DelayDiffEq", "DimensionalData", "Distances", "EffectSizes", "ElasticArrays", "Evolutionary", "FluxTraining", "ForestBiometrics", "FunctionWrappers", "GaussianMixtureAlignment", "GigaSOM", "GridapDistributed", "ImageGeoms", "IterTools", "LokiLogger", "LoopVectorization", "LsqFit", "MLJMultivariateStatsInterface", "MatrixLMnet", "MinimallyDisruptiveCurves", "MixedModelsExtras", "Modia", "MutualInformationImageRegistration", "NaiveGAflux", "NiLang", "Optim", "PNGFiles", "QSimulator", "QuadEig", "QuantumLattices", "Quiqbox", "Reactive", "RigidBodyTools", "SimpleChains", "SlackThreads", "SnoopCompile", "SphericalHarmonicModes", "StochasticDelayDiffEq", "Sundials", "TableOperations", "TypedTables", "WavePropBase"], vs = ":master")

nanosoldier · 2022-04-28T04:54:08Z

Your package evaluation job has completed - possible new issues were detected. A full report can be found here.

test/compiler/inference.jl

base/essentials.jl

base/compiler/utilities.jl

base/compiler/typeinfer.jl

base/compiler/tfuncs.jl

base/compiler/stmtinfo.jl

base/compiler/inferencestate.jl

aviatesk · 2022-08-31T08:04:57Z

test/compiler/EscapeAnalysis/EAUtils.jl

        if isa(linfo, InferenceResult)
            ecache = get(interp.cache, linfo, nothing)
+        elseif isa(linfo, SemiConcreteResult)
+            ecache = get(interp.cache, linfo, nothing)


This never returns cache (as interp.cache is IdDict{InferenceResult,EscapeCache} object). We should tweak callsites of get_escape_cache instead.

base/compiler/ssair/inlining.jl

base/compiler/abstractinterpretation.jl

nanosoldier · 2022-08-31T08:30:57Z

Your package evaluation job has completed - possible new issues were detected. A full report can be found here.

aviatesk · 2022-08-31T08:29:13Z

base/compiler/abstractinterpretation.jl

+            ir = codeinst_to_ir(interp, code)
+            if isa(ir, IRCode)
+                T = ir_abstract_constant_propagation(interp, mi_cache, sv, mi, ir, arginfo.argtypes)
+                if !isa(T, Type) || typeintersect(T, Bool) === Union{}


Why do we want to have this check?

base/compiler/abstractinterpretation.jl

base/compiler/inferenceresult.jl

base/compiler/ssair/irinterp.jl

aviatesk · 2022-08-31T09:12:02Z

base/compiler/ssair/irinterp.jl

+                    # All other effects already guaranteed effect free by construction
+                    if is_nothrow(effects)
+                        ir.stmts[idx][:flag] |= IR_FLAG_EFFECT_FREE
+                    end


Why do we want to handle this here rather than letting the optimizer to annotate this as like the regular absint->optimization chain?

I'd just like to generally move towards inference annotating this information if it has it. We're already doing all the analysis, so doing it over again in the optimizer is wasteful. That said, this branch is dead anyway, since we're currently requiring nothrow, but I'll plan to address that after merge.

aviatesk · 2022-08-31T09:18:18Z

2. There are certain branching patterns that would require the insertion of additional Pi nodes in order to avoid overapproximation of the lattice elements. This is one of the reasons that we currently don't do regular inference on SSA IR (the other being debuggability of lowered code). We've been thinking about some solutions to this, but this particular case is somewhat simpler than inference because 1) the branching patterns that inference can prove :consistent are limited anyway 2) regular inference may have already inserted the requisite PiNodes in which case there isn't a problem. In particular, in my testing I haven't actually seen any case of precision regressions due to this issue.

Is this still a consideration? I'm still not sure why abstract interpretation on IRCode faces this problem -- AFAIU if we obtain a type constraint on a variable (e.g. from isa branch), we may generate PiNode at the start of optimization and replace all the usages of the variable in that branch with that PiNode. Then I think we can successfully reason about that constrained type of the variable if we just look at the type of the PiNode. Am I missing something?

nanosoldier · 2022-08-31T11:21:42Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here.

vtjnash · 2022-08-31T14:32:30Z

if we obtain a type constraint on a variable (e.g. from isa branch), we may generate PiNode at the start of optimization and replace all the usages of the variable in that branch with that PiNode. Then I think we can successfully reason about that constrained type of the variable if we just look at the type of the PiNode.

We would lose the ability to further constrain and refine the value further, since we don't allow SSAValues to be used in in the Conditional type. But as Ian mentioned, it is somewhat unlikely we would consider the program to be consistent if it needed further Conditional type splitting.

ianatol · 2022-08-31T18:25:17Z

@nanosoldier runtests(["Accessors", "CSVReader", "CUTEst", "CairoMakie", "ComplexPhasePortrait", "CountdownNumbers", "FlameGraphs", "Folds", "GalaxyBrain", "MeshViz", "MolecularGraph", "MzPlots", "Pidfile", "PlantGeom", "ProfileView", "RetroCap", "SpmGrids", "StrBase", "TopOptMakie"], vs = ":master")

@nanosoldier runbenchmarks(!"scalar", vs=":master")

nanosoldier · 2022-09-01T00:14:32Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here.

nanosoldier · 2022-09-01T06:37:35Z

Your package evaluation job has completed - possible new issues were detected. A full report can be found here.

ianatol · 2022-09-01T17:50:48Z

@nanosoldier runbenchmarks("broadcast", vs=":master")

nanosoldier · 2022-09-01T18:13:02Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here.

Keno

Alright, let's get this in. There some follow-up cleanup that I have to do, but let's get this in first.

ianatol force-pushed the kf/semiconcreteeval branch from 66b45df to 6ff6eb8 Compare March 30, 2022 23:34

ianatol force-pushed the kf/semiconcreteeval branch 2 times, most recently from 9576635 to 96e4731 Compare April 5, 2022 01:57

aviatesk self-requested a review April 5, 2022 02:44

aviatesk assigned aviatesk and ianatol Apr 5, 2022

aviatesk force-pushed the kf/semiconcreteeval branch 2 times, most recently from 0f669e2 to 53cc7cd Compare April 6, 2022 07:44

aviatesk force-pushed the kf/semiconcreteeval branch from cac1d83 to ed737ad Compare April 11, 2022 03:53

ianatol mentioned this pull request Apr 22, 2022

Allow inlining methods with unmatched type parameters #45062

Merged

ianatol force-pushed the kf/semiconcreteeval branch 2 times, most recently from cf35245 to 9d5a93a Compare April 28, 2022 00:02

ianatol force-pushed the kf/semiconcreteeval branch from 9d5a93a to 52ab1f5 Compare July 22, 2022 18:47

aviatesk added the compiler:inference Type inference label Aug 31, 2022

aviatesk reviewed Aug 31, 2022

View reviewed changes

base/compiler/ssair/inlining.jl Outdated Show resolved Hide resolved

base/compiler/ssair/inlining.jl Outdated Show resolved Hide resolved

base/compiler/ssair/inlining.jl Outdated Show resolved Hide resolved

aviatesk reviewed Aug 31, 2022

View reviewed changes

ianatol force-pushed the kf/semiconcreteeval branch 2 times, most recently from bdd0d25 to 4382c9b Compare August 31, 2022 17:56

ianatol force-pushed the kf/semiconcreteeval branch from 0b7ac1a to 14382bb Compare September 1, 2022 17:49

Semi-concrete IR interpreter

3557af8

ianatol force-pushed the kf/semiconcreteeval branch from 14382bb to 3557af8 Compare September 1, 2022 18:16

ianatol requested a review from Keno September 1, 2022 18:18

Keno approved these changes Sep 1, 2022

View reviewed changes

ianatol added the merge me PR is reviewed. Merge when all tests are passing label Sep 1, 2022

Keno merged commit aa20b32 into JuliaLang:master Sep 1, 2022

aviatesk mentioned this pull request Sep 2, 2022

WIP: Semi-concrete IR interpreter #44660

Closed

ianatol removed the merge me PR is reviewed. Merge when all tests are passing label Sep 2, 2022

ianatol deleted the kf/semiconcreteeval branch September 2, 2022 00:42

aviatesk mentioned this pull request Sep 12, 2022

fix JET on nightly aviatesk/JET.jl#385

Closed

maleadt mentioned this pull request Oct 27, 2022

Inference regression because of semi-concrete interpretation #47349

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Semi-concrete IR interpreter #44803

Semi-concrete IR interpreter #44803

ianatol commented Mar 30, 2022 •

edited by aviatesk

Loading

ianatol commented Mar 31, 2022

nanosoldier commented Mar 31, 2022

aviatesk commented Mar 31, 2022

nanosoldier commented Mar 31, 2022

maleadt commented Mar 31, 2022

ianatol commented Mar 31, 2022

aviatesk commented Apr 7, 2022

ianatol commented Apr 7, 2022

nanosoldier commented Apr 7, 2022

ianatol commented Apr 7, 2022

ianatol commented Apr 8, 2022 •

edited

Loading

nanosoldier commented Apr 8, 2022

aviatesk commented Apr 11, 2022

nanosoldier commented Apr 11, 2022

ianatol commented Apr 28, 2022

ianatol commented Apr 28, 2022

nanosoldier commented Apr 28, 2022

aviatesk Aug 31, 2022

nanosoldier commented Aug 31, 2022

aviatesk Aug 31, 2022

aviatesk Aug 31, 2022

Keno Aug 31, 2022

aviatesk commented Aug 31, 2022 •

edited

Loading

nanosoldier commented Aug 31, 2022

vtjnash commented Aug 31, 2022

ianatol commented Aug 31, 2022

nanosoldier commented Sep 1, 2022

nanosoldier commented Sep 1, 2022

ianatol commented Sep 1, 2022

nanosoldier commented Sep 1, 2022

Keno left a comment

Semi-concrete IR interpreter #44803

Semi-concrete IR interpreter #44803

Conversation

ianatol commented Mar 30, 2022 • edited by aviatesk Loading

Background

This PR

Why is this a good idea

Threats to validity

Benchmark results

Current status

ianatol commented Mar 31, 2022

nanosoldier commented Mar 31, 2022

aviatesk commented Mar 31, 2022

nanosoldier commented Mar 31, 2022

maleadt commented Mar 31, 2022

ianatol commented Mar 31, 2022

aviatesk commented Apr 7, 2022

ianatol commented Apr 7, 2022

nanosoldier commented Apr 7, 2022

ianatol commented Apr 7, 2022

ianatol commented Apr 8, 2022 • edited Loading

nanosoldier commented Apr 8, 2022

aviatesk commented Apr 11, 2022

nanosoldier commented Apr 11, 2022

ianatol commented Apr 28, 2022

ianatol commented Apr 28, 2022

nanosoldier commented Apr 28, 2022

aviatesk Aug 31, 2022

Choose a reason for hiding this comment

nanosoldier commented Aug 31, 2022

aviatesk Aug 31, 2022

Choose a reason for hiding this comment

aviatesk Aug 31, 2022

Choose a reason for hiding this comment

Keno Aug 31, 2022

Choose a reason for hiding this comment

aviatesk commented Aug 31, 2022 • edited Loading

nanosoldier commented Aug 31, 2022

vtjnash commented Aug 31, 2022

ianatol commented Aug 31, 2022

nanosoldier commented Sep 1, 2022

nanosoldier commented Sep 1, 2022

ianatol commented Sep 1, 2022

nanosoldier commented Sep 1, 2022

Keno left a comment

Choose a reason for hiding this comment

ianatol commented Mar 30, 2022 •

edited by aviatesk

Loading

ianatol commented Apr 8, 2022 •

edited

Loading

aviatesk commented Aug 31, 2022 •

edited

Loading