merge latest upstream bits into our fork #7

Merged 152 commits, Aug 24, 2022

Commits
40f895d
`-mtune=`/`-mcpu=` support for x86 AMD CPU's (#6655)
LebedevRI Mar 31, 2022
f56614e
Fix GPU depredication/scalarization (#6669)
shoaibkamil Apr 1, 2022
43af5b6
Allow PyPipeline and PyFunc to realize() scalar buffers (#6674)
steven-johnson Apr 5, 2022
fdd6500
Future-proof 'processor` to `tune processor` (#6673)
LebedevRI Apr 5, 2022
9866df2
Fix ctors for Realization (#6675)
steven-johnson Apr 5, 2022
72ad2e6
`-mtune=native` CPU autodetection for AMD Zen 3 CPU (#6648)
LebedevRI Apr 6, 2022
12270a5
Bump development Halide version to 15.0.0 (#6678)
alexreinking Apr 6, 2022
ad0408e
Clean up Python extensions in python_bindings (#6670)
steven-johnson Apr 6, 2022
1d1b556
Halide::Tools::save_image() should accept buffers with `const` types …
steven-johnson Apr 6, 2022
fe96aaa
Fix "set but not used" warnings/errors (#6683)
steven-johnson Apr 7, 2022
f64bd08
Remove deprecated `Halide::Output` type (#6685)
steven-johnson Apr 7, 2022
e549be7
Remove deprecated `build()` support from Generators (#6684)
steven-johnson Apr 8, 2022
b5840f7
Drop support for LLVM12 (#6686)
steven-johnson Apr 8, 2022
887d340
Upgrade to clang-format 13 (#6689)
steven-johnson Apr 8, 2022
54f3977
Always mark _ucon as 'unused' in Codegen_C (#6691)
steven-johnson Apr 8, 2022
d568469
Add `break` to avoid 'possible unintentional fallthru' warning (#6694)
steven-johnson Apr 11, 2022
f906eba
Silence "unknown warning" in Clang 13 (#6693)
steven-johnson Apr 11, 2022
08325a4
Fixes for top-of-tree LLVM (#6697)
steven-johnson Apr 11, 2022
3944fb0
Faster `widening_mul(int16x, int16x) -> int32x` for x86 (AVX2 and SSE…
rootjalex Apr 12, 2022
009d86f
Remove deprecated versions of Func::prefetch() (#6698)
steven-johnson Apr 12, 2022
4da8932
Remove deprecated JIT handler setters (#6699)
steven-johnson Apr 12, 2022
3d7b977
Drop support for Matlab extensions (#6696)
steven-johnson Apr 12, 2022
87c0cc9
llvm no longer wants a type suffix on vst intrinsics (#6701)
abadams Apr 12, 2022
77f7f5e
Python: make Func implicitly convertible to Stage (#6702) (#6704)
steven-johnson Apr 13, 2022
60a909f
Fix type-mangling for vst on arm32 for LLVM15 (#6705)
steven-johnson Apr 14, 2022
4df3c5d
Remove the last remaining call to getPointerElementType() (#6715)
steven-johnson Apr 19, 2022
01ca823
ARM vst mangling needs to be conditional on opaque ptrs (#6716)
steven-johnson Apr 19, 2022
65ba16e
Combine string constants in combine_strings() (#6717)
steven-johnson Apr 19, 2022
460c77e
Update CodeGen_PTX_Dev to use new PassManager (#6718)
steven-johnson Apr 20, 2022
a07d3e4
Closure functions for parallel tasks should be internal, not external…
steven-johnson Apr 20, 2022
3b3e89e
Smarten type_of<> for fn ptrs; fix async_parallel for C backend (#6719)
steven-johnson Apr 20, 2022
accc644
Remove legacy::FunctionPassManager usage in Codegen_PTX_Dev (#6722)
steven-johnson Apr 21, 2022
aa384af
`get_amd_processor()`: implement detection for the rest of supported …
LebedevRI Apr 21, 2022
754018b
Add Func::output_type() method (#6724)
steven-johnson Apr 21, 2022
85b9f29
Grab-bag of minor Python fixes (#6725)
steven-johnson Apr 21, 2022
f5c77ce
Deprecate variadic-template version of Realization ctor (#6695)
steven-johnson Apr 25, 2022
86a4a59
Remove `rounding_halving_sub` and non-existent arm rhsub instructions…
rootjalex Apr 26, 2022
799c546
Augment Halide::Func to allow for constraining Type and Dimensionalit…
steven-johnson Apr 27, 2022
00f4b29
More typed-Func work (#6735)
steven-johnson Apr 28, 2022
fc0f4ed
Add missing #include <functional> in ThreadPool.h (#6738)
steven-johnson Apr 28, 2022
41b2d07
Fix regression from #6734 (#6739)
steven-johnson Apr 28, 2022
e6260a8
Add forwarding for the recently-added Func::output_type() method (#6741)
steven-johnson Apr 30, 2022
f376cbb
Silence "unscheduled update stage" warnings in msan_generator.cpp (#6…
steven-johnson Apr 30, 2022
92dfb61
Add __pycache__ to toplevel .gitignore file (#6743)
steven-johnson May 2, 2022
1606039
Revise PyStub calling convention for GeneratorParams (#6742)
steven-johnson May 4, 2022
c8531a5
Silence "may be used uninitialized" in Buffer::for_each_element() (#6…
steven-johnson May 5, 2022
557690e
Update WABT to 1.0.29 (#6748)
steven-johnson May 5, 2022
6fbf203
Update hannk README link to hosted models page (#6749)
steven-johnson May 5, 2022
47d8103
Add a `HalideError` base class to Python bindings (#6750)
steven-johnson May 6, 2022
a986078
Deprecate GeneratorContext getters with `get_` prefix (#6753)
steven-johnson May 9, 2022
a2e89d8
Add GeneratorFactoryProvider to generate_filter_main() (#6755)
steven-johnson May 10, 2022
b38b661
Deprecate disable_llvm_loop_opt (#4113) (#6754)
steven-johnson May 10, 2022
4ab4ad9
Minor metadata-related cleanups (#6759)
steven-johnson May 13, 2022
09a986e
Expand the x86 SIMD variants tested in correctness_vector_reductions …
steven-johnson May 13, 2022
cc41e65
Fix Param<T>::set_estimate for T=void (#6766)
steven-johnson May 16, 2022
25a3272
add_python_aot_extension should use FUNCTION_NAME for the .so output …
steven-johnson May 16, 2022
13a5470
Update the list of fused_pairs and run validate_fused_group for speca…
vksnk May 18, 2022
61f6af7
Add Func::type()/types(), deprecate Func::output_type()/output_types(…
steven-johnson May 19, 2022
b5f024f
Fix fundamental confusion about target/tune CPU (#6765)
LebedevRI May 19, 2022
56acc6e
Fix annoying typo in Func.h (#6774)
steven-johnson May 19, 2022
d973993
Add execute_generator() API (#6771)
steven-johnson May 23, 2022
83a90e7
Allow overriding of `Generator::init_from_context()` for debug purpos…
steven-johnson May 23, 2022
ad1e7f6
Convert some assert-only usage of output_types() -> types() (#6779)
steven-johnson May 24, 2022
d0c53fa
[miscompile] Don't de-negate and change direction of shifts-by-unsign…
LebedevRI May 26, 2022
0f7d548
Move some options from execute_generator back to generate_filter_main…
steven-johnson May 27, 2022
3ba2f94
LLVM codegen: register AA pipeline if LLVM is older than 14 (#6785)
LebedevRI May 27, 2022
25f615d
halide_type_of<>() should always be constexpr (#6790)
steven-johnson May 31, 2022
74d9909
Define an AbstractGenerator interface (#6637)
steven-johnson May 31, 2022
255ff18
hexagon_scatter test should run only if target has HVX (#6793)
steven-johnson May 31, 2022
76793b4
Add Target support for architectures with implementation specific vec…
May 31, 2022
2b29bde
slow tests should support sharding (#6780)
steven-johnson Jun 1, 2022
4f2251c
Add missing include to test_sharding.h (#6795)
steven-johnson Jun 1, 2022
e832c4f
Pacify clang-tidy (#6796)
steven-johnson Jun 1, 2022
00b5728
Silence a "possibly uninitialized" warning (#6797)
steven-johnson Jun 2, 2022
8b31327
Make all tests default to `-fvisibility=hidden` (#6799)
steven-johnson Jun 2, 2022
f712f4f
Minor typedef cleanup (#6800)
steven-johnson Jun 6, 2022
0ec2740
Fix auto_schedule/machine_params parsing (#6804)
steven-johnson Jun 6, 2022
ce75862
Rewrite strided loads of 4 in AlignLoads (#6806)
vksnk Jun 14, 2022
fc0f1f7
Fix two minor bugs triggered by an or reduction with early-out (#6807)
abadams Jun 14, 2022
0e17e67
[CMake] Mark multi-threaded tests as such (#6810)
LebedevRI Jun 27, 2022
9e5c5ce
Add support for vscale vector code generation. (#6802)
Jun 27, 2022
feba77c
Rework .gitignore (#6822)
alexreinking Jun 28, 2022
e0a9825
Update presets to format version 3 (#6824)
alexreinking Jun 28, 2022
c12f8a5
Fix for top-of-tree LLVM (#6825)
steven-johnson Jun 28, 2022
3e142cf
Tweak python apps for better Blaze/Bazel compatibility (#6823)
steven-johnson Jun 28, 2022
d36cd04
Change stub module names in Python to be _pystub rather than _stub (#…
steven-johnson Jun 29, 2022
ece5fb7
Apply CMAKE_C_COMPILER_LAUNCHER to initmod clang calls (#6831)
alexreinking Jun 30, 2022
60d2b98
Remove Python bindings from Makefiles (#6821)
alexreinking Jun 30, 2022
fac313e
Add a new, alternate JIT-call convention (#6777)
steven-johnson Jun 30, 2022
b2771c1
Scrub Python from Makefile after buildbot update (#6833)
alexreinking Jun 30, 2022
6838db0
Remove unused function in callable_generator.cpp (#6834)
steven-johnson Jun 30, 2022
23a1fa8
Disable testing for apps/linear_algebra on x86-32-linux/Make (#6836)
steven-johnson Jul 1, 2022
23c4cf1
Rearrange subdirectories in python_bindings (#6835)
steven-johnson Jul 1, 2022
29ebde9
Better lowering of halving_sub and rounding_halving_add (#6827)
abadams Jul 11, 2022
8159dd3
Check RDom::where predicates for race conditions (#6842)
alexreinking Jul 11, 2022
d266e4e
Remove Generator::value_tracker and friends (#6845)
steven-johnson Jul 11, 2022
708a320
Deprecate/remove Generator::get_externs_map() and friends (#6844)
steven-johnson Jul 11, 2022
13a43c0
Add placeholder code for bfloat16 in Python (#6849) (#6850)
steven-johnson Jul 12, 2022
bdd7114
Fix the PLUGINS argument to properly join multiple arguments (#6851)
steven-johnson Jul 13, 2022
f9c2cdf
Add autoscheduling to the generator_aot_stubuser test (#6855)
steven-johnson Jul 14, 2022
24913eb
Silence Adams2019 Autoscheduler (#6854)
steven-johnson Jul 15, 2022
b1ca334
Rework autoscheduler API (#6788) (#6838)
steven-johnson Jul 15, 2022
2d907c4
[vulkan phase0] Add adts for containers and memory allocation to runt…
derek-gerstmann Jul 15, 2022
359026a
Promote Reinterpret Intrinsic into an Reinterpret IR Node (#6853)
LebedevRI Jul 20, 2022
51c06b7
Python source reorg (#6867)
alexreinking Jul 20, 2022
967c3bf
Fix simd_op_check for top-of-tree LLVM (#6874)
steven-johnson Jul 20, 2022
9a94756
Use pmaddubsw 8-bit horizontal widening adds (Fixes #6859) (#6873)
rootjalex Jul 21, 2022
8b5486b
[Codegen_LLVM] Radically simplify `visit(const Reinterpret *op)` (#6865)
LebedevRI Jul 21, 2022
04c465b
[Codegen] Fail to codegen `Call::undef`, just like `Call::signed_inte…
LebedevRI Jul 21, 2022
06fcf94
Fix error in Makefile for Adams2019 on OSX (#6877)
steven-johnson Jul 21, 2022
c904c53
Refactor/cleanup in Autoscheduler code (#6858)
steven-johnson Jul 21, 2022
4770495
Ensure $CMAKE_{lang}_OUTPUT_EXTENSION is set before using it (#6879)
shoaibkamil Jul 22, 2022
11a049c
#6863 - Fixes to make address sanitizer happy for internal runtime cl…
derek-gerstmann Jul 22, 2022
5e69ad9
[Codegen_LLVM] Define all the things (#6866)
LebedevRI Jul 25, 2022
7821212
Add set-host-dirty/copy-to-host to PythonExtensionGen (#6869)
steven-johnson Jul 25, 2022
e3e169d
Rewrite PythonExtensionGen to be C++ based (#6888)
steven-johnson Jul 27, 2022
c8b811a
Fixes to allow compiling with LLVM16 (#6889)
steven-johnson Jul 27, 2022
3859b36
Add support for generating x86 sum-of-absolute-difference reductions …
abadams Jul 27, 2022
b9a3356
Remove (most) of the env var usage from Adams2019 (#6861)
steven-johnson Jul 27, 2022
9c25902
[vulkan phase1] Add SPIR-V IR (#6882)
derek-gerstmann Jul 29, 2022
6cc77b2
Add `auto_schedule` label to Adams2019 and Li2018 tests in CMake (#6898)
steven-johnson Jul 29, 2022
0739045
[Simplify] Drop no-op single-input identity shuffles (#6901)
LebedevRI Aug 1, 2022
e03b0e0
[Codegen_LLVM] Annotate LLVM IR functions with `nounwind`/`mustprogre…
LebedevRI Aug 1, 2022
e35654b
Don't try to fold saturating_sub of VectorReduce (#6896)
rootjalex Aug 1, 2022
703a738
Upgrade clang-format and clang-tidy to v14 (v2) (#6902)
steven-johnson Aug 1, 2022
8871404
Allow AMX instructions with K dimension larger than 4 bytes (#6582)
frengels Aug 1, 2022
2239119
Fix autoscheduling trivial lut wrappers (#6905)
abadams Aug 2, 2022
dd391e6
Fix broken Makefile rules for autoschedulers on OSX (#6906)
steven-johnson Aug 2, 2022
88e7229
Start developing pip package (#6886)
alexreinking Aug 2, 2022
0072946
LICENSE.txt: Include full text of Apache 2.0 license (not just the 'h…
steven-johnson Aug 3, 2022
a893d5e
LICENSE.txt: add spirv license (#6913)
steven-johnson Aug 3, 2022
857b045
LICENSE.txt: add BLAS license. (#6914)
steven-johnson Aug 3, 2022
cc44ee5
Upgrade CMake minimum version to 3.22 (#6916)
steven-johnson Aug 4, 2022
3a04fc0
Remove unused GHA and packaging workflows. (#6917)
alexreinking Aug 4, 2022
ffa2c36
Fix two warnings found with clang 16 (#6918)
steven-johnson Aug 4, 2022
256c4d9
Fix bug when realize condition depends on tuple call (#6915)
abadams Aug 4, 2022
9ca7560
Fix wrong install path for *.py files (#6921)
steven-johnson Aug 5, 2022
8794fac
Make use of CMake 3.22 features (#6919)
alexreinking Aug 8, 2022
1bf1599
Make saturating_cast an intrinsic (#6900)
rootjalex Aug 8, 2022
8981861
Merge branch 'inteon-latest' into brent/remerge
brent-carmer Aug 9, 2022
3e8403a
Remove guard for MCTS
brent-carmer Aug 9, 2022
92de4a1
Halide::Error should not extend std::runtime_error (#6927)
steven-johnson Aug 10, 2022
43e6a26
Rework internal PYTHONPATH maintenance (#6922)
steven-johnson Aug 10, 2022
4cdc2a1
Tutorial 10 needs to be skipped for Python when targeting Wasm (just …
steven-johnson Aug 11, 2022
b734957
Add build & test presets for release and debug CMake builds (#6934)
steven-johnson Aug 11, 2022
5e8f97b
Add ASAN support to CMake via toolchain file (#6920)
steven-johnson Aug 11, 2022
f60a8fb
Fix badly-merged CMakePresets.json file (#6936)
steven-johnson Aug 12, 2022
52b91a4
Add minimal useful implementation of extracting and concatenating bit…
abadams Aug 14, 2022
6798467
Merge branch 'main' into brent/remerge
brent-carmer Aug 15, 2022
2ce991a
Show error when user_error occurs
brent-carmer Aug 19, 2022
510ad6e
Be even more liberal about printing errors
brent-carmer Aug 19, 2022
bac2a50
fixed merge issue that omitted the PyEvictionKey.cpp from makefile
Aug 22, 2022
1 change: 1 addition & 0 deletions .clang-format-ignore
@@ -12,3 +12,4 @@
 ./tutorial
 # hexagon_remote/bin/src is also special
 ./src/runtime/hexagon_remote/bin/src
+./dependencies/spirv
15 changes: 11 additions & 4 deletions .clang-tidy
@@ -6,11 +6,14 @@ Checks: >
-*,
bugprone-*,
-bugprone-branch-clone,
-bugprone-easily-swappable-parameters,
-bugprone-exception-escape,
-bugprone-implicit-widening-of-multiplication-result,
-bugprone-integer-division,
-bugprone-narrowing-conversions,
-bugprone-reserved-identifier,
-bugprone-signed-char-misuse,
clang-diagnostic-shadow-field,
misc-*,
-misc-no-recursion,
-misc-non-private-member-variables-in-classes,
@@ -22,7 +25,11 @@ Checks: >
 modernize-make-unique,
 modernize-redundant-void-arg,
 modernize-use-bool-literals,
-modernize-use-default-member-init,
+# Disabled: there is not consensus on whether the Clang-14 behavior
+# of this checker is always desirable or not, and there isn't currently
+# a way to revert to the Clang-13 behavior. We may revisit this
+# check the next time we examine clang-tidy options.
+# modernize-use-default-member-init,
 modernize-use-emplace,
 modernize-use-equals-default,
 modernize-use-equals-delete,
@@ -51,7 +58,7 @@ Checks: >
 WarningsAsErrors: '*'
 HeaderFilterRegex: '.*'
 FormatStyle: 'file'
-CheckOptions:
-- key: modernize-use-default-member-init.UseAssignment
-value: 1
+#CheckOptions:
+# - key: modernize-use-default-member-init.UseAssignment
+# value: 1
 ...
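For context on the modernize-use-default-member-init check touched in this file: it flags constructors that initialize a member with a constant in the member-initializer list and suggests moving that default to the member's declaration. With the UseAssignment option enabled, the suggested form uses `=` rather than braces. A minimal sketch of the two styles follows; the struct and function names are hypothetical and not taken from Halide, and the sketch does not try to reproduce the Clang-13 versus Clang-14 behavior difference mentioned in the comment.

```cpp
#include <cassert>

// Style the check would flag: the constant default is written in the
// constructor's member-initializer list.
struct CounterCtorInit {
    int count;
    CounterCtorInit() : count(0) {}
};

// Style the check suggests when UseAssignment is enabled: the default is
// an assignment-style default member initializer on the declaration.
struct CounterMemberInit {
    int count = 0;
    CounterMemberInit() = default;
};

// Both styles produce the same default state; only where the default is
// written differs.
inline bool same_default_state() {
    CounterCtorInit a;
    CounterMemberInit b;
    return a.count == 0 && b.count == 0;
}
```

The two structs behave identically, so enabling or disabling the check is a style decision rather than a correctness one.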
53 changes: 0 additions & 53 deletions .github/workflows/packaging.yml

This file was deleted.

16 changes: 10 additions & 6 deletions .github/workflows/presubmit.yml
@@ -15,25 +15,29 @@ jobs:
 runs-on: ubuntu-20.04
 steps:
 - uses: actions/checkout@v2
-- uses: DoozyX/clang-format-lint-action@v0.12
+- uses: DoozyX/clang-format-lint-action@v0.14
 with:
 source: '.'
 extensions: 'h,c,cpp'
-clangFormatVersion: 12
+clangFormatVersion: 14
 check_clang_tidy:
 name: Check clang-tidy
 runs-on: ubuntu-20.04
 steps:
 - uses: actions/checkout@v2
 - name: Install clang-tidy
 run: |
+# from apt.llvm.org
+# wget -O - https://apt.llvm.org/llvm-snapshot.gpg.key | apt-key add -
+sudo apt-key adv --keyserver keyserver.ubuntu.com --recv-keys 15CF4D18AF4F7421
+sudo apt-add-repository "deb https://apt.llvm.org/$(lsb_release -sc)/ llvm-toolchain-$(lsb_release -sc)-14 main"
 sudo apt-get update
-sudo apt-get install llvm-12 clang-12 liblld-12-dev libclang-12-dev clang-tidy-12 ninja-build
+sudo apt-get install llvm-14 clang-14 liblld-14-dev libclang-14-dev clang-tidy-14 ninja-build
 - name: Run clang-tidy
 run: |
-export CC=clang-12
-export CXX=clang++-12
-export CLANG_TIDY_LLVM_INSTALL_DIR=/usr/lib/llvm-12
+export CC=clang-14
+export CXX=clang++-14
+export CLANG_TIDY_LLVM_INSTALL_DIR=/usr/lib/llvm-14
 ./run-clang-tidy.sh
 check_cmake_file_lists:
 name: Check CMake file lists