S2d #2

dhruvaray · 2020-04-28T11:36:34Z

Thanks for contributing to TVM! Please refer to guideline https://tvm.apache.org/docs/contribute/ for useful information and tips. After the pull request is submitted, please request code reviews from Reviewers by @ them in the pull request thread.

Signed-off-by: Dhruva Ray <dhruvaray@gmail.com>

…e#5316) * [RELAY][PYTORCH]isNan, isinf, isfinite, ceil, clamp, round ops * Review comments

Previously MakePackedAPI was in the target independent stage, but never the less requires the device_type information that will be binded at a later target dependent stage. The previous implementation was due to the limitation of LoweredFunc which can not carry buffer_map info(so they have to be lowered right away). This is no longer the case after the unified IR refactor. This PR migrates MakePackedAPI to a target dependent stage and removes the un-necessary BindDevice pass.

apache#5338) The older variants of CreateCall have been deprecated and were recently removed from LLVM. This caused compilation failures.

…mand (apache#5336) * when passing --net=host to build.sh it needs to be also sent as --network=host to "docker build", so that both build and run will use the same network configuration

…nous execution support. (apache#5324) * Cleanup type pack and unpack for tuples. * Clean up the memory_pass using common helpers * Clean up memory.cc * Refactor pass * Add doc strings * Fix CPPlint * Fix PyLint * Fix * Apply suggestions from code review Co-Authored-By: Zhi <5145158+zhiics@users.noreply.github.com> * Fix typo Co-authored-by: Zhi <5145158+zhiics@users.noreply.github.com>

* Windows Support for cpp_rpc * Add missing patches that fix crashes under Windows * On Windows, use python to untar vs wsl * remove some CMakeLists.txt stuff * more minor CMakeLists.txt changes * Remove items from CMakeLists.txt * Minor CMakeLists.txt changes * More minor CMakeLists.txt changes * Even more minor CMakeLists.txt changes * Modify readme

* [PYTORCH]take, topk op support * Ci Failure fix

) * get_valid_count updated to have correct results * speedup nms * update nms * revert back nms * recover one test for get_valid_count

…pache#5335)

* [TIR] Remove ProducerConsumer and AllocateNode::new_expr This PR removes two legacy IR parts in TIR that are deprecated. ProducerConsumer node only serves as a hint markup and may no longer be informative after extensive transformations in the pass. If necessary, we can add related info via AttrStmt. The new_expr field in the AllocateNode is deprecated since it can just be replaced by a LetStmt. - Remove dependencies of passes on ProducerConsumer. - Remove ProducerConsumer from the IR. - Remove the deprecated fields (new_expr, free_function) from AllocateNode. * Fix additional testcases

* Fix duplicate output in partitiongraph * Add test case * Fix test_annotated_regions with duplicate compiler_end outputs * Revert "Fix duplicate output in partitiongraph" This reverts commit e1f8ef3. * Prevent duplicate outputs in Tuple in PartitionGraph * Fix lint * Add another test case for when regions are merged, and when TupleGetItem was duplicated * Pull GetFunctionOutput out of branch, improve description of GetFunctionOutput * Use std::move for GetFunctionOutput. Fix typo with testcase name * Use tvm.transform.Sequential

…che#5321) * add pytorch tutorial code and doc stub * add more docs * formatting, more docs * typo fix * try make sphinx happy * add performance section * type and nit fix * format fix

- Changes most of the relay docs to use autosummary. - Bring relay API docs to the top-level flat view for easier discovery - Removed a few cases of re-exports.

…5262) * [RELAY][BYOC] Register pattern tables from external codegens This adds utility functions to support registering and retrieving pattern tables used by MergeComposite for external codegens. Change-Id: I5be165a321440e48b15ff6aff4970e0c67496aaa * Updated DNNL tests to use pattern table mechanism * Removed pattern table standalone test * Change reg to _op

Signed-off-by: windclarion <windclarion@gmail.com>

Default annotations were incorrectly being named 'defualt' which results in them not being removed in PartitionGraph.

…he#5349)

…pache#5346) This file was added before the variable with TVM/RT was initialized. The initialization overwrote the addition.

* [TOPI-ARM] Do not alter layout if layout is NHWC * Add test.

…pache#5307) * support extent(threadIdx.x) < warp_size in lower_warp_memory * more docs for lower_warp_memory

Change-Id: Ia15c3c8f41f75423814e559f6fdb062098f19464

…5365)

* [RUNTIME] FastRPC interface for Hexagon runtime Co-authored-by: Ravishankar Kolachana <quic_rkolacha@quicinc.com> Co-authored-by: Krzysztof Parzyszek <kparzysz@quicinc.com> * Explain store offset in a comment in launcher Co-authored-by: Abhikrant Sharma <quic_abhikran@quicinc.com> Co-authored-by: Ravishankar Kolachana <quic_rkolacha@quicinc.com>

* Non-Recursive AnnotatedTarget and MergeAnnotation * Non-Recursive AnnotatedRegionSet and RegionMerger

* [RFC] Pass pytest options globally. In many places having a global pytest flag is useful . For me with the build and test of tvm , I would like to be able to globally pass in pytest options as part of development flow or CI flows where one would like to measure other things regularly that need measurements including pytest coverage data that I would like to experiment with across the stack. This has been achieved with an additional setup-pytest-env.sh file in tests/scripts rather than putting in something in every single task test script and something I would like to avoid. This now means the -v option to pytest is superfluous. I did consider having a pytest.ini file but that doesn't allow me to pass any old environment variable in and this seems to be the compromise. * Improve other use case documentation * Rationalize pytest environment. * Remove the setting from docker/with_same_user. * Take the opportunity to migrate common PYTHONPATH and TVM_PATH into the common environment setting. * Fixup vta fsim * Be more explicit with common PYTHONPATH * Fix python path for task_python_vta_fsim.sh properly * Fix nit in documentation.

… str (apache#5426) To make runtime.String to work as naturally as possible in the python side, we make it sub-class the python's str object. Note that however, we cannot sub-class Object at the same time due to python's type layout constraint. We introduce a PyNativeObject class to handle this kind of object sub-classing and updated the FFI to handle PyNativeObject classes.

* [PYTORCH]Where, addcdiv, addcmul op support * Review comments fixed

* [FRONTEND][TFLITE]Gather, StridedSlice op added * Review comments fixed

…che#5431)

Added missing "tir" in tvm.tir.analysis.verify_gpu_code(f, kwargs)

…5423) The _type_child_slots can be used to enable quick type checking optimization by checking the whether the type index is within the bound. This PR enables these static slots: - Introduce a static assert to avoid the scenario when a developer forget to _type_child_slots when the field is set for the type's parent. - Revamp and assign static type index to common runtime objects - Add a DumpTypeTable call to allow developer monitor the current situation of type table and offers suggestions for the slots(ideally the slots equals the number of children so there is no overflow.

* [RELAY][PYTORCH]cosh,sinh,log2,log10,log1p op support * Review comment fixed * Gradient testcase added

* Add TopK to ONNX Frontend * respond to review comments

- remove unnecessary white spaces from storage kind - do not start a new scope for vectorization as temporary variables are alll uniquely generated. The above two changes make vectorized code much cleaner. Signed-off-by: Wei Pan <weip@nvidia.com>

* [RELAY] Move frontend utils The util file currently under frontend is used from outside of frontend (in qnn/op/legalizations). This suggests that the file should be pushed up to a higher level. The benefit from this change is that importing qnn no longer also imports all the frontends. * Inline get_scalar_from_constant Change-Id: I1cc64e9ecb0eadb6ac0f7b62e6ea174644af4ad4 * Remove util.py from Relay Change-Id: If9cd7cf3fc0bd1861a3a9b5604f338e084d8db96 * Shorten functions Change-Id: Ieb537d82e6ee52421ff05a90cd00a03679ffebf2 * Line length Change-Id: I1d216b7e73a060c4f118f5da50ce58b18eba907f

…ate() (apache#5331) * Add operation relay.nn.dilate() which calls topi.nn.dilate(). * Fix typo * Set op pattern to injective

apache#5451)

Signed-off-by: Dhruva Ray <dhruvaray@gmail.com>

dhruvaray and others added 30 commits April 15, 2020 09:56

[Relay][Frontend][TFLite] Add parser support for shape and range

1077352

Signed-off-by: Dhruva Ray <dhruvaray@gmail.com>

[RELAY][PYTORCH]isNan, isinf, isfinite, ceil, clamp, round ops (apach…

28fcb2d

…e#5316) * [RELAY][PYTORCH]isNan, isinf, isfinite, ceil, clamp, round ops * Review comments

[RELAY] Remove re-exports of tvm.transform (apache#5337)

2b6e845

[LLVM] Use llvm::FunctionCallee in IRBuilder::CreateCall with LLVM 11+ (

747a4a8

apache#5338) The older variants of CreateCall have been deprecated and were recently removed from LLVM. This caused compilation failures.

[CI] Fix build.sh to propagate --network=host to the docker build com…

d7c977c

…mand (apache#5336) * when passing --net=host to build.sh it needs to be also sent as --network=host to "docker build", so that both build and run will use the same network configuration

[PYTORCH]Take, Topk op support (apache#5332)

19ce0a9

* [PYTORCH]take, topk op support * Ci Failure fix

[TOPI] Using x86 schedules for ARM conv2d. (apache#5334)

8a42257

[TOPI] Improve get_valid_count and nms performance for CUDA (apache#5339

23a5e8e

) * get_valid_count updated to have correct results * speedup nms * update nms * revert back nms * recover one test for get_valid_count

[PYTHON] Enhance with_attr API, cleanup MakeAPILegacy in testcases (a…

e1a1f55

…pache#5335)

[Tutorial, QNN] Add tutorial for loading quantized PyTorch model (apa…

8639224

…che#5321) * add pytorch tutorial code and doc stub * add more docs * formatting, more docs * typo fix * try make sphinx happy * add performance section * type and nit fix * format fix

[DOCS] Bring relay docs to the top-level flat view (apache#5343)

e9ae136

- Changes most of the relay docs to use autosummary. - Bring relay API docs to the top-level flat view for easier discovery - Removed a few cases of re-exports.

[TOPI][PYTORCH]Logical & Bitwise operator support (apache#5341)

17b4961

[RUNTIME][CRT] support DLTensor whose ndim == 0 (apache#5344)

b9aa07f

Signed-off-by: windclarion <windclarion@gmail.com>

[BYOC][FIX] Fix typo in "default" (apache#5348)

38819e5

Default annotations were incorrectly being named 'defualt' which results in them not being removed in PartitionGraph.

enable tsim and fsim for GPU build (apache#5352)

b01fb67

[CRT]Compilation warnings fixed for 32bit and 64bit compilation (apac…

5e3222f

…he#5349)

[PYTORCH]Tensor creation ops support (apache#5347)

23e3e9e

[Hexagon] Add hexagon_posix.cc to TVM/RT sources in the right place (a…

64db78d

…pache#5346) This file was added before the variable with TVM/RT was initialized. The initialization overwrote the addition.

[TOPI-ARM] Do not alter layout if layout is NHWC (apache#5350)

9568e0b

* [TOPI-ARM] Do not alter layout if layout is NHWC * Add test.

[TIR] Make lower_warp_memory support extent(threadIdx.x) < warp_size (a…

dbfd277

…pache#5307) * support extent(threadIdx.x) < warp_size in lower_warp_memory * more docs for lower_warp_memory

[RELAY][PYTORCH]GroupNorm op support added (apache#5358)

453da00

docker: Drop caffe2 download progess bars (apache#5359)

7f995ce

Change-Id: Ia15c3c8f41f75423814e559f6fdb062098f19464

fix fuse over functions that are handled by external codegen (apache#…

29da9ec

…5365)

comaniac and others added 29 commits April 28, 2020 14:16

[BYOC] Use Non-Recursive Visitor/Mutator (apache#5410)

af079c1

* Non-Recursive AnnotatedTarget and MergeAnnotation * Non-Recursive AnnotatedRegionSet and RegionMerger

[MXNET]DepthToSpace & SpaceToDepth Operator (apache#5408)

d8f4641

Add option to specify flatbuffers location (apache#5425)

66c16cf

[FRONTEND][MXNET] support elemwise logic ops (apache#5361)

f0b5a9e

[PYTORCH]where, addcdiv, addcmul op support (apache#5383)

baf6674

* [PYTORCH]Where, addcdiv, addcmul op support * Review comments fixed

[FRONTEND][TFLITE]Gather, StridedSlice op support added (apache#4788)

155601b

* [FRONTEND][TFLITE]Gather, StridedSlice op added * Review comments fixed

misc fixes for ROCm (pointer lifetime, runtime::String refactor) (apa…

3b5c577

…che#5431)

Corrected TVM autotuning on GPU (apache#5432)

89a6237

Added missing "tir" in tvm.tir.analysis.verify_gpu_code(f, kwargs)

[RELAY][PYTORCH]cosh,sinh,log2,log10,log1p op support (apache#5395)

ca93121

* [RELAY][PYTORCH]cosh,sinh,log2,log10,log1p op support * Review comment fixed * Gradient testcase added

[PYTORCH]Rsub, Embedded, OneHot ops support (apache#5434)

1bebb6e

fix miopen pad (apache#5433)

1d331f4

Add TopK to ONNX Frontend (apache#5441)

2c9da4d

* Add TopK to ONNX Frontend * respond to review comments

[KERAS]Embedding layer (apache#5444)

d186475

[Docs] VTA install doc migration from md to rst (apache#5442)

df52be0

Improve IntervalSet's floormod (apache#5367)

fa42562

[ONNX]GatherNd, Round, IsNaN, IsInf (apache#5445)

339c8ff

[relay][topi] Add operation relay.nn.dilate() which calls topi.nn.dil…

4ecb171

…ate() (apache#5331) * Add operation relay.nn.dilate() which calls topi.nn.dilate(). * Fix typo * Set op pattern to injective

[Pytorch] fix translation of transpose when axis argument is as a list (

7717425

apache#5451)

[TOPI,RELAY][TFLITE] Sparse to dense operator

2458cc3

Signed-off-by: Dhruva Ray <dhruvaray@gmail.com>

use param name in documentation

69e9bd7

Signed-off-by: Dhruva Ray <dhruvaray@gmail.com>

sphinx doc errors fixed

aa0f81b

Signed-off-by: Dhruva Ray <dhruvaray@gmail.com>

Merge remote-tracking branch 'upstream/master' into s2d

db3d04c

incorporated code review comments

4e42714

Signed-off-by: Dhruva Ray <dhruvaray@gmail.com>

Fixed indentation

aaf46d4

Signed-off-by: Dhruva Ray <dhruvaray@gmail.com>

dhruvaray merged commit 3da94a0 into master2 Apr 28, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

S2d #2

S2d #2

dhruvaray commented Apr 28, 2020

S2d #2

S2d #2

Conversation

dhruvaray commented Apr 28, 2020