-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
S2d #2
Commits on Apr 15, 2020
-
[Relay][Frontend][TFLite] Add parser support for shape and range
Signed-off-by: Dhruva Ray <dhruvaray@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 1077352 - Browse repository at this point
Copy the full SHA 1077352View commit details
Commits on Apr 28, 2020
-
[RELAY][PYTORCH]isNan, isinf, isfinite, ceil, clamp, round ops (apach…
…e#5316) * [RELAY][PYTORCH]isNan, isinf, isfinite, ceil, clamp, round ops * Review comments
Configuration menu - View commit details
-
Copy full SHA for 28fcb2d - Browse repository at this point
Copy the full SHA 28fcb2dView commit details -
[TIR] Refactor MakePackedAPI to target dependent stage. (apache#5326)
Previously MakePackedAPI was in the target independent stage, but never the less requires the device_type information that will be binded at a later target dependent stage. The previous implementation was due to the limitation of LoweredFunc which can not carry buffer_map info(so they have to be lowered right away). This is no longer the case after the unified IR refactor. This PR migrates MakePackedAPI to a target dependent stage and removes the un-necessary BindDevice pass.
Configuration menu - View commit details
-
Copy full SHA for 16d3da1 - Browse repository at this point
Copy the full SHA 16d3da1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2b6e845 - Browse repository at this point
Copy the full SHA 2b6e845View commit details -
[LLVM] Use llvm::FunctionCallee in IRBuilder::CreateCall with LLVM 11+ (
apache#5338) The older variants of CreateCall have been deprecated and were recently removed from LLVM. This caused compilation failures.
Configuration menu - View commit details
-
Copy full SHA for 747a4a8 - Browse repository at this point
Copy the full SHA 747a4a8View commit details -
[CI] Fix build.sh to propagate --network=host to the docker build com…
…mand (apache#5336) * when passing --net=host to build.sh it needs to be also sent as --network=host to "docker build", so that both build and run will use the same network configuration
Configuration menu - View commit details
-
Copy full SHA for d7c977c - Browse repository at this point
Copy the full SHA d7c977cView commit details -
[Runtime][Relay][Cleanup] Clean up for memory pass to enable heteroge…
…nous execution support. (apache#5324) * Cleanup type pack and unpack for tuples. * Clean up the memory_pass using common helpers * Clean up memory.cc * Refactor pass * Add doc strings * Fix CPPlint * Fix PyLint * Fix * Apply suggestions from code review Co-Authored-By: Zhi <5145158+zhiics@users.noreply.github.com> * Fix typo Co-authored-by: Zhi <5145158+zhiics@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for ab5afbc - Browse repository at this point
Copy the full SHA ab5afbcView commit details -
Windows Support for cpp_rpc (apache#4857)
* Windows Support for cpp_rpc * Add missing patches that fix crashes under Windows * On Windows, use python to untar vs wsl * remove some CMakeLists.txt stuff * more minor CMakeLists.txt changes * Remove items from CMakeLists.txt * Minor CMakeLists.txt changes * More minor CMakeLists.txt changes * Even more minor CMakeLists.txt changes * Modify readme
Configuration menu - View commit details
-
Copy full SHA for c8e933e - Browse repository at this point
Copy the full SHA c8e933eView commit details -
[PYTORCH]Take, Topk op support (apache#5332)
* [PYTORCH]take, topk op support * Ci Failure fix
Configuration menu - View commit details
-
Copy full SHA for 19ce0a9 - Browse repository at this point
Copy the full SHA 19ce0a9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8a42257 - Browse repository at this point
Copy the full SHA 8a42257View commit details -
[TOPI] Improve get_valid_count and nms performance for CUDA (apache#5339
Configuration menu - View commit details
-
Copy full SHA for 23a5e8e - Browse repository at this point
Copy the full SHA 23a5e8eView commit details -
Configuration menu - View commit details
-
Copy full SHA for e1a1f55 - Browse repository at this point
Copy the full SHA e1a1f55View commit details -
[TIR] Remove ProducerConsumer and AllocateNode::new_expr (apache#5333)
* [TIR] Remove ProducerConsumer and AllocateNode::new_expr This PR removes two legacy IR parts in TIR that are deprecated. ProducerConsumer node only serves as a hint markup and may no longer be informative after extensive transformations in the pass. If necessary, we can add related info via AttrStmt. The new_expr field in the AllocateNode is deprecated since it can just be replaced by a LetStmt. - Remove dependencies of passes on ProducerConsumer. - Remove ProducerConsumer from the IR. - Remove the deprecated fields (new_expr, free_function) from AllocateNode. * Fix additional testcases
Configuration menu - View commit details
-
Copy full SHA for d3d155c - Browse repository at this point
Copy the full SHA d3d155cView commit details -
[BYOC] Prevent duplicate outputs in subgraph Tuple (apache#5320)
* Fix duplicate output in partitiongraph * Add test case * Fix test_annotated_regions with duplicate compiler_end outputs * Revert "Fix duplicate output in partitiongraph" This reverts commit e1f8ef3. * Prevent duplicate outputs in Tuple in PartitionGraph * Fix lint * Add another test case for when regions are merged, and when TupleGetItem was duplicated * Pull GetFunctionOutput out of branch, improve description of GetFunctionOutput * Use std::move for GetFunctionOutput. Fix typo with testcase name * Use tvm.transform.Sequential
Configuration menu - View commit details
-
Copy full SHA for ecdb00c - Browse repository at this point
Copy the full SHA ecdb00cView commit details -
[Tutorial, QNN] Add tutorial for loading quantized PyTorch model (apa…
…che#5321) * add pytorch tutorial code and doc stub * add more docs * formatting, more docs * typo fix * try make sphinx happy * add performance section * type and nit fix * format fix
Configuration menu - View commit details
-
Copy full SHA for 8639224 - Browse repository at this point
Copy the full SHA 8639224View commit details -
[DOCS] Bring relay docs to the top-level flat view (apache#5343)
- Changes most of the relay docs to use autosummary. - Bring relay API docs to the top-level flat view for easier discovery - Removed a few cases of re-exports.
Configuration menu - View commit details
-
Copy full SHA for e9ae136 - Browse repository at this point
Copy the full SHA e9ae136View commit details -
Configuration menu - View commit details
-
Copy full SHA for 17b4961 - Browse repository at this point
Copy the full SHA 17b4961View commit details -
[RELAY][BYOC] Register pattern tables from external codegens (apache#…
…5262) * [RELAY][BYOC] Register pattern tables from external codegens This adds utility functions to support registering and retrieving pattern tables used by MergeComposite for external codegens. Change-Id: I5be165a321440e48b15ff6aff4970e0c67496aaa * Updated DNNL tests to use pattern table mechanism * Removed pattern table standalone test * Change reg to _op
Configuration menu - View commit details
-
Copy full SHA for 7448081 - Browse repository at this point
Copy the full SHA 7448081View commit details -
[RUNTIME][CRT] support DLTensor whose ndim == 0 (apache#5344)
Signed-off-by: windclarion <windclarion@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for b9aa07f - Browse repository at this point
Copy the full SHA b9aa07fView commit details -
[BYOC][FIX] Fix typo in "default" (apache#5348)
Default annotations were incorrectly being named 'defualt' which results in them not being removed in PartitionGraph.
Configuration menu - View commit details
-
Copy full SHA for 38819e5 - Browse repository at this point
Copy the full SHA 38819e5View commit details -
Configuration menu - View commit details
-
Copy full SHA for b01fb67 - Browse repository at this point
Copy the full SHA b01fb67View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5e3222f - Browse repository at this point
Copy the full SHA 5e3222fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 23e3e9e - Browse repository at this point
Copy the full SHA 23e3e9eView commit details -
[Hexagon] Add hexagon_posix.cc to TVM/RT sources in the right place (a…
…pache#5346) This file was added before the variable with TVM/RT was initialized. The initialization overwrote the addition.
Configuration menu - View commit details
-
Copy full SHA for 64db78d - Browse repository at this point
Copy the full SHA 64db78dView commit details -
[TOPI-ARM] Do not alter layout if layout is NHWC (apache#5350)
* [TOPI-ARM] Do not alter layout if layout is NHWC * Add test.
Configuration menu - View commit details
-
Copy full SHA for 9568e0b - Browse repository at this point
Copy the full SHA 9568e0bView commit details -
[TIR] Make lower_warp_memory support extent(threadIdx.x) < warp_size (a…
…pache#5307) * support extent(threadIdx.x) < warp_size in lower_warp_memory * more docs for lower_warp_memory
Configuration menu - View commit details
-
Copy full SHA for dbfd277 - Browse repository at this point
Copy the full SHA dbfd277View commit details -
Configuration menu - View commit details
-
Copy full SHA for 453da00 - Browse repository at this point
Copy the full SHA 453da00View commit details -
docker: Drop caffe2 download progess bars (apache#5359)
Change-Id: Ia15c3c8f41f75423814e559f6fdb062098f19464
Configuration menu - View commit details
-
Copy full SHA for 7f995ce - Browse repository at this point
Copy the full SHA 7f995ceView commit details -
Configuration menu - View commit details
-
Copy full SHA for 29da9ec - Browse repository at this point
Copy the full SHA 29da9ecView commit details -
[RUNTIME] FastRPC interface for Hexagon runtime (apache#5353)
* [RUNTIME] FastRPC interface for Hexagon runtime Co-authored-by: Ravishankar Kolachana <quic_rkolacha@quicinc.com> Co-authored-by: Krzysztof Parzyszek <kparzysz@quicinc.com> * Explain store offset in a comment in launcher Co-authored-by: Abhikrant Sharma <quic_abhikran@quicinc.com> Co-authored-by: Ravishankar Kolachana <quic_rkolacha@quicinc.com>
Configuration menu - View commit details
-
Copy full SHA for be0c661 - Browse repository at this point
Copy the full SHA be0c661View commit details -
[TIR][REFACTOR] Migrate low-level passes in tvm.lower to the Unified …
…IR pass manager. (apache#5364) - Migrate BoundCheckers and Simplify - Migrate RewriteUnsafeSelect and RemoveNoOp - Migrate UnrollLoop and StorageRewrite - Migrate InjectDoubleBuffer and InjectVirtualThread - Migrate LoopPartition and Vectorize - Migrate CoProcSync, LiftAttrScope, InjectCopyIntrin We still keep ir_pass registerations for now. Need a separate PR to refactor the parts before the StorageFlatten.
Configuration menu - View commit details
-
Copy full SHA for 9707ae5 - Browse repository at this point
Copy the full SHA 9707ae5View commit details -
[TIR] Fix lower_warp_memory when there are >1 warp buffers (apache#5368)
* fix recursion in lower_warp_memory * post-order mutation
Configuration menu - View commit details
-
Copy full SHA for 458814b - Browse repository at this point
Copy the full SHA 458814bView commit details -
Configuration menu - View commit details
-
Copy full SHA for eae387b - Browse repository at this point
Copy the full SHA eae387bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 9248e15 - Browse repository at this point
Copy the full SHA 9248e15View commit details -
[TIR][REFACTOR] Remove te::Tensor dependencies from TIR passes. (apac…
…he#5372) * [TIR][REFACTOR] Remove te::Tensor dependencies from TIR passes. te::Tensor is an useful object for tensor expression, but brings un-necessary reverse dependency in TIR nodes such as Provide and Realize. This PR is a first step to remove this dependency. We will use Buffer in all the places where the te::Tensor was used. The rough correspondence are: - Provide -> BufferStore - Realize -> BufferRealize - HalideCall -> BufferLoad. After this change, we can not use IRModule of PrimFuncs cleanly to represent TIR at any point of the optimizations. Buffer will serve as the abstraction for the TIR data models to represent the intermediate storages and their constraints. We still keep Realize/HalideCall and Provide as TIR nodes for now to make the change minimum. Right after ScheduleOps, we call SchedulePostProcToPrimFunc to canonicalize the temporary IR generated by TE(which contains these nodes) to the TIR. The TIR optimizations are now mostly migrated to to the pass manager. Followup PRs are needed to migrate the remaining few passes. * Fix dev tutorial
Configuration menu - View commit details
-
Copy full SHA for f635fd5 - Browse repository at this point
Copy the full SHA f635fd5View commit details -
Configuration menu - View commit details
-
Copy full SHA for b5925fe - Browse repository at this point
Copy the full SHA b5925feView commit details -
[TIR][REFACTOR] RewriteForTensorCore -> te/schedule (apache#5379)
* [TIR][REFACTIR] RewriteForTensorCore -> te/schedule RewriteForTensor depends on the schedule information, which makes it differ from a typical pass(which should get all the information from the input TIR). As a result, we refactor it as a SchedulePostProc step for now. We should revisit it later as we introduce more support for tensor core patterns in the TIR. * Fix VTA to fit the new IR Pattern
Configuration menu - View commit details
-
Copy full SHA for f7ca70d - Browse repository at this point
Copy the full SHA f7ca70dView commit details -
Configuration menu - View commit details
-
Copy full SHA for a52ab12 - Browse repository at this point
Copy the full SHA a52ab12View commit details -
[REFACTOR][TE] Inline -> te/schedule/operation_inline.h (apache#5386)
Rationale: inline is a transformation used in te to rewrite its internal expressions. It is not a formal IRModule->IRModule transform pass. Also removed the python test as the test is covered by stage.compute_inline.
Configuration menu - View commit details
-
Copy full SHA for 118f943 - Browse repository at this point
Copy the full SHA 118f943View commit details -
[ARITH] Remove the legacy Simplify, migrate to Analyzer. (apache#5385)
The legacy Simplify/CanonicalSimplify are now a thin wrapper around the Analyzer. This PR removes these functions and migrated every place that requires simplification to enforce Analyzer creation. The new API would encourage more Analyzer sharing and potentially enable context-aware analyzer-based simplification.
Configuration menu - View commit details
-
Copy full SHA for 8499d01 - Browse repository at this point
Copy the full SHA 8499d01View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1f7b94a - Browse repository at this point
Copy the full SHA 1f7b94aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0969181 - Browse repository at this point
Copy the full SHA 0969181View commit details -
[Topi, ARM] Disbale Winograd for quantized tensors. (apache#5363)
* [Topi, ARM] Disbale Winograd for quantized tensors. * Relaxing float
Configuration menu - View commit details
-
Copy full SHA for 7d52c1a - Browse repository at this point
Copy the full SHA 7d52c1aView commit details -
Fix test_ir_type. (apache#5390)
* The void return type is not None/nullptr, it's VoidType or TupleType([]).
Configuration menu - View commit details
-
Copy full SHA for c4bebb8 - Browse repository at this point
Copy the full SHA c4bebb8View commit details -
* Fix oversight in importing tf.compat.v1 as tf. * Actually disable test for lstm in TF2.1 Since the testing framework actually uses pytest, the version check needs to be moved.
Configuration menu - View commit details
-
Copy full SHA for 59c867d - Browse repository at this point
Copy the full SHA 59c867dView commit details -
Configuration menu - View commit details
-
Copy full SHA for babdf7e - Browse repository at this point
Copy the full SHA babdf7eView commit details -
[LLVM] Use ArrayRef<int> in calls to CreateShuffleVector (apache#5399)
This switch was made in LLVM 11. Previously this function was expecting mask indices of type uint32_t. This variant is now deprecated.
Configuration menu - View commit details
-
Copy full SHA for f448dac - Browse repository at this point
Copy the full SHA f448dacView commit details -
Configuration menu - View commit details
-
Copy full SHA for a407dd5 - Browse repository at this point
Copy the full SHA a407dd5View commit details -
Factor out import of common tflite.Operator in tflite frontend. (apac…
…he#5355) * Restructure imports in tflite frontend. These python modules are needed for every tflite file parsed. Factorize out imports of the common most ones. Now that the import of operator is common, asserts can be commonized. Loses 473 lines of duplication. * Only restrict to tflite.Operator
Configuration menu - View commit details
-
Copy full SHA for d7ec8e0 - Browse repository at this point
Copy the full SHA d7ec8e0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6dda520 - Browse repository at this point
Copy the full SHA 6dda520View commit details -
Configuration menu - View commit details
-
Copy full SHA for 848a1f5 - Browse repository at this point
Copy the full SHA 848a1f5View commit details -
[TIR] Enhance Substitute, python bindings for Substitute/PostOrderVis…
…it/IRTransform. (apache#5400) Substitute now takes a std::function to customize more replacing behaviors. Co-authored-by: Siyuan Feng <hzfengsy@sjtu.edu.cn> Co-authored-by: Siyuan Feng <hzfengsy@sjtu.edu.cn>
Configuration menu - View commit details
-
Copy full SHA for b9aa740 - Browse repository at this point
Copy the full SHA b9aa740View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6e8e8b1 - Browse repository at this point
Copy the full SHA 6e8e8b1View commit details -
Customize SI prefix in logging (apache#5411)
* Customize SI prefix in logging * Include unit test
Configuration menu - View commit details
-
Copy full SHA for 697327c - Browse repository at this point
Copy the full SHA 697327cView commit details -
[LLVM] Replace calls to Type::getVectorNumElements (apache#5398)
This function has recently been removed from LLVM 11. Use alternative way to obtain vector element count (VectorType::getNumElements) which works for all LLVM versions.
Configuration menu - View commit details
-
Copy full SHA for 4b03597 - Browse repository at this point
Copy the full SHA 4b03597View commit details -
Don't remove() TempDirectory in __del__ after atexit hook runs. (apac…
…he#5414) * Use atexit to remove TempDirectory before interpreter shutdown. * Can't rely on complex functions from __del__ anyway. * Fixes warning message on my box: Exception ignored in: <function TempDirectory.__del__ at 0x12be10680> Traceback (most recent call last): File ".../tvm/python/tvm/contrib/util.py", line 55, in __del__ File ".../tvm/python/tvm/contrib/util.py", line 51, in remove File "/usr/local/opt/python/Frameworks/Python.framework/Versions/3.7/lib/python3.7/shutil.py", line 509, in rmtree AttributeError: 'NoneType' object has no attribute 'path'
Configuration menu - View commit details
-
Copy full SHA for b87f73c - Browse repository at this point
Copy the full SHA b87f73cView commit details -
[TIR][REFACTOR] Remove ir_pass in favor of analysis/transform. (apach…
…e#5415) This PR removes ir_pass(old style pass functions) in favor of analysis/transform(new style pass manager).
Configuration menu - View commit details
-
Copy full SHA for 86fcfe0 - Browse repository at this point
Copy the full SHA 86fcfe0View commit details -
[RUNTIME][CONTRIB] CoreML Runtime (apache#5283)
* [RUNTIME][CONTRIB] CoreML Runtime * fix lint * fix CI * use xcrun to compile coreml model
Configuration menu - View commit details
-
Copy full SHA for 7893d20 - Browse repository at this point
Copy the full SHA 7893d20View commit details -
Configuration menu - View commit details
-
Copy full SHA for 05bfd1c - Browse repository at this point
Copy the full SHA 05bfd1cView commit details -
fix [RUNTIME][VULKAN] vkBuffer released before memory copy command se…
…nd to GPU (apache#5388) (apache#5418)
Configuration menu - View commit details
-
Copy full SHA for e68bb67 - Browse repository at this point
Copy the full SHA e68bb67View commit details -
Configuration menu - View commit details
-
Copy full SHA for 248300b - Browse repository at this point
Copy the full SHA 248300bView commit details -
[cuDNN] Add cuDNN grouped convolutions support (apache#5319)
Signed-off-by: Wei Pan <weip@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for db9e412 - Browse repository at this point
Copy the full SHA db9e412View commit details -
[CI] Migrate Tensorflow and Tensorflow lite in CI to 2.1.0 (apache#5392)
* Migrate Tensorflow and TFLite in the CI up to 1.15.2 The latest stable version of Tensorflow and Tensorflow lite in the 1.x series is 1.15.2. The tflite frontend is receiving support for versions of tflite > 1.14 but there is no consistent testing. There are 2 failures already in the source base with tf 1.15 and I'm concerned this will just get exacerbated over time if we don't have CI picking this up and I view this as a stepping stone towards stepping CI to TF2.x. The test failures that I have commented will get issues raised for them as issues to be fixed. * Comment out run of qnn_mobilenet_v3_net This is another test that fails with TFlite 1.15.2 * Skip the qnn_mobilenet_v3 test in the pytest fashion. * Switch docker versions to support Tensorflow 2.1.0 * Fix up pytest imports and usage. * Skip these tests currently for Tensorflow 2.1.0
Configuration menu - View commit details
-
Copy full SHA for efdd844 - Browse repository at this point
Copy the full SHA efdd844View commit details -
[DOCS] Migrate some markdowns to rst, fix sphinx3 warnings (apache#5416)
* [DOCS] Migrate some markdowns to rst, fix sphinx3 warnings * Add note block
Configuration menu - View commit details
-
Copy full SHA for d5560b4 - Browse repository at this point
Copy the full SHA d5560b4View commit details -
[BYOC] Use Non-Recursive Visitor/Mutator (apache#5410)
* Non-Recursive AnnotatedTarget and MergeAnnotation * Non-Recursive AnnotatedRegionSet and RegionMerger
Configuration menu - View commit details
-
Copy full SHA for af079c1 - Browse repository at this point
Copy the full SHA af079c1View commit details -
[RFC] Pytest environment improvements (apache#5421)
* [RFC] Pass pytest options globally. In many places having a global pytest flag is useful . For me with the build and test of tvm , I would like to be able to globally pass in pytest options as part of development flow or CI flows where one would like to measure other things regularly that need measurements including pytest coverage data that I would like to experiment with across the stack. This has been achieved with an additional setup-pytest-env.sh file in tests/scripts rather than putting in something in every single task test script and something I would like to avoid. This now means the -v option to pytest is superfluous. I did consider having a pytest.ini file but that doesn't allow me to pass any old environment variable in and this seems to be the compromise. * Improve other use case documentation * Rationalize pytest environment. * Remove the setting from docker/with_same_user. * Take the opportunity to migrate common PYTHONPATH and TVM_PATH into the common environment setting. * Fixup vta fsim * Be more explicit with common PYTHONPATH * Fix python path for task_python_vta_fsim.sh properly * Fix nit in documentation.
Configuration menu - View commit details
-
Copy full SHA for 0b0990c - Browse repository at this point
Copy the full SHA 0b0990cView commit details -
Configuration menu - View commit details
-
Copy full SHA for d8f4641 - Browse repository at this point
Copy the full SHA d8f4641View commit details -
Configuration menu - View commit details
-
Copy full SHA for 66c16cf - Browse repository at this point
Copy the full SHA 66c16cfView commit details -
Configuration menu - View commit details
-
Copy full SHA for f0b5a9e - Browse repository at this point
Copy the full SHA f0b5a9eView commit details -
[PY][FFI] Introduce PyNativeObject, enable runtime.String to subclass…
… str (apache#5426) To make runtime.String to work as naturally as possible in the python side, we make it sub-class the python's str object. Note that however, we cannot sub-class Object at the same time due to python's type layout constraint. We introduce a PyNativeObject class to handle this kind of object sub-classing and updated the FFI to handle PyNativeObject classes.
Configuration menu - View commit details
-
Copy full SHA for 9116de1 - Browse repository at this point
Copy the full SHA 9116de1View commit details -
[PYTORCH]where, addcdiv, addcmul op support (apache#5383)
* [PYTORCH]Where, addcdiv, addcmul op support * Review comments fixed
Configuration menu - View commit details
-
Copy full SHA for baf6674 - Browse repository at this point
Copy the full SHA baf6674View commit details -
[FRONTEND][TFLITE]Gather, StridedSlice op support added (apache#4788)
* [FRONTEND][TFLITE]Gather, StridedSlice op added * Review comments fixed
Configuration menu - View commit details
-
Copy full SHA for 155601b - Browse repository at this point
Copy the full SHA 155601bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3b5c577 - Browse repository at this point
Copy the full SHA 3b5c577View commit details -
Corrected TVM autotuning on GPU (apache#5432)
Added missing "tir" in tvm.tir.analysis.verify_gpu_code(f, kwargs)
Configuration menu - View commit details
-
Copy full SHA for 89a6237 - Browse repository at this point
Copy the full SHA 89a6237View commit details -
[RUNTIME][OBJECT] Introduce static slots for common objects. (apache#…
…5423) The _type_child_slots can be used to enable quick type checking optimization by checking the whether the type index is within the bound. This PR enables these static slots: - Introduce a static assert to avoid the scenario when a developer forget to _type_child_slots when the field is set for the type's parent. - Revamp and assign static type index to common runtime objects - Add a DumpTypeTable call to allow developer monitor the current situation of type table and offers suggestions for the slots(ideally the slots equals the number of children so there is no overflow.
Configuration menu - View commit details
-
Copy full SHA for 6361483 - Browse repository at this point
Copy the full SHA 6361483View commit details -
[RELAY][PYTORCH]cosh,sinh,log2,log10,log1p op support (apache#5395)
* [RELAY][PYTORCH]cosh,sinh,log2,log10,log1p op support * Review comment fixed * Gradient testcase added
Configuration menu - View commit details
-
Copy full SHA for ca93121 - Browse repository at this point
Copy the full SHA ca93121View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1bebb6e - Browse repository at this point
Copy the full SHA 1bebb6eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1d331f4 - Browse repository at this point
Copy the full SHA 1d331f4View commit details -
Add TopK to ONNX Frontend (apache#5441)
* Add TopK to ONNX Frontend * respond to review comments
Configuration menu - View commit details
-
Copy full SHA for 2c9da4d - Browse repository at this point
Copy the full SHA 2c9da4dView commit details -
[CodeGen] Cleanup generated code (apache#5424)
- remove unnecessary white spaces from storage kind - do not start a new scope for vectorization as temporary variables are alll uniquely generated. The above two changes make vectorized code much cleaner. Signed-off-by: Wei Pan <weip@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for ae9a581 - Browse repository at this point
Copy the full SHA ae9a581View commit details -
[RELAY] Move frontend utils (apache#5345)
* [RELAY] Move frontend utils The util file currently under frontend is used from outside of frontend (in qnn/op/legalizations). This suggests that the file should be pushed up to a higher level. The benefit from this change is that importing qnn no longer also imports all the frontends. * Inline get_scalar_from_constant Change-Id: I1cc64e9ecb0eadb6ac0f7b62e6ea174644af4ad4 * Remove util.py from Relay Change-Id: If9cd7cf3fc0bd1861a3a9b5604f338e084d8db96 * Shorten functions Change-Id: Ieb537d82e6ee52421ff05a90cd00a03679ffebf2 * Line length Change-Id: I1d216b7e73a060c4f118f5da50ce58b18eba907f
Configuration menu - View commit details
-
Copy full SHA for 4c06e2e - Browse repository at this point
Copy the full SHA 4c06e2eView commit details -
Configuration menu - View commit details
-
Copy full SHA for d186475 - Browse repository at this point
Copy the full SHA d186475View commit details -
Configuration menu - View commit details
-
Copy full SHA for df52be0 - Browse repository at this point
Copy the full SHA df52be0View commit details -
Configuration menu - View commit details
-
Copy full SHA for fa42562 - Browse repository at this point
Copy the full SHA fa42562View commit details -
Configuration menu - View commit details
-
Copy full SHA for 339c8ff - Browse repository at this point
Copy the full SHA 339c8ffView commit details -
[relay][topi] Add operation relay.nn.dilate() which calls topi.nn.dil…
…ate() (apache#5331) * Add operation relay.nn.dilate() which calls topi.nn.dilate(). * Fix typo * Set op pattern to injective
Configuration menu - View commit details
-
Copy full SHA for 4ecb171 - Browse repository at this point
Copy the full SHA 4ecb171View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7717425 - Browse repository at this point
Copy the full SHA 7717425View commit details -
[TOPI,RELAY][TFLITE] Sparse to dense operator
Signed-off-by: Dhruva Ray <dhruvaray@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 2458cc3 - Browse repository at this point
Copy the full SHA 2458cc3View commit details -
use param name in documentation
Signed-off-by: Dhruva Ray <dhruvaray@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 69e9bd7 - Browse repository at this point
Copy the full SHA 69e9bd7View commit details -
Signed-off-by: Dhruva Ray <dhruvaray@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for aa0f81b - Browse repository at this point
Copy the full SHA aa0f81bView commit details -
Configuration menu - View commit details
-
Copy full SHA for db3d04c - Browse repository at this point
Copy the full SHA db3d04cView commit details -
incorporated code review comments
Signed-off-by: Dhruva Ray <dhruvaray@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 4e42714 - Browse repository at this point
Copy the full SHA 4e42714View commit details -
Configuration menu - View commit details
-
Copy full SHA for aaf46d4 - Browse repository at this point
Copy the full SHA aaf46d4View commit details