Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

S2d #2

Merged
merged 93 commits into from
Apr 28, 2020
Merged

S2d #2

merged 93 commits into from
Apr 28, 2020

Commits on Apr 15, 2020

  1. [Relay][Frontend][TFLite] Add parser support for shape and range

    Signed-off-by: Dhruva Ray <dhruvaray@gmail.com>
    dhruvaray committed Apr 15, 2020
    Configuration menu
    Copy the full SHA
    1077352 View commit details
    Browse the repository at this point in the history

Commits on Apr 28, 2020

  1. [RELAY][PYTORCH]isNan, isinf, isfinite, ceil, clamp, round ops (apach…

    …e#5316)
    
    * [RELAY][PYTORCH]isNan, isinf, isfinite, ceil, clamp, round ops
    
    * Review comments
    siju-samuel authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    28fcb2d View commit details
    Browse the repository at this point in the history
  2. [TIR] Refactor MakePackedAPI to target dependent stage. (apache#5326)

    Previously MakePackedAPI was in the target independent stage,
    but never the less requires the device_type information that will be
    binded at a later target dependent stage.
    
    The previous implementation was due to the limitation of LoweredFunc
    which can not carry buffer_map info(so they have to be lowered right away).
    This is no longer the case after the unified IR refactor.
    
    This PR migrates MakePackedAPI to a target dependent stage
    and removes the un-necessary BindDevice pass.
    tqchen authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    16d3da1 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    2b6e845 View commit details
    Browse the repository at this point in the history
  4. [LLVM] Use llvm::FunctionCallee in IRBuilder::CreateCall with LLVM 11+ (

    apache#5338)
    
    The older variants of CreateCall have been deprecated and were recently
    removed from LLVM. This caused compilation failures.
    Krzysztof Parzyszek authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    747a4a8 View commit details
    Browse the repository at this point in the history
  5. [CI] Fix build.sh to propagate --network=host to the docker build com…

    …mand (apache#5336)
    
    * when passing --net=host to build.sh it needs to be also
       sent as --network=host to "docker build", so that both
       build and run will use the same network configuration
    leandron authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    d7c977c View commit details
    Browse the repository at this point in the history
  6. [Runtime][Relay][Cleanup] Clean up for memory pass to enable heteroge…

    …nous execution support. (apache#5324)
    
    * Cleanup type pack and unpack for tuples.
    
    * Clean up the memory_pass using common helpers
    
    * Clean up memory.cc
    
    * Refactor pass
    
    * Add doc strings
    
    * Fix CPPlint
    
    * Fix PyLint
    
    * Fix
    
    * Apply suggestions from code review
    
    Co-Authored-By: Zhi <5145158+zhiics@users.noreply.github.com>
    
    * Fix typo
    
    Co-authored-by: Zhi <5145158+zhiics@users.noreply.github.com>
    2 people authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    ab5afbc View commit details
    Browse the repository at this point in the history
  7. Windows Support for cpp_rpc (apache#4857)

    * Windows Support for cpp_rpc
    
    * Add missing patches that fix crashes under Windows
    
    * On Windows, use python to untar vs wsl
    
    * remove some CMakeLists.txt stuff
    
    * more minor CMakeLists.txt changes
    
    * Remove items from CMakeLists.txt
    
    * Minor CMakeLists.txt changes
    
    * More minor CMakeLists.txt changes
    
    * Even more minor CMakeLists.txt changes
    
    * Modify readme
    jmorrill authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    c8e933e View commit details
    Browse the repository at this point in the history
  8. [PYTORCH]Take, Topk op support (apache#5332)

    * [PYTORCH]take, topk op support
    
    * Ci Failure fix
    siju-samuel authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    19ce0a9 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    8a42257 View commit details
    Browse the repository at this point in the history
  10. [TOPI] Improve get_valid_count and nms performance for CUDA (apache#5339

    )
    
    * get_valid_count updated to have correct results
    
    * speedup nms
    
    * update nms
    
    * revert back nms
    
    * recover one test for get_valid_count
    Laurawly authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    23a5e8e View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    e1a1f55 View commit details
    Browse the repository at this point in the history
  12. [TIR] Remove ProducerConsumer and AllocateNode::new_expr (apache#5333)

    * [TIR] Remove ProducerConsumer and AllocateNode::new_expr
    
    This PR removes two legacy IR parts in TIR that are deprecated.
    
    ProducerConsumer node only serves as a hint markup and may no longer be
    informative after extensive transformations in the pass.
    If necessary, we can add related info via AttrStmt.
    
    The new_expr field in the AllocateNode is deprecated since it can just be
    replaced by a LetStmt.
    
    - Remove dependencies of passes on ProducerConsumer.
    - Remove ProducerConsumer from the IR.
    - Remove the deprecated fields (new_expr, free_function) from AllocateNode.
    
    * Fix additional testcases
    tqchen authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    d3d155c View commit details
    Browse the repository at this point in the history
  13. [BYOC] Prevent duplicate outputs in subgraph Tuple (apache#5320)

    * Fix duplicate output in partitiongraph
    
    * Add test case
    
    * Fix test_annotated_regions with duplicate compiler_end outputs
    
    * Revert "Fix duplicate output in partitiongraph"
    
    This reverts commit e1f8ef3.
    
    * Prevent duplicate outputs in Tuple in PartitionGraph
    
    * Fix lint
    
    * Add another test case for when regions are merged, and when TupleGetItem was duplicated
    
    * Pull GetFunctionOutput out of branch, improve description of GetFunctionOutput
    
    * Use std::move for GetFunctionOutput. Fix typo with testcase name
    
    * Use tvm.transform.Sequential
    Trevor Morris authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    ecdb00c View commit details
    Browse the repository at this point in the history
  14. [Tutorial, QNN] Add tutorial for loading quantized PyTorch model (apa…

    …che#5321)
    
    * add pytorch tutorial code and doc stub
    
    * add more docs
    
    * formatting, more docs
    
    * typo fix
    
    * try make sphinx happy
    
    * add performance section
    
    * type and nit fix
    
    * format fix
    masahi authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    8639224 View commit details
    Browse the repository at this point in the history
  15. [DOCS] Bring relay docs to the top-level flat view (apache#5343)

    - Changes most of the relay docs to use autosummary.
    - Bring relay API docs to the top-level flat view for easier discovery
    - Removed a few cases of re-exports.
    tqchen authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    e9ae136 View commit details
    Browse the repository at this point in the history
  16. Configuration menu
    Copy the full SHA
    17b4961 View commit details
    Browse the repository at this point in the history
  17. [RELAY][BYOC] Register pattern tables from external codegens (apache#…

    …5262)
    
    * [RELAY][BYOC] Register pattern tables from external codegens
    
    This adds utility functions to support registering
    and retrieving pattern tables used by MergeComposite for
    external codegens.
    
    Change-Id: I5be165a321440e48b15ff6aff4970e0c67496aaa
    
    * Updated DNNL tests to use pattern table mechanism
    
    * Removed pattern table standalone test
    
    * Change reg to _op
    mbaret authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    7448081 View commit details
    Browse the repository at this point in the history
  18. [RUNTIME][CRT] support DLTensor whose ndim == 0 (apache#5344)

    Signed-off-by: windclarion <windclarion@gmail.com>
    windclarion authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    b9aa07f View commit details
    Browse the repository at this point in the history
  19. [BYOC][FIX] Fix typo in "default" (apache#5348)

    Default annotations were incorrectly being named 'defualt'
    which results in them not being removed in PartitionGraph.
    mbaret authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    38819e5 View commit details
    Browse the repository at this point in the history
  20. Configuration menu
    Copy the full SHA
    b01fb67 View commit details
    Browse the repository at this point in the history
  21. Configuration menu
    Copy the full SHA
    5e3222f View commit details
    Browse the repository at this point in the history
  22. Configuration menu
    Copy the full SHA
    23e3e9e View commit details
    Browse the repository at this point in the history
  23. [Hexagon] Add hexagon_posix.cc to TVM/RT sources in the right place (a…

    …pache#5346)
    
    This file was added before the variable with TVM/RT was initialized.
    The initialization overwrote the addition.
    Krzysztof Parzyszek authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    64db78d View commit details
    Browse the repository at this point in the history
  24. [TOPI-ARM] Do not alter layout if layout is NHWC (apache#5350)

    * [TOPI-ARM] Do not alter layout if layout is NHWC
    
    * Add test.
    anijain2305 authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    9568e0b View commit details
    Browse the repository at this point in the history
  25. [TIR] Make lower_warp_memory support extent(threadIdx.x) < warp_size (a…

    …pache#5307)
    
    * support extent(threadIdx.x) < warp_size in lower_warp_memory
    
    * more docs for lower_warp_memory
    roastduck authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    dbfd277 View commit details
    Browse the repository at this point in the history
  26. Configuration menu
    Copy the full SHA
    453da00 View commit details
    Browse the repository at this point in the history
  27. docker: Drop caffe2 download progess bars (apache#5359)

    Change-Id: Ia15c3c8f41f75423814e559f6fdb062098f19464
    mshawcroft authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    7f995ce View commit details
    Browse the repository at this point in the history
  28. Configuration menu
    Copy the full SHA
    29da9ec View commit details
    Browse the repository at this point in the history
  29. [RUNTIME] FastRPC interface for Hexagon runtime (apache#5353)

    * [RUNTIME] FastRPC interface for Hexagon runtime
    
    Co-authored-by: Ravishankar Kolachana <quic_rkolacha@quicinc.com>
    Co-authored-by: Krzysztof Parzyszek <kparzysz@quicinc.com>
    
    * Explain store offset in a comment in launcher
    
    Co-authored-by: Abhikrant Sharma <quic_abhikran@quicinc.com>
    Co-authored-by: Ravishankar Kolachana <quic_rkolacha@quicinc.com>
    3 people authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    be0c661 View commit details
    Browse the repository at this point in the history
  30. [TIR][REFACTOR] Migrate low-level passes in tvm.lower to the Unified …

    …IR pass manager. (apache#5364)
    
    - Migrate BoundCheckers and Simplify
    - Migrate RewriteUnsafeSelect and RemoveNoOp
    - Migrate UnrollLoop and StorageRewrite
    - Migrate InjectDoubleBuffer and InjectVirtualThread
    - Migrate LoopPartition and Vectorize
    - Migrate CoProcSync, LiftAttrScope, InjectCopyIntrin
    
    We still keep ir_pass registerations for now.
    Need a separate PR to refactor the parts before the StorageFlatten.
    tqchen authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    9707ae5 View commit details
    Browse the repository at this point in the history
  31. [TIR] Fix lower_warp_memory when there are >1 warp buffers (apache#5368)

    * fix recursion in lower_warp_memory
    
    * post-order mutation
    roastduck authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    458814b View commit details
    Browse the repository at this point in the history
  32. Configuration menu
    Copy the full SHA
    eae387b View commit details
    Browse the repository at this point in the history
  33. Configuration menu
    Copy the full SHA
    9248e15 View commit details
    Browse the repository at this point in the history
  34. [TIR][REFACTOR] Remove te::Tensor dependencies from TIR passes. (apac…

    …he#5372)
    
    * [TIR][REFACTOR] Remove te::Tensor dependencies from TIR passes.
    
    te::Tensor is an useful object for tensor expression, but brings
    un-necessary reverse dependency in TIR nodes such as Provide and Realize.
    
    This PR is a first step to remove this dependency. We will use Buffer in all the places
    where the te::Tensor was used. The rough correspondence are:
    
    - Provide -> BufferStore
    - Realize -> BufferRealize
    - HalideCall -> BufferLoad.
    
    After this change, we can not use IRModule of PrimFuncs cleanly to represent TIR
    at any point of the optimizations. Buffer will serve as the abstraction for the TIR data
    models to represent the intermediate storages and their constraints.
    
    We still keep Realize/HalideCall and Provide as TIR nodes for now to make the change minimum.
    Right after ScheduleOps, we call SchedulePostProcToPrimFunc to canonicalize the temporary IR
    generated by TE(which contains these nodes) to the TIR.
    
    The TIR optimizations are now mostly migrated to to the pass manager.
    Followup PRs are needed to migrate the remaining few passes.
    
    * Fix dev tutorial
    tqchen authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    f635fd5 View commit details
    Browse the repository at this point in the history
  35. [PYTORCH]Unary Ops (apache#5378)

    siju-samuel authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    b5925fe View commit details
    Browse the repository at this point in the history
  36. [TIR][REFACTOR] RewriteForTensorCore -> te/schedule (apache#5379)

    * [TIR][REFACTIR] RewriteForTensorCore -> te/schedule
    
    RewriteForTensor depends on the schedule information, which makes it differ
    from a typical pass(which should get all the information from the input TIR).
    
    As a result, we refactor it as a SchedulePostProc step for now.
    We should revisit it later as we introduce more support for tensor core patterns in the TIR.
    
    * Fix VTA to fit the new IR Pattern
    tqchen authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    f7ca70d View commit details
    Browse the repository at this point in the history
  37. Configuration menu
    Copy the full SHA
    a52ab12 View commit details
    Browse the repository at this point in the history
  38. [REFACTOR][TE] Inline -> te/schedule/operation_inline.h (apache#5386)

    Rationale: inline is a transformation used in te to
    rewrite its internal expressions. It is not a formal IRModule->IRModule transform pass.
    
    Also removed the python test as the test is covered by stage.compute_inline.
    tqchen authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    118f943 View commit details
    Browse the repository at this point in the history
  39. [ARITH] Remove the legacy Simplify, migrate to Analyzer. (apache#5385)

    The legacy Simplify/CanonicalSimplify are now a thin wrapper around the Analyzer.
    This PR removes these functions and migrated every place that requires
    simplification to enforce Analyzer creation.
    The new API would encourage more Analyzer sharing and potentially enable
    context-aware analyzer-based simplification.
    tqchen authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    8499d01 View commit details
    Browse the repository at this point in the history
  40. Configuration menu
    Copy the full SHA
    1f7b94a View commit details
    Browse the repository at this point in the history
  41. Configuration menu
    Copy the full SHA
    0969181 View commit details
    Browse the repository at this point in the history
  42. [Topi, ARM] Disbale Winograd for quantized tensors. (apache#5363)

    * [Topi, ARM] Disbale Winograd for quantized tensors.
    
    * Relaxing float
    anijain2305 authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    7d52c1a View commit details
    Browse the repository at this point in the history
  43. Fix test_ir_type. (apache#5390)

    * The void return type is not None/nullptr, it's VoidType or
       TupleType([]).
    areusch authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    c4bebb8 View commit details
    Browse the repository at this point in the history
  44. Tf2 test fixups (apache#5391)

    * Fix oversight in importing tf.compat.v1 as tf.
    
    * Actually disable test for lstm in TF2.1
    
    Since the testing framework actually uses pytest, the version
    check needs to be moved.
    Ramana Radhakrishnan authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    59c867d View commit details
    Browse the repository at this point in the history
  45. Configuration menu
    Copy the full SHA
    babdf7e View commit details
    Browse the repository at this point in the history
  46. [LLVM] Use ArrayRef<int> in calls to CreateShuffleVector (apache#5399)

    This switch was made in LLVM 11. Previously this function was expecting
    mask indices of type uint32_t. This variant is now deprecated.
    Krzysztof Parzyszek authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    f448dac View commit details
    Browse the repository at this point in the history
  47. Configuration menu
    Copy the full SHA
    a407dd5 View commit details
    Browse the repository at this point in the history
  48. Factor out import of common tflite.Operator in tflite frontend. (apac…

    …he#5355)
    
    * Restructure imports in tflite frontend.
    
    These python modules are needed for every tflite file parsed.
    Factorize out imports of the common most ones.
    
    Now that the import of operator is common, asserts can be commonized.
    
    Loses 473 lines of duplication.
    
    * Only restrict to tflite.Operator
    Ramana Radhakrishnan authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    d7ec8e0 View commit details
    Browse the repository at this point in the history
  49. Configuration menu
    Copy the full SHA
    6dda520 View commit details
    Browse the repository at this point in the history
  50. Update dmlc-core to latest (apache#5401)

    tqchen authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    848a1f5 View commit details
    Browse the repository at this point in the history
  51. [TIR] Enhance Substitute, python bindings for Substitute/PostOrderVis…

    …it/IRTransform. (apache#5400)
    
    Substitute now takes a std::function to customize more replacing behaviors.
    
    Co-authored-by: Siyuan Feng <hzfengsy@sjtu.edu.cn>
    
    Co-authored-by: Siyuan Feng <hzfengsy@sjtu.edu.cn>
    2 people authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    b9aa740 View commit details
    Browse the repository at this point in the history
  52. Configuration menu
    Copy the full SHA
    6e8e8b1 View commit details
    Browse the repository at this point in the history
  53. Customize SI prefix in logging (apache#5411)

    * Customize SI prefix in logging
    
    * Include unit test
    areusch authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    697327c View commit details
    Browse the repository at this point in the history
  54. [LLVM] Replace calls to Type::getVectorNumElements (apache#5398)

    This function has recently been removed from LLVM 11. Use alternative
    way to obtain vector element count (VectorType::getNumElements) which
    works for all LLVM versions.
    Krzysztof Parzyszek authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    4b03597 View commit details
    Browse the repository at this point in the history
  55. Don't remove() TempDirectory in __del__ after atexit hook runs. (apac…

    …he#5414)
    
    * Use atexit to remove TempDirectory before interpreter shutdown.
     * Can't rely on complex functions from __del__ anyway.
     * Fixes warning message on my box:
           Exception ignored in: <function TempDirectory.__del__ at 0x12be10680>
           Traceback (most recent call last):
            File ".../tvm/python/tvm/contrib/util.py", line 55, in __del__
            File ".../tvm/python/tvm/contrib/util.py", line 51, in remove
            File "/usr/local/opt/python/Frameworks/Python.framework/Versions/3.7/lib/python3.7/shutil.py", line 509, in rmtree
            AttributeError: 'NoneType' object has no attribute 'path'
    areusch authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    b87f73c View commit details
    Browse the repository at this point in the history
  56. [TIR][REFACTOR] Remove ir_pass in favor of analysis/transform. (apach…

    …e#5415)
    
    This PR removes ir_pass(old style pass functions) in favor
    of analysis/transform(new style pass manager).
    tqchen authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    86fcfe0 View commit details
    Browse the repository at this point in the history
  57. [RUNTIME][CONTRIB] CoreML Runtime (apache#5283)

    * [RUNTIME][CONTRIB] CoreML Runtime
    
    * fix lint
    
    * fix CI
    
    * use xcrun to compile coreml model
    kazum authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    7893d20 View commit details
    Browse the repository at this point in the history
  58. Configuration menu
    Copy the full SHA
    05bfd1c View commit details
    Browse the repository at this point in the history
  59. Configuration menu
    Copy the full SHA
    e68bb67 View commit details
    Browse the repository at this point in the history
  60. Configuration menu
    Copy the full SHA
    248300b View commit details
    Browse the repository at this point in the history
  61. [cuDNN] Add cuDNN grouped convolutions support (apache#5319)

    Signed-off-by: Wei Pan <weip@nvidia.com>
    wpan11nv authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    db9e412 View commit details
    Browse the repository at this point in the history
  62. [CI] Migrate Tensorflow and Tensorflow lite in CI to 2.1.0 (apache#5392)

    * Migrate Tensorflow and TFLite in the CI up to 1.15.2
    
    The latest stable version of Tensorflow and Tensorflow lite
    in the 1.x series is 1.15.2. The tflite frontend is receiving
    support for versions of tflite > 1.14 but there is no consistent
    testing.
    
    There are 2 failures already in the source base with tf 1.15
    and I'm concerned this will just get exacerbated over time
    if we don't have CI picking this up and I view this as a stepping
    stone towards stepping CI to TF2.x.
    
    The test failures that I have commented will get issues raised
    for them as issues to be fixed.
    
    * Comment out run of qnn_mobilenet_v3_net
    
    This is another test that fails with TFlite 1.15.2
    
    * Skip the qnn_mobilenet_v3 test in the pytest fashion.
    
    * Switch docker versions to support Tensorflow 2.1.0
    
    * Fix up pytest imports and usage.
    
    * Skip these tests currently for Tensorflow 2.1.0
    Ramana Radhakrishnan authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    efdd844 View commit details
    Browse the repository at this point in the history
  63. [DOCS] Migrate some markdowns to rst, fix sphinx3 warnings (apache#5416)

    * [DOCS] Migrate some markdowns to rst, fix sphinx3 warnings
    
    * Add note block
    tqchen authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    d5560b4 View commit details
    Browse the repository at this point in the history
  64. [BYOC] Use Non-Recursive Visitor/Mutator (apache#5410)

    * Non-Recursive AnnotatedTarget and MergeAnnotation
    
    * Non-Recursive AnnotatedRegionSet and RegionMerger
    comaniac authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    af079c1 View commit details
    Browse the repository at this point in the history
  65. [RFC] Pytest environment improvements (apache#5421)

    * [RFC] Pass pytest options globally.
    
    In many places having a global pytest flag is useful . For me with the
    build and test of tvm , I would like to be able to globally pass in
    pytest options as part of development flow or CI flows where one would
    like to measure other things regularly that need measurements including
    pytest coverage data that I would like to experiment with across the stack.
    
    This has been achieved with an additional setup-pytest-env.sh file in
    tests/scripts rather than putting in something in every single task test
    script and something I would like to avoid.
    
    This now means the -v option to pytest is superfluous. I did consider
    having a pytest.ini file but that doesn't allow me to pass any old
    environment variable in and this seems to be the compromise.
    
    * Improve other use case documentation
    
    * Rationalize pytest environment.
    
    * Remove the setting from docker/with_same_user.
    * Take the opportunity to migrate common PYTHONPATH and
    TVM_PATH into the common environment setting.
    
    * Fixup vta fsim
    
    * Be more explicit with common PYTHONPATH
    
    * Fix python path for task_python_vta_fsim.sh properly
    
    * Fix nit in documentation.
    Ramana Radhakrishnan authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    0b0990c View commit details
    Browse the repository at this point in the history
  66. Configuration menu
    Copy the full SHA
    d8f4641 View commit details
    Browse the repository at this point in the history
  67. Configuration menu
    Copy the full SHA
    66c16cf View commit details
    Browse the repository at this point in the history
  68. Configuration menu
    Copy the full SHA
    f0b5a9e View commit details
    Browse the repository at this point in the history
  69. [PY][FFI] Introduce PyNativeObject, enable runtime.String to subclass…

    … str (apache#5426)
    
    To make runtime.String to work as naturally as possible in the python side,
    we make it sub-class the python's str object. Note that however, we cannot
    sub-class Object at the same time due to python's type layout constraint.
    
    We introduce a PyNativeObject class to handle this kind of object sub-classing
    and updated the FFI to handle PyNativeObject classes.
    tqchen authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    9116de1 View commit details
    Browse the repository at this point in the history
  70. [PYTORCH]where, addcdiv, addcmul op support (apache#5383)

    * [PYTORCH]Where, addcdiv, addcmul op support
    
    * Review comments fixed
    siju-samuel authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    baf6674 View commit details
    Browse the repository at this point in the history
  71. [FRONTEND][TFLITE]Gather, StridedSlice op support added (apache#4788)

    * [FRONTEND][TFLITE]Gather, StridedSlice op added
    
    * Review comments fixed
    siju-samuel authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    155601b View commit details
    Browse the repository at this point in the history
  72. Configuration menu
    Copy the full SHA
    3b5c577 View commit details
    Browse the repository at this point in the history
  73. Corrected TVM autotuning on GPU (apache#5432)

    Added missing "tir" in tvm.tir.analysis.verify_gpu_code(f, kwargs)
    JishinMaster authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    89a6237 View commit details
    Browse the repository at this point in the history
  74. [RUNTIME][OBJECT] Introduce static slots for common objects. (apache#…

    …5423)
    
    The _type_child_slots can be used to enable quick type checking optimization
    by checking the whether the type index is within the bound.
    
    This PR enables these static slots:
    
    - Introduce a static assert to avoid the scenario when a developer forget to
      _type_child_slots when the field is set for the type's parent.
    - Revamp and assign static type index to common runtime objects
    - Add a DumpTypeTable call to allow developer monitor the current situation
      of type table and offers suggestions for the slots(ideally the slots equals
      the number of children so there is no overflow.
    tqchen authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    6361483 View commit details
    Browse the repository at this point in the history
  75. [RELAY][PYTORCH]cosh,sinh,log2,log10,log1p op support (apache#5395)

    * [RELAY][PYTORCH]cosh,sinh,log2,log10,log1p op support
    
    * Review comment fixed
    
    * Gradient testcase added
    siju-samuel authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    ca93121 View commit details
    Browse the repository at this point in the history
  76. Configuration menu
    Copy the full SHA
    1bebb6e View commit details
    Browse the repository at this point in the history
  77. fix miopen pad (apache#5433)

    t-vi authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    1d331f4 View commit details
    Browse the repository at this point in the history
  78. Add TopK to ONNX Frontend (apache#5441)

    * Add TopK to ONNX Frontend
    
    * respond to review comments
    mbrookhart authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    2c9da4d View commit details
    Browse the repository at this point in the history
  79. [CodeGen] Cleanup generated code (apache#5424)

    - remove unnecessary white spaces from storage kind
    - do not start a new scope for vectorization as temporary
      variables are alll uniquely generated.
    
    The above two changes make vectorized code much cleaner.
    
    Signed-off-by: Wei Pan <weip@nvidia.com>
    wpan11nv authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    ae9a581 View commit details
    Browse the repository at this point in the history
  80. [RELAY] Move frontend utils (apache#5345)

    * [RELAY] Move frontend utils
    
    The util file currently under frontend is used from
    outside of frontend (in qnn/op/legalizations). This suggests
    that the file should be pushed up to a higher level.
    
    The benefit from this change is that importing qnn no longer
    also imports all the frontends.
    
    * Inline get_scalar_from_constant
    
    Change-Id: I1cc64e9ecb0eadb6ac0f7b62e6ea174644af4ad4
    
    * Remove util.py from Relay
    
    Change-Id: If9cd7cf3fc0bd1861a3a9b5604f338e084d8db96
    
    * Shorten functions
    
    Change-Id: Ieb537d82e6ee52421ff05a90cd00a03679ffebf2
    
    * Line length
    
    Change-Id: I1d216b7e73a060c4f118f5da50ce58b18eba907f
    mbaret authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    4c06e2e View commit details
    Browse the repository at this point in the history
  81. Configuration menu
    Copy the full SHA
    d186475 View commit details
    Browse the repository at this point in the history
  82. Configuration menu
    Copy the full SHA
    df52be0 View commit details
    Browse the repository at this point in the history
  83. Configuration menu
    Copy the full SHA
    fa42562 View commit details
    Browse the repository at this point in the history
  84. Configuration menu
    Copy the full SHA
    339c8ff View commit details
    Browse the repository at this point in the history
  85. [relay][topi] Add operation relay.nn.dilate() which calls topi.nn.dil…

    …ate() (apache#5331)
    
    * Add operation relay.nn.dilate() which calls topi.nn.dilate().
    
    * Fix typo
    
    * Set op pattern to injective
    notoraptor authored and dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    4ecb171 View commit details
    Browse the repository at this point in the history
  86. Configuration menu
    Copy the full SHA
    7717425 View commit details
    Browse the repository at this point in the history
  87. [TOPI,RELAY][TFLITE] Sparse to dense operator

    Signed-off-by: Dhruva Ray <dhruvaray@gmail.com>
    dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    2458cc3 View commit details
    Browse the repository at this point in the history
  88. use param name in documentation

    Signed-off-by: Dhruva Ray <dhruvaray@gmail.com>
    dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    69e9bd7 View commit details
    Browse the repository at this point in the history
  89. sphinx doc errors fixed

    Signed-off-by: Dhruva Ray <dhruvaray@gmail.com>
    dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    aa0f81b View commit details
    Browse the repository at this point in the history
  90. Configuration menu
    Copy the full SHA
    db3d04c View commit details
    Browse the repository at this point in the history
  91. incorporated code review comments

    Signed-off-by: Dhruva Ray <dhruvaray@gmail.com>
    dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    4e42714 View commit details
    Browse the repository at this point in the history
  92. Fixed indentation

    Signed-off-by: Dhruva Ray <dhruvaray@gmail.com>
    dhruvaray committed Apr 28, 2020
    Configuration menu
    Copy the full SHA
    aaf46d4 View commit details
    Browse the repository at this point in the history