update #5

Merged
merged 117 commits into from
May 25, 2021

Commits on May 7, 2021

  1. db5eac2
  2. 7468253
  3. b2160e7
  4. fix distro (#32771)

    tianshuo78520a authored May 7, 2021
    816afb9
  5. Fix compile error on jetson platform (#32748)

    * fix compile error on jetson platform
    LielinJiang authored May 7, 2021
    8ce6b39
  6. 3753416
  7. remove packages in __all__ (#32759)

    * [OPs] Bug fix, fix the segment mean for illegal syncthreads usage. (#32596) (#32610)
    
    * [OPs] Bug fix, fix the segment mean for illegal syncthreads usage.
    
    * remove packages in __all__
    
    * create new public api level paddle.callbacks;paddle.hub;paddle.utils.unique_name
    
    Co-authored-by: Zhong Hui <zhonghui.net@gmail.com>
    zhiboniu and ZHUI authored May 7, 2021
    a77ade0

Commits on May 8, 2021

  1. Add raw program meta optimizer (#32597)

    * add raw program, test=develop
    lilong12 authored May 8, 2021
    c1c18b0
  2. [Paddle-TRT]fix trt-converter-fc_op (#32671)

    * [Paddle-TRT]fix fc_op
    
    * [Paddle-TRT]fix fc_op
    
    * [Paddle-TRT]fix fc_op
    
    * test_trt_subgraph_pass.py
    
    * fix elementwise_op
    
    * fix elementwise_op
    
    * fix elementwise_op
    
    * fix elementwise_op.cc
    
    * op_teller.cc
    Wangzheee authored May 8, 2021
    62d848d
  3. e8e4a9c
  4. Remove np Deprecation Warning since np.bool is alias of bool (#32798)
    
    Remove np Deprecation Warning since `np.bool` is alias of `bool`
    
    The warning report from test:
    
    ```
    2021-04-30 15:29:32 /workspace/Paddle/build/python/paddle/fluid/framework.py:689: DeprecationWarning: `np.bool` is a deprecated alias for the builtin `bool`. To silence this warning, use `bool` by itself. Doing this will not modify any behavior and is safe. If you specifically wanted the numpy scalar type, use `np.bool_` here.
    2021-04-30 15:29:32 Deprecated in NumPy 1.20; for more details and guidance: https://numpy.org/devdocs/release/1.20.0-notes.html#deprecations
    2021-04-30 15:29:32   elif dtype == np.bool:
    2021-04-30 15:29:32 /workspace/Paddle/build/python/paddle/fluid/layers/utils.py:77: DeprecationWarning: Using or importing the ABCs from 'collections' instead of from 'collections.abc' is deprecated, and in 3.8 it will stop working
    2021-04-30 15:29:32   return (isinstance(seq, collections.Sequence) and
    2021-04-30 15:29:32 /workspace/Paddle/build/python/paddle/fluid/tests/unittests/test_cond.py:99: DeprecationWarning: `np.bool` is a deprecated alias for the builtin `bool`. To silence this warning, use `bool` by itself. Doing this will not modify any behavior and is safe. If you specifically wanted the numpy scalar type, use `np.bool_` here.
    ```
    zhhsplendid authored May 8, 2021
    8a42b1f
  5. [NPU] refine update_loss_scaling npu kernel (#32580)

    * refine update_loss_scaling npu kernel
    
    * add mutable_data
    
    * change Zerolike op to MemcpyAsync
    
    * delete useless code
    
    * add found_inf_vec
    
    * add memcpy if not finite
    
    * fix unittest
    pangyoki authored May 8, 2021
    4628b6f
  6. add c_identity op npu (#32787)

    * add c_identity_op_npu
    Baibaifan authored May 8, 2021
    c8affff
  7. 【heterps】support cuda11 for heterps; add profiler in oneps (#32640)

    * add trainprofiler for heterps in oneps; test=develop
    
    * add set_use_ps_gpu; test=develop
    danleifeng authored May 8, 2021
    beab956
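
As a rough illustration of the change described in "Remove np Deprecation Warning" (#32798) above, the sketch below checks for a boolean dtype without touching the deprecated `np.bool` alias. The helper name `is_bool_dtype` is hypothetical and is not taken from Paddle's `framework.py`; the only assumption is that NumPy is installed.

```python
import numpy as np


def is_bool_dtype(dtype):
    """Return True if `dtype` denotes a boolean type.

    Hypothetical helper for illustration only. It normalizes the argument
    with np.dtype() and compares against the builtin `bool`, so the
    deprecated alias `np.bool` is never referenced.
    """
    return np.dtype(dtype) == np.dtype(bool)


print(is_bool_dtype(bool))        # builtin bool
print(is_bool_dtype(np.bool_))    # NumPy scalar type
print(is_bool_dtype(np.float32))  # a non-boolean dtype
```

Normalizing through `np.dtype` lets the same check accept the builtin `bool`, the NumPy scalar type `np.bool_`, or the string "bool".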

Commits on May 10, 2021

  1. 23ab01e
  2. 92adece
  3. fd9a236
  4. 5fc734c
  5. e357cfd
  6. 5aa8faa
  7. [pslib] pslib with cmake (#32800)

    * pslib with cmake
    
    * heter util
    
    * vlog
    
    * heter server test
    
    * add dtor
    
    * cmake
    Thunderbrook authored May 10, 2021
    fbbc339
  8. 3419de5
  9. 1eb59ef

Commits on May 11, 2021

  1. 400eb9d
  2. fix ci bug

    huangjun12 authored and chajchaj committed May 11, 2021
    e2c293f
  3. fix ci coverage

    huangjun12 authored and chajchaj committed May 11, 2021
    6cd96c1
  4. improve efficiency

    huangjun12 authored and chajchaj committed May 11, 2021
    ef7e5fc
  5. fix ci coverage bug

    huangjun12 authored and chajchaj committed May 11, 2021
    e1ea895
  6. add weight data to unit test

    huangjun12 authored and chajchaj committed May 11, 2021
    9495211
  7. add ignore_index for test case

    huangjun12 authored and chajchaj committed May 11, 2021
    9cdf6bd
  8. fix ci bug

    huangjun12 authored and chajchaj committed May 11, 2021
    a9e5305
  9. Support control flow in DataParallel (#32826)

    * fix find_unused_parameters default value
    ForFishes authored May 11, 2021
    298f210
  10. fix rccl bug (#32808)

    xymyeah authored May 11, 2021
    93fce18
  11. 84eca16
  12. Fix retry error in download when exception occurs (#32816)

    * fix retry in download when exception occurs
    lyuwenyu authored May 11, 2021
    1ef2327

Commits on May 12, 2021

  1. Polish unittest test_decoupled_py_reader (#32832)

    * polish test decoupled py reader
    
    * remove bin file
    chenwhql authored May 12, 2021
    2611ed2
  2. add varbasecopy func to fix the ParamBase type bug in layers.to API (#32789)

    * add varbasecopy func to fix the ParamBase type bug in layers.to API
    
    * overload _copy_to func for ParamBase
    
    * add xpuplace
    
    * add waiting for varbasecopy completion when not blocking
    
    * fix dst_device bug
    
    * modify varbase to shared_ptr
    MingMingShangTian authored May 12, 2021
    067f558
  3. cefc063
  4. e1a4c83
  5. Optimize/fleet save (#32817)

    * fix cpp lint
    * fix save/load with unexpected value
    * fix save and user interface
    seiriosPlus authored May 12, 2021
    890f626
  6. 6b3bb79
  7. fix dataloader exit hang when join re-enter (#32827)

    * fix dataloader exit hang when join re-enter. test=develop
    heavengate authored May 12, 2021
    f1d6302
  8. 85512d6
  9. Fix comments in framework (#32861)

    * Fix comments in framework
    
    * Update framework.py
    lyuwenyu authored May 12, 2021
    e676360
  10. eff84a5
  11. 24ffcd0
  12. Polish Windows CI and open the normal GPU unittest on CI (#32794)

    * fix windows CI
    
    * fix windows CI
    zhwesky2010 authored May 12, 2021
    eeca963

Commits on May 13, 2021

  1. b60ab6b
  2. 3e47eee
  3. c3ae0d4
  4. 48fc16f
  5. [NPU] support global accumulator for adam (#32780)

    * add use_global_beta_pow
    
    * add use_global_beta_pow
    
    * update npu kernel
    
    * update python api
    
    * refine code
    
    * add ut for use_global_beta_pow
    
    * fix npu kernel
    
    * add ut for api
    
    * add ut for exception
    
    * add ut for save/load
    zhiqiu authored May 13, 2021
    dace3fd
  6. a8625aa

Commits on May 14, 2021

  1. 096b2f5
  2. b035c8b
  3. e48091d
  4. 2d9d8f5
  5. 6473e27
  6. ed9e772
  7. use the required instruction to determine if the environment fits the sample code's required. (#32766)
    
    * add unittests
    
    * add find_last_future_line_end
    
    * extract_code_blocks_from_docstr and its testcases
    
    * test_codeblock_before_examples_is_ignored
    
    * split sampcd_extract_to_file into two steps
    
    * update the codeblock element's format
    
    * code-block directive has no value options
    
    * insert the CODES_INTERTED_INTO_FRONTEND
    
    * using the new func insert_codes_into_codeblock
    
    * get_test_capacity and is_required_match
    
    * using the new functions in sampcd_extract_to_file
    
    * add some comments and refactor functions
    
    * using logger instead of all the print
    
    * remove wlist
    
    * collect summary info, and print them
    
    * call get capacity
    
    * update summary format
    
    * print the apis that don't have sample codes.
    
    * print the samples that consumed more than 10s.
    
    print time
    
    * update unittest testcases
    
    * solve ResourceWarning: unclosed file
    
    * run tools test separately
    
    * python2 does not have nonlocal keyword, using dict variable instead
    
    * remove unused import, rearrange a series of conditional statements.
    
    * remove wlist.json and its check approval
    
    * remove wlist.json and its check approval
    wadefelix authored May 14, 2021
    42aad30
  8. 479689f
  9. Fix four error messages (#32899)

    * fix two error message
    
    * fix two error message
    
    * fix error
    
    * fix error
    
    * fix error
    
    * fix error
    
    * fix some error message
    
    * fix some error
    
    * fix error
    
    * fix some error
    
    * fix some error
    
    * fix some error
    
    * fix one error
    
    * fix some error
    
    * fix seven error message
    
    * fix error
    
    * fix error
    
    * fix error
    
    * fix error
    
    * fix some error message
    
    * fix error
    
    * fix some error
    
    * fix some error
    
    * fix four error message
    
    * fix error
    
    * fix error
    Kqnonrime authored May 14, 2021
    c4787d7
  10. 15b05c7
  11. Doc of paddle.save/load (#32900)

    * doc of paddle.save/load
    
    * polish doc of paddle.save/load
    hbwx24 authored May 14, 2021
    62f6550
  12. e89fb25
  13. a3debea
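
One bullet in commit 7 above (#32766) notes that Python 2 has no `nonlocal` keyword and that a dict variable is used instead. A minimal sketch of that general workaround follows; `make_counter` and `state` are hypothetical names chosen for illustration and are not code from the Paddle tools.

```python
def make_counter():
    """Return a function that counts its own calls.

    The count lives in a mutable dict, so the inner function can update it
    by item assignment. Rebinding a plain local such as `count += 1` would
    need `nonlocal`, which Python 2 does not provide.
    """
    state = {"count": 0}

    def increment():
        # Mutating the dict does not rebind the name `state`,
        # so no `nonlocal` declaration is required.
        state["count"] += 1
        return state["count"]

    return increment


counter = make_counter()
print(counter())  # 1
print(counter())  # 2
```

The same pattern runs unchanged on Python 3, which is why it suits code that has to support both interpreters.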

Commits on May 17, 2021

  1. 585564d
  2. BugFix with ParseInputDataType from LodTensorArray (#32918)

    * BugFix with ParseInputDataType from LodTensorArray
    
    * BugFix with ParseInputDataType from LodTensorArray
    Aurelius84 authored May 17, 2021
    5f1c07d
  3. remove redundant graph files (#32924)

    * delete unused files.
    seemingwang authored May 17, 2021
    906db71
  4. [HybridParallel]Fix precision problem of model parallel (#32897)

    * fix precision of mp
    
    * fix bug of seed
    
    * fix dp
    
    * print group
    ForFishes authored May 17, 2021
    c809530

Commits on May 18, 2021

  1. fix the paddle compare op for the broadcast when the element equal (#32941)
    
    * fix the paddle compare op for the broadcast
    
    * fix compare op in for in the cuda device
    wawltor authored May 18, 2021
    c72ed82
  2. 59997d5
  3. unit double (#32902)

    * unit double
    
    * unit double
    Thunderbrook authored May 18, 2021
    29bbeb0
  4. 59b74ee
  5. [Dy2Static] Refactor param_guard logic of @to_static (#32867)

    * Add param_guard in ParameterList to support @to_static
    
    * Refactor param_guard of @to_static
    
    * fix unittest failed
    
    * add more unittest
    Aurelius84 authored May 18, 2021
    b8d493d
  6. [UnitTest]Enhance grep syntax to avoid random failed of test_dist_mnist_dgc_nccl (#32946)
    
    * Enhance grep syntax to avoid random failed
    
    * Enhance grep syntax to avoid random failed
    Aurelius84 authored May 18, 2021
    b5882c6
  7. bcd40f2
  8. add uint8 for concat (#32850)

    vslyu authored May 18, 2021
    53580bb
  9. Update paths to Quant models (#32870)

    * Update paths to Quant models
    
    * Update description
    wozna authored May 18, 2021
    5d62748
  10. [NPU] fix accuracy npu op bug and change top_k's output to int64 (#32935)
    
    * Output indices of top_k npu op change to int64
    
    * fix accuracy npu bug
    
    * fix errors
    
    * change cast method to FillNpuTensorWithConstant
    
    * change cast method to FillNpuTensorWithConstant
    pangyoki authored May 18, 2021
    c66586b

Commits on May 19, 2021

  1. d7d7fae
  2. af89a94
  3. 56008aa
  4. [Dy2Stat]BugFix StaticAnalysis with gast.Subscript (#32969)

    * BugFix StaticAnalysis with gast.Subscript
    
    * remove codes
    Aurelius84 authored May 19, 2021
    c285261
  5. remove ut from parallel_ut list (#32788)

    * remove ut from parallel_ut list
    
    * remove some timeout ut
    XieYunshen authored May 19, 2021
    f0b2f59
  6. Optimize 102Flowers dataset reading speed (#31408)

    * Fix slow data reading. In the old version, reading one epoch of this dataset took about 5371 seconds (MacBook Pro Retina, 13-inch, Early 2015, 2.7 GHz) and a batch took 211 seconds, which was too painful to use. The data is now decompressed in advance (about 10 seconds). Each epoch of reading takes about 3 seconds on the same machine, and a batch takes a little over 0.017 seconds.
    
    * Run CI, test=allcase
    
    * fix qq group number. test=document_fix
    
     fix qq group number. test=document_fix
    
    * fix qq group number. test=document_fix 
    
    fix qq group number. test=document_fix
    GT-ZhangAcer authored May 19, 2021
    67c2700
  7. [Rocm] fix test of random_crop_op & logsumexp (#32824)

    * [Rocm] fix test of random_crop_op
    
    * [Rocm] fix test of random_crop_op
    
    * [Rocm] fix test of random_crop_op & simple_rnn_op
    
    * [Rocm] fix test of random_crop_op & simple_rnn_op & logsumexp
    
    * [Rocm] fix test of random_crop_op & simple_rnn_op & logsumexp
    
    * [Rocm] fix test of random_crop_op & simple_rnn_op & logsumexp
    
    * [Rocm] fix test of random_crop_op & logsumexp
    Ray2020BD authored May 19, 2021
    aa4a56f
  8. 1e1600e
  9. CI skip inference test if only python files modified (#32962)

    * CI skip inference test if only python files modified, test=develop
    
    * fix compilation error on ROCM, test=develop
    
    * fix cmake error on PR-CI-ROCM-Compile, test=develop
    Avin0323 authored May 19, 2021
    7896b51
  10. fix test_paddle_save_load and test_paddle_save_load_binary (#32949)

    * fix test_paddle_save_load and test_paddle_save_load_binary
    
    * fix unittest:test_paddle_save_load and test_paddle_save_load_binary
    
    * delete *.pyc
    
    * add comment for unittest
    hbwx24 authored May 19, 2021
    6f8de31

Commits on May 20, 2021

  1. 1494952
  2. 7e27b5a
  3. 8854786
  4. Add complex template type (#32857)

    * add complex template file
    
    * add numtraits for complex template
    
    * add complex template type register
    
    * modify specify template of complex
    
    * modify specify template of complex
    
    * modify specify template of complex
    
    * modify specify template of complex
    
    * make TensorCheckerVisitor support complex type
    
    * fix operator= error
    
    * add complex template
    
    * add complex template type
    
    * add complex template type to pyarray transform
    
    * add complex template type to pyarray transform
    
    * remove complex type for dlpack register
    
    * set dlpack support complex type

    * set dlpack support complex type

    * set dlpack support complex type

    * remove explicit for complex constructor
    
    * add complex unit test file
    MingMingShangTian authored May 20, 2021
    738bf20
  5. 848cabf
  6. [Dy2Stat]Support convert sublayers in Sequential Container (#32978)

    * Support convert sublayers in Sequential Container
    
    * remove paddle.jit.set_code_level
    Aurelius84 authored May 20, 2021
    e409c7c
  7. be8e94a
  8. fix gather op and add logsumexp op on kunlun (#32931)

    * fix gather op and add logsumexp op on kunlun
    
    * update xpu dependence
    
    * update tests and fix elementwise_add
    tangzhiyi11 authored May 20, 2021
    a96e8bc

Commits on May 21, 2021

  1. [NPU] cast indices and label if their type is not consistent in accuracy npu op (#33016)
    
    * cast indices and label if their type is not consistent
    
    * fix bug
    
    * add unittest
    pangyoki authored May 21, 2021
    70dc5f4
  2. update conda build script for cuda11 (#29594)

    * update conda build script for cuda11
    
    * update conda build script
    
    * modified wheel name
    
    * update conda_build
    
    * fix error
    
    * add cudnn8.1 for cuda11.2
    
    * fix format error
    XieYunshen authored May 21, 2021
    44668a7
  3. a85eddd
  4. fix model_benchmark ci (#33035)

    * fix model_benchmark ci

    * fix model_benchmark ci
    tianshuo78520a authored May 21, 2021
    0e5d832
  5. optimize softmax with cross entropy hard label (#32290)

    * optimize softmax with cross entropy hard label
    
    * label ignore_index cleaning
    xingfeng01 authored May 21, 2021
    7be6191
  6. 79ed717
  7. replace complex64/128 with complex template in cast Op (#33019)

    * replace complex in set tensor from and to numpy
    
    * replace complex template in cast op
    MingMingShangTian authored May 21, 2021
    79d918d

Commits on May 22, 2021

  1. Added oneDNN matmul grad BF16/FP32 kernel (#32968)

    * added support for most matmul cases
    
    * added more functionality
    
    * full functionality of matmul op, fp32 only
    
    * added bf16 tests and functionality
    
    * added formatting
    
    * changes after review
    
    * minor change
    
    * added reviewers suggestions
    jakpiase authored May 22, 2021
    e2a3a6f
  2. refine conv2d doc (#33045)

    jerrywgz authored May 22, 2021
    a6dc68b

Commits on May 24, 2021

  1. d6aea4a
  2. fix potential overflow problem & node add & node remove & node clear (#33055)
    
    * graph engine demo
    
    * upload unsaved changes
    
    * fix dependency error
    
    * fix shard_num problem
    
    * py client
    
    * remove lock and graph-type
    
    * add load direct graph
    
    * add load direct graph
    
    * add load direct graph
    
    * batch random_sample
    
    * batch_sample_k
    
    * fix num_nodes size
    
    * batch brpc
    
    * batch brpc
    
    * add test
    
    * add test
    
    * add load_nodes; change add_node function
    
    * change sample return type to pair
    
    * resolve conflict
    
    * resolved conflict
    
    * resolved conflict
    
    * separate server and client
    
    * merge pair type
    
    * fix
    
    * resolved conflict
    
    * fixed segment fault; high-level VLOG for load edges and load nodes
    
    * random_sample return 0
    
    * rm useless loop
    
    * test:load edge
    
    * fix ret -1
    
    * test: rm sample
    
    * rm sample
    
    * random_sample return future
    
    * random_sample return int
    
    * test fake node
    
    * fixed here
    
    * memory leak
    
    * remove test code
    
    * fix return problem
    
    * add common_graph_table
    
    * random sample node &test & change data-structure from linkedList to vector
    
    * add common_graph_table
    
    * sample with srand
    
    * add node_types
    
    * optimize nodes sample
    
    * recover test
    
    * random sample
    
    * destruct weighted sampler
    
    * GraphEdgeBlob
    
    * WeightedGraphEdgeBlob to GraphEdgeBlob
    
    * WeightedGraphEdgeBlob to GraphEdgeBlob
    
    * pybind sample nodes api
    
    * pull nodes with step
    
    * fixed pull_graph_list bug; add test for pull_graph_list by step
    
    * add graph table;name
    
    * add graph table;name
    
    * add pybind
    
    * add pybind
    
    * add FeatureNode
    
    * add FeatureNode
    
    * add FeatureNode Serialize
    
    * add FeatureNode Serialize
    
    * get_feat_node
    
    * avoid local rpc
    
    * fix get_node_feat
    
    * fix get_node_feat
    
    * remove log
    
    * get_node_feat return py:bytes
    
    * merge develop with graph_engine
    
    * fix threadpool.h head
    
    * fix
    
    * fix typo
    
    * resolve conflict
    
    * fix conflict
    
    * recover lost content
    
    * fix pybind of FeatureNode
    
    * recover cmake
    
    * recover tools
    
    * resolve conflict
    
    * resolve linking problem
    
    * code style
    
    * change test_server port
    
    * fix code problems
    
    * remove shard_num config
    
    * remove redundant threads
    
    * optimize start server
    
    * remove logs
    
    * fix code problems by reviewers' suggestions
    
    * move graph files into a folder
    
    * code style change
    
    * remove graph operations from base table
    
    * optimize get_feat function of graph engine
    
    * fix long long count problem
    
    * remove redundant graph files
    
    * remove unused shell
    
    * recover dropout_op_pass.h
    
    * fix potential stack overflow when request number is too large & node add & node clear & node remove
    
    Co-authored-by: Huang Zhengjie <270018958@qq.com>
    Co-authored-by: Weiyue Su <weiyue.su@gmail.com>
    Co-authored-by: suweiyue <suweiyue@baidu.com>
    Co-authored-by: luobin06 <luobin06@baidu.com>
    Co-authored-by: liweibin02 <liweibin02@baidu.com>
    Co-authored-by: tangwei12 <tangwei12@baidu.com>
    7 people authored May 24, 2021
    60ac160
  3. d0d5586
  4. [oneDNN] bump up oneDNN to 2.2.2 (#32685)

    * - bump up oneDNN to 2.2.2 (should reduce perf drops of mobilenet)
    
    * - more recent onednn 2.2.2 (some more bugfixes)
    jczaja authored May 24, 2021
    b8e4ec7
  5. 99a11e3
  6. Revert "fix model_benchmark ci (#33035)" (#33080)

    This reverts commit 0e5d832.
    tianshuo78520a authored May 24, 2021
    6ad5ece
  7. [HybridParallel]Fix pipeline in dygraph (#33007)

    * fix pipeline
    
    * fix mp pp dp
    
    * fix utest of hybrid parallel
    
    * add utest for tuple
    ForFishes authored May 24, 2021
    4920c47

Commits on May 25, 2021

  1. 88b43b5
  2. 86ea8dc
  3. modify Ops to complex template (#33041)

    * modify conj, real, imag OP to complex template
    
    * replace with complex template to dot Op
    
    * replace with complex template to Abs Op
    
    * add support for complex64 and complex128
    MingMingShangTian authored May 25, 2021
    5fa44c3