-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
update #5
update #5
Commits on May 7, 2021
-
Configuration menu - View commit details
-
Copy full SHA for db5eac2 - Browse repository at this point
Copy the full SHA db5eac2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7468253 - Browse repository at this point
Copy the full SHA 7468253View commit details -
Configuration menu - View commit details
-
Copy full SHA for b2160e7 - Browse repository at this point
Copy the full SHA b2160e7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 816afb9 - Browse repository at this point
Copy the full SHA 816afb9View commit details -
Fix compile error on jetson platform (#32748)
* fix compile error on jetson platform
Configuration menu - View commit details
-
Copy full SHA for 8ce6b39 - Browse repository at this point
Copy the full SHA 8ce6b39View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3753416 - Browse repository at this point
Copy the full SHA 3753416View commit details -
remove packages in __all__ (#32759)
* [OPs] Bug fix, fix the segment mean for illegal syncthreads usage. (#32596) (#32610) * [OPs] Bug fix, fix the segment mean for illegal syncthreads usage. * remove packages in __all__ * create new public api level paddle.callbacks;paddle.hub;paddle.utils.unique_name Co-authored-by: Zhong Hui <zhonghui.net@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for a77ade0 - Browse repository at this point
Copy the full SHA a77ade0View commit details
Commits on May 8, 2021
-
Add raw program meta optimizer (#32597)
* add raw program, test=develop
lilong12 authoredMay 8, 2021 Configuration menu - View commit details
-
Copy full SHA for c1c18b0 - Browse repository at this point
Copy the full SHA c1c18b0View commit details -
[Paddle-TRT]fix trt-converter-fc_op (#32671)
* [Paddle-TRT]fix fc_op * [Paddle-TRT]fix fc_op * [Paddle-TRT]fix fc_op * test_trt_subgraph_pass.py * fix elementwise_op * fix elementwise_op * fix elementwise_op * fix elementwise_op.cc * op_teller.cc
Configuration menu - View commit details
-
Copy full SHA for 62d848d - Browse repository at this point
Copy the full SHA 62d848dView commit details -
Configuration menu - View commit details
-
Copy full SHA for e8e4a9c - Browse repository at this point
Copy the full SHA e8e4a9cView commit details -
Remove np Deprecation Warning since
np.bool
is alias ofbool
(#32798) Remove np Deprecation Warning since `np.bool` is alias of `bool` The warning report from test: ``` 2021-04-30 15:29:32 /workspace/Paddle/build/python/paddle/fluid/framework.py:689: DeprecationWarning: `np.bool` is a deprecated alias for the builtin `bool`. To silence this warning, use `bool` by itself. Doing this will not modify any behavior and is safe. If you specifically wanted the numpy scalar type, use `np.bool_` here. 2021-04-30 15:29:32 Deprecated in NumPy 1.20; for more details and guidance: https://numpy.org/devdocs/release/1.20.0-notes.html#deprecations 2021-04-30 15:29:32 elif dtype == np.bool: 2021-04-30 15:29:32 /workspace/Paddle/build/python/paddle/fluid/layers/utils.py:77: DeprecationWarning: Using or importing the ABCs from 'collections' instead of from 'collections.abc' is deprecated, and in 3.8 it will stop working 2021-04-30 15:29:32 return (isinstance(seq, collections.Sequence) and 2021-04-30 15:29:32 /workspace/Paddle/build/python/paddle/fluid/tests/unittests/test_cond.py:99: DeprecationWarning: `np.bool` is a deprecated alias for the builtin `bool`. To silence this warning, use `bool` by itself. Doing this will not modify any behavior and is safe. If you specifically wanted the numpy scalar type, use `np.bool_` here. ```
Configuration menu - View commit details
-
Copy full SHA for 8a42b1f - Browse repository at this point
Copy the full SHA 8a42b1fView commit details -
[NPU] refine update_loss_scaling npu kernel (#32580)
* refine update_loss_scaling npu kernel * add mutable_data * change Zerolike op to MemcpyAsync * delete useless code * add found_inf_vec * add memcpy if not finite * fix unittest
Configuration menu - View commit details
-
Copy full SHA for 4628b6f - Browse repository at this point
Copy the full SHA 4628b6fView commit details -
Configuration menu - View commit details
-
Copy full SHA for c8affff - Browse repository at this point
Copy the full SHA c8affffView commit details -
【heterps】support cuda11 for heterps; add profiler in oneps (#32640)
* add trainprofiler for heterps in oneps; test=develop * add set_use_ps_gpu; test=develop
Configuration menu - View commit details
-
Copy full SHA for beab956 - Browse repository at this point
Copy the full SHA beab956View commit details
Commits on May 10, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 23ab01e - Browse repository at this point
Copy the full SHA 23ab01eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 92adece - Browse repository at this point
Copy the full SHA 92adeceView commit details -
Configuration menu - View commit details
-
Copy full SHA for fd9a236 - Browse repository at this point
Copy the full SHA fd9a236View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5fc734c - Browse repository at this point
Copy the full SHA 5fc734cView commit details -
Configuration menu - View commit details
-
Copy full SHA for e357cfd - Browse repository at this point
Copy the full SHA e357cfdView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5aa8faa - Browse repository at this point
Copy the full SHA 5aa8faaView commit details -
[pslib] pslib with cmake (#32800)
* pslib with cmake * heter util * vlog * heter server test * add dtor * cmake
Configuration menu - View commit details
-
Copy full SHA for fbbc339 - Browse repository at this point
Copy the full SHA fbbc339View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3419de5 - Browse repository at this point
Copy the full SHA 3419de5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1eb59ef - Browse repository at this point
Copy the full SHA 1eb59efView commit details
Commits on May 11, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 400eb9d - Browse repository at this point
Copy the full SHA 400eb9dView commit details -
Configuration menu - View commit details
-
Copy full SHA for e2c293f - Browse repository at this point
Copy the full SHA e2c293fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6cd96c1 - Browse repository at this point
Copy the full SHA 6cd96c1View commit details -
Configuration menu - View commit details
-
Copy full SHA for ef7e5fc - Browse repository at this point
Copy the full SHA ef7e5fcView commit details -
Configuration menu - View commit details
-
Copy full SHA for e1ea895 - Browse repository at this point
Copy the full SHA e1ea895View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9495211 - Browse repository at this point
Copy the full SHA 9495211View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9cdf6bd - Browse repository at this point
Copy the full SHA 9cdf6bdView commit details -
Configuration menu - View commit details
-
Copy full SHA for a9e5305 - Browse repository at this point
Copy the full SHA a9e5305View commit details -
Support control flow in DataParallel (#32826)
* fix find_unused_parameters default value
Configuration menu - View commit details
-
Copy full SHA for 298f210 - Browse repository at this point
Copy the full SHA 298f210View commit details -
Configuration menu - View commit details
-
Copy full SHA for 93fce18 - Browse repository at this point
Copy the full SHA 93fce18View commit details -
Configuration menu - View commit details
-
Copy full SHA for 84eca16 - Browse repository at this point
Copy the full SHA 84eca16View commit details -
Fix retry error in download when exception occurs (#32816)
* fix retry in download when exception occurs
Configuration menu - View commit details
-
Copy full SHA for 1ef2327 - Browse repository at this point
Copy the full SHA 1ef2327View commit details
Commits on May 12, 2021
-
Polish unittest test_decoupled_py_reader (#32832)
* polish test decoupled py reader * remove bin file
Configuration menu - View commit details
-
Copy full SHA for 2611ed2 - Browse repository at this point
Copy the full SHA 2611ed2View commit details -
add varbasecopy func to fix the ParamBase type bug in layers.to API (#…
…32789) * add varbasecopy func to fix the paraBase type bug in layers.to API * overload _copy_to func for ParamBase * add xpuplace * add waiting varbsecopy completion when not blocking * fix dst_device bug * modify varbase to shared_ptr
Configuration menu - View commit details
-
Copy full SHA for 067f558 - Browse repository at this point
Copy the full SHA 067f558View commit details -
Configuration menu - View commit details
-
Copy full SHA for cefc063 - Browse repository at this point
Copy the full SHA cefc063View commit details -
Configuration menu - View commit details
-
Copy full SHA for e1a4c83 - Browse repository at this point
Copy the full SHA e1a4c83View commit details -
* fix cpp lint * fix save/load with unexpected value * fix save and user interface
Configuration menu - View commit details
-
Copy full SHA for 890f626 - Browse repository at this point
Copy the full SHA 890f626View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6b3bb79 - Browse repository at this point
Copy the full SHA 6b3bb79View commit details -
fix dataloader exit hang when join re-enter (#32827)
* fix dataloader exit hang when join re-enter. test=develop
Configuration menu - View commit details
-
Copy full SHA for f1d6302 - Browse repository at this point
Copy the full SHA f1d6302View commit details -
Configuration menu - View commit details
-
Copy full SHA for 85512d6 - Browse repository at this point
Copy the full SHA 85512d6View commit details -
Fix comments in framework (#32861)
* Fix comments in framework * Update framework.py
Configuration menu - View commit details
-
Copy full SHA for e676360 - Browse repository at this point
Copy the full SHA e676360View commit details -
Configuration menu - View commit details
-
Copy full SHA for eff84a5 - Browse repository at this point
Copy the full SHA eff84a5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 24ffcd0 - Browse repository at this point
Copy the full SHA 24ffcd0View commit details -
Polish Windows CI and open the normal GPU unittest on CI (#32794)
* fix windows CI * fix windows CI
Configuration menu - View commit details
-
Copy full SHA for eeca963 - Browse repository at this point
Copy the full SHA eeca963View commit details
Commits on May 13, 2021
-
Configuration menu - View commit details
-
Copy full SHA for b60ab6b - Browse repository at this point
Copy the full SHA b60ab6bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3e47eee - Browse repository at this point
Copy the full SHA 3e47eeeView commit details -
Configuration menu - View commit details
-
Copy full SHA for c3ae0d4 - Browse repository at this point
Copy the full SHA c3ae0d4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 48fc16f - Browse repository at this point
Copy the full SHA 48fc16fView commit details -
[NPU] support global accumulator for adam (#32780)
* add use_global_beta_pow * add use_global_beta_pow * update npu kernel * update python api * refine code * add ut for use_global_beta_pow * fix npu kernel * add ut for api * add ut for exception * add ut for save/load
Configuration menu - View commit details
-
Copy full SHA for dace3fd - Browse repository at this point
Copy the full SHA dace3fdView commit details -
Configuration menu - View commit details
-
Copy full SHA for a8625aa - Browse repository at this point
Copy the full SHA a8625aaView commit details
Commits on May 14, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 096b2f5 - Browse repository at this point
Copy the full SHA 096b2f5View commit details -
Configuration menu - View commit details
-
Copy full SHA for b035c8b - Browse repository at this point
Copy the full SHA b035c8bView commit details -
Configuration menu - View commit details
-
Copy full SHA for e48091d - Browse repository at this point
Copy the full SHA e48091dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2d9d8f5 - Browse repository at this point
Copy the full SHA 2d9d8f5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6473e27 - Browse repository at this point
Copy the full SHA 6473e27View commit details -
Configuration menu - View commit details
-
Copy full SHA for ed9e772 - Browse repository at this point
Copy the full SHA ed9e772View commit details -
use the
required
instruction to determine if the environment fits t……he sample code's required. (#32766) * add unittests * add find_last_future_line_end * extract_code_blocks_from_docstr and its testcases * test_codeblock_before_examples_is_ignored * sampcd_extract_to_file 拆为两步 * update the codeblock element's format * code-block directive has no value options * insert the CODES_INTERTED_INTO_FRONTEND * using the new func insert_codes_into_codeblock * get_test_capacity and is_required_match * using the new funcitons in sampcd_extract_to_file * add some comments and refactor functions * using logger instead of all the print * remote wlist * collect summary info, and print them * call get capacity * update summary format * print the apis that don't have sample codes. * print the samples the consumed time more than 10s. print time * update unittest testcases * solve ResourceWarning: unclosed file * run tools test seperately * python2 does not have nonlocal keyword, using dict variable instead * remove unused import, rearrange a series of conditional statements. * remove wlist.json and its check approval * remove wlist.json and its check approval
Configuration menu - View commit details
-
Copy full SHA for 42aad30 - Browse repository at this point
Copy the full SHA 42aad30View commit details -
Configuration menu - View commit details
-
Copy full SHA for 479689f - Browse repository at this point
Copy the full SHA 479689fView commit details -
Fix four error messages (#32899)
* fix two error message * fix two error message * fix error * fix error * fix error * fix error * fix some error message * fix some error * fix error * fix some error * fix some error * fix some error * fix one error * fix some error * fix seven error message * fix error * fix error * fix error * fix error * fix some error message * fix error * fix some error * fix some error * fix four error message * fix error * fix error
Configuration menu - View commit details
-
Copy full SHA for c4787d7 - Browse repository at this point
Copy the full SHA c4787d7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 15b05c7 - Browse repository at this point
Copy the full SHA 15b05c7View commit details -
Doc of paddle.save/load (#32900)
* doc of paddle.save/load * polish doc of paddle.save/load
Configuration menu - View commit details
-
Copy full SHA for 62f6550 - Browse repository at this point
Copy the full SHA 62f6550View commit details -
Configuration menu - View commit details
-
Copy full SHA for e89fb25 - Browse repository at this point
Copy the full SHA e89fb25View commit details -
Configuration menu - View commit details
-
Copy full SHA for a3debea - Browse repository at this point
Copy the full SHA a3debeaView commit details
Commits on May 17, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 585564d - Browse repository at this point
Copy the full SHA 585564dView commit details -
BugFix with ParseInputDataType from LodTensorArray (#32918)
* BugFix with ParseInputDataType from LodTensorArray * BugFix with ParseInputDataType from LodTensorArray
Configuration menu - View commit details
-
Copy full SHA for 5f1c07d - Browse repository at this point
Copy the full SHA 5f1c07dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 906db71 - Browse repository at this point
Copy the full SHA 906db71View commit details -
[HybridParallel]Fix precision problem of model parallel (#32897)
* fix precision of mp * fix bug of seed * fix dp * print group
Configuration menu - View commit details
-
Copy full SHA for c809530 - Browse repository at this point
Copy the full SHA c809530View commit details
Commits on May 18, 2021
-
fix the paddle compare op for the broadcast when the element equal (#…
…32941) * fix the paddle compare op for the broadcast * fix compare op in for in the cuda device
Configuration menu - View commit details
-
Copy full SHA for c72ed82 - Browse repository at this point
Copy the full SHA c72ed82View commit details -
Configuration menu - View commit details
-
Copy full SHA for 59997d5 - Browse repository at this point
Copy the full SHA 59997d5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 29bbeb0 - Browse repository at this point
Copy the full SHA 29bbeb0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 59b74ee - Browse repository at this point
Copy the full SHA 59b74eeView commit details -
[Dy2Static] Refactor param_guard logic of @to_static (#32867)
* Add param_guard in ParameterList to support @to_static * Refactor param_guard of @to_static * fix unittest failed * add more unittest
Configuration menu - View commit details
-
Copy full SHA for b8d493d - Browse repository at this point
Copy the full SHA b8d493dView commit details -
[UnitTest]Enhance grep syntax to avoid random failed of test_dist_mni…
…st_dgc_nccl (#32946) * Enhance grep syntax to avoid random failed * Enhance grep syntax to avoid random failed
Configuration menu - View commit details
-
Copy full SHA for b5882c6 - Browse repository at this point
Copy the full SHA b5882c6View commit details -
Configuration menu - View commit details
-
Copy full SHA for bcd40f2 - Browse repository at this point
Copy the full SHA bcd40f2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 53580bb - Browse repository at this point
Copy the full SHA 53580bbView commit details -
Update paths to Quant models (#32870)
* Update paths to Quant models * Update description
Configuration menu - View commit details
-
Copy full SHA for 5d62748 - Browse repository at this point
Copy the full SHA 5d62748View commit details -
[NPU] fix accuracy npu op bug and change top_k's output to int64 (#32935
Configuration menu - View commit details
-
Copy full SHA for c66586b - Browse repository at this point
Copy the full SHA c66586bView commit details
Commits on May 19, 2021
-
Configuration menu - View commit details
-
Copy full SHA for d7d7fae - Browse repository at this point
Copy the full SHA d7d7faeView commit details -
Configuration menu - View commit details
-
Copy full SHA for af89a94 - Browse repository at this point
Copy the full SHA af89a94View commit details -
Configuration menu - View commit details
-
Copy full SHA for 56008aa - Browse repository at this point
Copy the full SHA 56008aaView commit details -
[Dy2Stat]BugFix StaticAanlysis with gast.Subscript (#32969)
* BugFix StaticAanlysis with gast.Subscript * remove codes
Configuration menu - View commit details
-
Copy full SHA for c285261 - Browse repository at this point
Copy the full SHA c285261View commit details -
remove ut from parallel_ut list (#32788)
* remove ut from parallel_ut list * remove some timeout ut
Configuration menu - View commit details
-
Copy full SHA for f0b2f59 - Browse repository at this point
Copy the full SHA f0b2f59View commit details -
Optimize 102Flowers dataset reading speed (#31408)
* Fix slow data reading, In the old version, one epoch read time of this data set was about 5371 seconds(MacBook Pro Retina, 13-inch, Early 2015 2.7 GHz), and a batch took 211 seconds, It's too painful to use. Now decompress the data in advance (about 10 seconds). Each epoch of reading takes about 3 seconds(MacBook Pro Retina, 13-inch, Early 2015 2.7 GHz), and a batch takes 0.017 seconds more. * Run CI, test=allcase * fix qq group number. test=document_fix fix qq group number. test=document_fix * fix qq group number. test=document_fix fix qq group number. test=document_fix
Configuration menu - View commit details
-
Copy full SHA for 67c2700 - Browse repository at this point
Copy the full SHA 67c2700View commit details -
[Rocm] fix test of random_crop_op & logsumexp (#32824)
* [Rocm] fix test of random_crop_op * [Rocm] fix test of random_crop_op * [Rocm] fix test of random_crop_op & simple_rnn_op * [Rocm] fix test of random_crop_op & simple_rnn_op & logsumexp * [Rocm] fix test of random_crop_op & simple_rnn_op & logsumexp * [Rocm] fix test of random_crop_op & simple_rnn_op & logsumexp * [Rocm] fix test of random_crop_op & logsumexp
Configuration menu - View commit details
-
Copy full SHA for aa4a56f - Browse repository at this point
Copy the full SHA aa4a56fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1e1600e - Browse repository at this point
Copy the full SHA 1e1600eView commit details -
CI skip inference test if only python files modified (#32962)
* CI skip inference test if only python files modified, test=develop * fix compilation error on ROCM, test=develop * fix cmake error on PR-CI-ROCM-Compile, test=develop
Configuration menu - View commit details
-
Copy full SHA for 7896b51 - Browse repository at this point
Copy the full SHA 7896b51View commit details -
fix test_paddle_save_load and test_paddle_save_load_binary (#32949)
* fix test_paddle_save_load and test_paddle_save_load_binary * fix unittest:test_paddle_save_load and test_paddle_save_load_binary * delete *.pyc * add comment for unittest
Configuration menu - View commit details
-
Copy full SHA for 6f8de31 - Browse repository at this point
Copy the full SHA 6f8de31View commit details
Commits on May 20, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 1494952 - Browse repository at this point
Copy the full SHA 1494952View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7e27b5a - Browse repository at this point
Copy the full SHA 7e27b5aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8854786 - Browse repository at this point
Copy the full SHA 8854786View commit details -
Add complex template type (#32857)
* add complex template file * add numtraits for complex template * add complex template type register * modify specify template of complex * modify specify template of complex * modify specify template of complex * modify specify template of complex * make TensorCheckerVisitor support complex type * fix operator= error * add complex template * add complex template type * add complex template type to pyarray transform * add complex template type to pyarray transform * remove complex type for dlpack register * set dlpack supprot complex type * set dlpack supprot complex type * set dlpack supprot complex type * remove explict for complex constructor * add complex unit test file
Configuration menu - View commit details
-
Copy full SHA for 738bf20 - Browse repository at this point
Copy the full SHA 738bf20View commit details -
Configuration menu - View commit details
-
Copy full SHA for 848cabf - Browse repository at this point
Copy the full SHA 848cabfView commit details -
[Dy2Stat]Support convert sublayers in Sequential Container (#32978)
* Support convert sublayers in Sequential Container * remove paddle.jit.set_code_level
Configuration menu - View commit details
-
Copy full SHA for e409c7c - Browse repository at this point
Copy the full SHA e409c7cView commit details -
Configuration menu - View commit details
-
Copy full SHA for be8e94a - Browse repository at this point
Copy the full SHA be8e94aView commit details -
fix gather op and add logsumexp op on kunlun (#32931)
* fix gather op and add logsumexp op on kunlun * update xpu depence * update tests and fix elementwise_add
Configuration menu - View commit details
-
Copy full SHA for a96e8bc - Browse repository at this point
Copy the full SHA a96e8bcView commit details
Commits on May 21, 2021
-
[NPU] cast indices and label if their type is not consistent in accur…
…acy npu op (#33016) * cast indices and label if their type is not consistent * fix bug * add unittest
Configuration menu - View commit details
-
Copy full SHA for 70dc5f4 - Browse repository at this point
Copy the full SHA 70dc5f4View commit details -
update conda build script for cuda11 (#29594)
* update conda build script for cuda11 * update conda build script * modified wheel name * update conda_build * fix error * add cudnn8.1 for cuda11.2 * fix format error
Configuration menu - View commit details
-
Copy full SHA for 44668a7 - Browse repository at this point
Copy the full SHA 44668a7View commit details -
Configuration menu - View commit details
-
Copy full SHA for a85eddd - Browse repository at this point
Copy the full SHA a85edddView commit details -
fix model_benchmark ci (#33035)
* fix model_bhecnmark ci * fix model_bhecnmark ci
Configuration menu - View commit details
-
Copy full SHA for 0e5d832 - Browse repository at this point
Copy the full SHA 0e5d832View commit details -
optimize softmax with cross entropy hard label (#32290)
* optimize softmax with cross entropy hard label * label ignore_index cleaning
Configuration menu - View commit details
-
Copy full SHA for 7be6191 - Browse repository at this point
Copy the full SHA 7be6191View commit details -
Configuration menu - View commit details
-
Copy full SHA for 79ed717 - Browse repository at this point
Copy the full SHA 79ed717View commit details -
replace complex64/128 with complex template in cast Op (#33019)
* replace complex in set tensor from and to numpy * replace complex template in cast op
Configuration menu - View commit details
-
Copy full SHA for 79d918d - Browse repository at this point
Copy the full SHA 79d918dView commit details
Commits on May 22, 2021
-
Added oneDNN matmul grad BF16/FP32 kernel (#32968)
* added support for most matmul cases * added more functionality * full functionality of matmul op, fp32 only * added bf16 tests and functionality * added formatting * changes after review * minor change * added reviewers suggestions
Configuration menu - View commit details
-
Copy full SHA for e2a3a6f - Browse repository at this point
Copy the full SHA e2a3a6fView commit details -
Configuration menu - View commit details
-
Copy full SHA for a6dc68b - Browse repository at this point
Copy the full SHA a6dc68bView commit details
Commits on May 24, 2021
-
Configuration menu - View commit details
-
Copy full SHA for d6aea4a - Browse repository at this point
Copy the full SHA d6aea4aView commit details -
fix potential overflow problem & node add & node remove & node clear (#…
…33055) * graph engine demo * upload unsaved changes * fix dependency error * fix shard_num problem * py client * remove lock and graph-type * add load direct graph * add load direct graph * add load direct graph * batch random_sample * batch_sample_k * fix num_nodes size * batch brpc * batch brpc * add test * add test * add load_nodes; change add_node function * change sample return type to pair * resolve conflict * resolved conflict * resolved conflict * separate server and client * merge pair type * fix * resolved conflict * fixed segment fault; high-level VLOG for load edges and load nodes * random_sample return 0 * rm useless loop * test:load edge * fix ret -1 * test: rm sample * rm sample * random_sample return future * random_sample return int * test fake node * fixed here * memory leak * remove test code * fix return problem * add common_graph_table * random sample node &test & change data-structure from linkedList to vector * add common_graph_table * sample with srand * add node_types * optimize nodes sample * recover test * random sample * destruct weighted sampler * GraphEdgeBlob * WeightedGraphEdgeBlob to GraphEdgeBlob * WeightedGraphEdgeBlob to GraphEdgeBlob * pybind sample nodes api * pull nodes with step * fixed pull_graph_list bug; add test for pull_graph_list by step * add graph table;name * add graph table;name * add pybind * add pybind * add FeatureNode * add FeatureNode * add FeatureNode Serialize * add FeatureNode Serialize * get_feat_node * avoid local rpc * fix get_node_feat * fix get_node_feat * remove log * get_node_feat return py:bytes * merge develop with graph_engine * fix threadpool.h head * fix * fix typo * resolve conflict * fix conflict * recover lost content * fix pybind of FeatureNode * recover cmake * recover tools * resolve conflict * resolve linking problem * code style * change test_server port * fix code problems * remove shard_num config * remove redundent threads * optimize start server * remove logs * fix code problems by reviewers' suggestions * move graph files into a folder * code style change * remove graph operations from base table * optimize get_feat function of graph engine * fix long long count problem * remove redandunt graph files * remove unused shell * recover dropout_op_pass.h * fix potential stack overflow when request number is too large & node add & node clear & node remove Co-authored-by: Huang Zhengjie <270018958@qq.com> Co-authored-by: Weiyue Su <weiyue.su@gmail.com> Co-authored-by: suweiyue <suweiyue@baidu.com> Co-authored-by: luobin06 <luobin06@baidu.com> Co-authored-by: liweibin02 <liweibin02@baidu.com> Co-authored-by: tangwei12 <tangwei12@baidu.com>
Configuration menu - View commit details
-
Copy full SHA for 60ac160 - Browse repository at this point
Copy the full SHA 60ac160View commit details -
Configuration menu - View commit details
-
Copy full SHA for d0d5586 - Browse repository at this point
Copy the full SHA d0d5586View commit details -
[oneDNN] bump up oneDNN to 2.2.2 (#32685)
* - bump up oneDNN to 2.2.2 (should reduce perf drops of mobilenet) * - more recnet onednn 2.2.2 (some more bugfixes)
Configuration menu - View commit details
-
Copy full SHA for b8e4ec7 - Browse repository at this point
Copy the full SHA b8e4ec7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 99a11e3 - Browse repository at this point
Copy the full SHA 99a11e3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6ad5ece - Browse repository at this point
Copy the full SHA 6ad5eceView commit details -
[HybridParallel]Fix pipeline in dygraph (#33007)
* fix pipeline * fix mp pp dp * fix utest of hybrid parallel * add utest for tuple
Configuration menu - View commit details
-
Copy full SHA for 4920c47 - Browse repository at this point
Copy the full SHA 4920c47View commit details
Commits on May 25, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 88b43b5 - Browse repository at this point
Copy the full SHA 88b43b5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 86ea8dc - Browse repository at this point
Copy the full SHA 86ea8dcView commit details -
modify Ops to complex template (#33041)
* modify conj, real, imag OP to complex template * replace with complex template to dot Op * replace with complex template to Abs Op * add support for complex64 and complex128
Configuration menu - View commit details
-
Copy full SHA for 5fa44c3 - Browse repository at this point
Copy the full SHA 5fa44c3View commit details