update #4
Commits on Apr 19, 2021
-
update get_api_md5, using the real api name as the map's key (#32224)
* get_api_md5 should prefer to use the real name rather than the alias names * case for ArgSpec style. update the unittests test=document_fix
Commit 21dc044
-
Commit 76cb83e
-
* fix sublayer error with include_sublayers=False * add ut * refactor include_sublayers related api * fix ut * fix ut of transformer * fix ut of transformer * remove useless code * change sublayer api * polish code * add test for include_self=True
Commit 4d69eea
-
[Hybrid Parallel] Support dp & mp in dygraph (#32323)
* support dp & mp
Commit ffd4086
-
[NPU] cherry-pick gc/dataloader/save&load/optimization from ascendrc …
…to develop (#32294) * [NPU] support GarbageCollector for npu (#31874) * support GarbageCollector for npu * fix typo * fix gather_grad * disable NPUDefaultStreamGarbageCollector on NPU * [NPU] support npu for memcpy op (#31808) * support npu for memcpy op * add ut * fix ut * fix typo * 【NPU】fix bug of using temp vector (#31963) * fix bug when beta1_pow on cpu (#31995) * [NPU] support npu profiler (#31684) * support npu profiler * add python api * fix bugs * add wrapper for incomplete type * update profile proto * record npu wait * add xpu placeholder * fix adam (#32016) * [NPU] enable async copy and add wait before sync operation (#31956) * enable async copy and add wait before sync operation * remove unneccessary wait * add FillNpuTensorWithConstant * refine * fix fill_constant * make TensorFromVector/TensorToVector sync * [NPU] Support dataloader on npu place. (#31867) * [NPU] Wait on NPUPlace (#32086) * [NPU] fix cast op (#32121) * fix npu kernel of cast op to handle casting to same dtype * add comments * [NPU] support cann 20.3 (#32044) * fix compile problem on cann 20.3 * fix ut * fix test_mul * fix check_finite_and_scale * fix lookup_table_v2_grad * fix cmake * support print op * [NPU] Support npu save load (#31893) * support save load for NPU * add save load npu unittest * support np.array transform in NPU * fix errors * delete dygraph in unittest * add Wait * fix unittest * fix review comment * fix unittest problem * fix little problem * change aclrtSynchronizeDevice to aclrtSynchronizeStream for better performance (#32196) * change aclrtSynchronizeDevice to aclrtSynchronizeStream for better performace * refine code * fix NPUDeviceContext in all c++ unittest (#32198) * fix NPUDeviceContext in all c++ unittest * refine log Co-authored-by: pangyoki <pangyoki@126.com> * [NPU] Remove TensorFromVector and avoid sync copy in npu op kernel for better performance (#31994) * enable async copy and add wait before sync operation * remove unneccessary wait * add 
FillNpuTensorWithConstant * refine * fix fill_constant * change TensorFromVector to FillNpuTensorWithConstant * fix ignored api * delete extra unittest * fix little error * fix update_loss_scaling_op_npu and check_finite_and_unscale_op_npu * change TensorCopySync to TensorCopy * delete useless Wait and add StreamWait * fix npu_stream error * fix check_finite_and_unscale_op_npu TensorCopy * only save stream wait * fix NPUDeviceContext in all c++ unittest * delete wait Co-authored-by: zhiqiu <chenqiuliang@baidu.com> * delete useless unittest file (#32206) * Fix op test (#32231) * fix conditional block (#32243) * fix adam bug again (#32246) * fix compile * fix ut * fix ut Co-authored-by: liym27 <33742067+liym27@users.noreply.github.com> Co-authored-by: pangyoki <pangyoki@126.com>
Commit cbe5c9f
-
add npu check nan and inf (#32340)
Commit 1e3a94b
Commits on Apr 20, 2021
-
Commit f0cc188
-
Commit 43926c8
-
fix the bug that the error message is not displayed on mac ci (#32367)
* test for mac task,notest,test=mac_py3 * fix the bug that the error message is not displayed
Commit 0dd28b8
-
[heterps] optimize build task (#32358)
* build task cost * return pool
Commit c09d645
-
Commit f6f59e5
-
Commit e0a52fd
-
[Sharding]: update config DOC (#32299)
* sharding: update config DOC * update pipeline config * sharding update doc
Commit e348901
-
add paddle.nn.unfold #32297 (#32298)
* add paddle.nn.unfold * update Parameters of Unfold
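An unfold operation extracts sliding local blocks from a batched image tensor (often called im2col). The output size follows the usual sliding-window arithmetic, which the minimal sketch below illustrates. This is not Paddle's code: the helper name `unfold_output_shape`, the symmetric `padding` argument, and the single shared `stride` and `dilation` are assumptions made for illustration only.

```python
def unfold_output_shape(n, c, h, w, kernel, stride=1, padding=0, dilation=1):
    """Return (N, C * kh * kw, L) for an im2col-style unfold.

    Hypothetical helper: assumes the same stride, padding and dilation
    are applied to both spatial dimensions.
    """
    kh, kw = kernel

    def blocks(size, k):
        # Number of window positions along one spatial dimension.
        effective_kernel = dilation * (k - 1) + 1
        return (size + 2 * padding - effective_kernel) // stride + 1

    return (n, c * kh * kw, blocks(h, kh) * blocks(w, kw))


# A 1x3x4x4 input with a 2x2 window and stride 1 has 3 * 3 = 9 positions.
shape = unfold_output_shape(1, 3, 4, 4, kernel=(2, 2))
```

Each column of the result holds one flattened window, so the middle dimension is the channel count times the window area.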
Commit 186682f
-
[Optimize]SparseKV speedup and memory save (#32048)
Change-Id: Ie35a09772e46f7d90cb68ca82c1d18b9201d1abe * large scale kv store optimize Change-Id: I582cc661afdaa20749ec7493eae1b88c32b967f7 * replace std::unordered_map with roundrobin map Change-Id: I48ee0efef38853876c92d982cdfcac6603c52c88 * remove license * fix cpp lint Change-Id: Ia21fafa65adc09bb9094f7dbc987e31d5af2686e
Commit 5e7e7c9
Commits on Apr 21, 2021
-
remove fluid for auto_checkpoint. (#32157)
* remove fluid for auto_checkpoint. * fix bug.
Commit 1593ee2
-
Commit ead8342
-
add retry on gcda_clean.py (#32318)
* add retry on gcda_clean.py * add exit code for paddle_coverage.sh * fix format error * fix format error
Commit 229f930
-
Commit a2cbbe8
-
Commit 4898c38
-
Commit 5d19f8d
-
flush denormal in the tracer op, test=develop (#32350)
* flush denormal in the tracer op, test=develop * add cmake dependencies, test=develop * add a macro, test=develop * fix the windows case, test=develop
Commit 9ff8556
-
[Kunlun] add collective ops for multi XPU cards training and add Kunlun multi XPU cards CI (#32302)
Commit 2194ad1
-
remove thrust include files (#32395)
* remove thrust includes, test=develop * fix compilation error, test=develop * fix compilation of truncated_gaussian_random_op, test=develop
Commit ab6f874
-
[NPU] register npu finalize on exit (#32390)
* [NPU] register finalize on exit * fix
Commit 8e4c193
-
optimize get-feat function of graph engine (#32261)
* graph engine demo * upload unsaved changes * fix dependency error * fix shard_num problem * py client * remove lock and graph-type * add load direct graph * add load direct graph * add load direct graph * batch random_sample * batch_sample_k * fix num_nodes size * batch brpc * batch brpc * add test * add test * add load_nodes; change add_node function * change sample return type to pair * resolve conflict * resolved conflict * resolved conflict * separate server and client * merge pair type * fix * resolved conflict * fixed segment fault; high-level VLOG for load edges and load nodes * random_sample return 0 * rm useless loop * test:load edge * fix ret -1 * test: rm sample * rm sample * random_sample return future * random_sample return int * test fake node * fixed here * memory leak * remove test code * fix return problem * add common_graph_table * random sample node &test & change data-structure from linkedList to vector * add common_graph_table * sample with srand * add node_types * optimize nodes sample * recover test * random sample * destruct weighted sampler * GraphEdgeBlob * WeightedGraphEdgeBlob to GraphEdgeBlob * WeightedGraphEdgeBlob to GraphEdgeBlob * pybind sample nodes api * pull nodes with step * fixed pull_graph_list bug; add test for pull_graph_list by step * add graph table;name * add graph table;name * add pybind * add pybind * add FeatureNode * add FeatureNode * add FeatureNode Serialize * add FeatureNode Serialize * get_feat_node * avoid local rpc * fix get_node_feat * fix get_node_feat * remove log * get_node_feat return py:bytes * merge develop with graph_engine * fix threadpool.h head * fix * fix typo * resolve conflict * fix conflict * recover lost content * fix pybind of FeatureNode * recover cmake * recover tools * resolve conflict * resolve linking problem * code style * change test_server port * fix code problems * remove shard_num config * remove redundent threads * optimize start server * remove logs * fix code problems by 
reviewers' suggestions * move graph files into a folder * code style change * remove graph operations from base table * optimize get_feat function of graph engine Co-authored-by: Huang Zhengjie <270018958@qq.com> Co-authored-by: Weiyue Su <weiyue.su@gmail.com> Co-authored-by: suweiyue <suweiyue@baidu.com> Co-authored-by: luobin06 <luobin06@baidu.com> Co-authored-by: liweibin02 <liweibin02@baidu.com> Co-authored-by: tangwei12 <tangwei12@baidu.com>
Commit 2b68d20
-
Commit 37bb334
-
Commit 3da2c7f
-
Commit 661a1f6
-
Commit 7bae5e9
-
Commit 4be3b05
-
Commit bc90916
-
【NPU】Merge NPU ccl code (#32381)
* add allreduce and broadcast without test (#31024) add allreduce and broadcast without test * Refactor HCCLCommContext to be compatible with Paddle (#31359) Refactor HCCLCommContext to be compatible with Paddle (#31359) * [NPU] add npu kernel for communication op (#31437) * add allreduce and broadcast without test * add c_broadcast_test case * build c_comm_init and c_create_group operators * make the whole thing compile * add broadcast and init op test case but run failed * make unit test compile * fix broadcast test bug and change into hcom for ccl * change c_comm_init and c_create_group ops accordingly * make tests compile * transfer code to 27 * compiled successfully in 28, but run failed * test broadcast in 28, but failed * make hcom primitives work * change hccl data type for base.h * fix broadcast bug * make attributes work * fix group name bug * add allreduce but test failed * allreduce bug for qiuliang * allreduce finished * add allgather and reducescatter * merge all op code * add allgather test * finish run all ccl op test exclude send/recv * all all op and test exclude send/recv * send_v2_npu.cc recv_v2_npiu.cc compiled * fix ccl core dump bug and test allgather, reducescatter, broadcast op * fix allreduce bug just for test * hcom send&recv test pass, without hcom_destroy * for qiuliang test * Ascend Send&Recv Test Pass * all op (ex send/recv) ok * fix bug * merge all ccl op * style merge to PaddlePaddle * merge style * new merge style * merge style 2 * insert an empty at the end * disable ctest for hcom to pass ci Co-authored-by: void-main <voidmain1313113@gmail.com> Co-authored-by: f2hkop <f2huestc@outlook.com> * Add auto-increasing tag id for Hcom OPs (#31702) * add c_reduce_sum op (#31793) add c_reduce_sum op * update Ascendrc hccl to 20.3 (#32126) update Ascendrc hccl to 20.3 (#32126) * fix merge code * change cmake.txt1 * [NPU] Support npu kernel for c sync stream op (#31386) * sync stream npu op * add with_ascend_acl * update c++ unittest * 
compile all failed * try to pre commit * after pre commit * merge&compile&test hccl successfully! * fix code style * fix code style * fix bugs about hccl * fix some bugs * fix code style * fix style * fix style * fix * fixed * merge develop Co-authored-by: lw921014 <liuwei921014@yeah.net> Co-authored-by: Void Main <voidmain1313113@gmail.com> Co-authored-by: f2hkop <f2huestc@outlook.com> Co-authored-by: xiayanming <41795079@qq.com>
Commit c315852
-
[HotFix] Add support for optimizer with varbase input (#32362)
* add support for optimizer with varbase input * refine cond * fix failed unittest * add test for coverage
Commit b47dd15
-
Commit bf0ec9b
Commits on Apr 22, 2021
-
Commit e58c705
-
support save/load binary format tensor. (#32211)
* support save/load binary format tensor * Fix error when create cudaplace * Fix error when create cudaplace * Fix error when create cudaplace * get device context from pool. * move define of 'SerializeToStream' and 'DeserializeFromStream' to 'lod_tensor.cc' and 'selected_rows.cc'. * improve coverage. * improve coverage. * polish API * deal with conflict * disable save/load large file in unittest * split unittest.
Commit f4d9adc
-
* graph engine demo * upload unsaved changes * fix dependency error * fix shard_num problem * py client * remove lock and graph-type * add load direct graph * add load direct graph * add load direct graph * batch random_sample * batch_sample_k * fix num_nodes size * batch brpc * batch brpc * add test * add test * add load_nodes; change add_node function * change sample return type to pair * resolve conflict * resolved conflict * resolved conflict * separate server and client * merge pair type * fix * resolved conflict * fixed segment fault; high-level VLOG for load edges and load nodes * random_sample return 0 * rm useless loop * test:load edge * fix ret -1 * test: rm sample * rm sample * random_sample return future * random_sample return int * test fake node * fixed here * memory leak * remove test code * fix return problem * add common_graph_table * random sample node &test & change data-structure from linkedList to vector * add common_graph_table * sample with srand * add node_types * optimize nodes sample * recover test * random sample * destruct weighted sampler * GraphEdgeBlob * WeightedGraphEdgeBlob to GraphEdgeBlob * WeightedGraphEdgeBlob to GraphEdgeBlob * pybind sample nodes api * pull nodes with step * fixed pull_graph_list bug; add test for pull_graph_list by step * add graph table;name * add graph table;name * add pybind * add pybind * add FeatureNode * add FeatureNode * add FeatureNode Serialize * add FeatureNode Serialize * get_feat_node * avoid local rpc * fix get_node_feat * fix get_node_feat * remove log * get_node_feat return py:bytes * merge develop with graph_engine * fix threadpool.h head * fix * fix typo * resolve conflict * fix conflict * recover lost content * fix pybind of FeatureNode * recover cmake * recover tools * resolve conflict * resolve linking problem * code style * change test_server port * fix code problems * remove shard_num config * remove redundent threads * optimize start server * remove logs * fix code problems by 
reviewers' suggestions * move graph files into a folder * code style change * remove graph operations from base table * optimize get_feat function of graph engine * fix long long count problem Co-authored-by: Huang Zhengjie <270018958@qq.com> Co-authored-by: Weiyue Su <weiyue.su@gmail.com> Co-authored-by: suweiyue <suweiyue@baidu.com> Co-authored-by: luobin06 <luobin06@baidu.com> Co-authored-by: liweibin02 <liweibin02@baidu.com> Co-authored-by: tangwei12 <tangwei12@baidu.com>
Commit 73d0b0e
-
Commit e727820
-
Commit b2ee838
-
[HybridParallel] Add ClipGradByGlobalNorm & check_finite_and_unscale in Dygraph (#32354)
* add clip/check * add amp & clip grad in dygraph * add logging
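ClipGradByGlobalNorm refers to global-norm gradient clipping: all gradients are rescaled together so that their joint L2 norm does not exceed a threshold. The sketch below shows the technique in plain Python and is not the code from this commit. The function name `clip_by_global_norm` and the list-of-lists gradient layout are assumptions for illustration, and the hybrid-parallel aspects (combining norms across ranks) are left out.

```python
import math


def clip_by_global_norm(grads, clip_norm):
    """Rescale every gradient so the joint L2 norm is at most clip_norm.

    Hypothetical helper: `grads` is a list of flat lists of floats.
    Returns the rescaled gradients and the norm measured before clipping.
    """
    global_norm = math.sqrt(sum(g * g for grad in grads for g in grad))
    # When the norm is already within the limit, the scale is exactly 1.0.
    scale = clip_norm / max(global_norm, clip_norm)
    return [[g * scale for g in grad] for grad in grads], global_norm


# Two one-element gradients with joint norm sqrt(3^2 + 4^2) = 5.
clipped, norm = clip_by_global_norm([[3.0], [4.0]], clip_norm=1.0)
```

Because a single scale factor is applied to every gradient, the direction of the overall update is preserved; only its magnitude changes.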
Commit 7ea999f
-
Commit bec4b16
-
modify conv2d_transpose docs (#32410)
* modify conv2d_transpose docs
Commit 1064f2b
-
Commit 890d6bc
-
import sequence_* API to new namespace (#32089)
* import sequence_* API to new namespace * fix typos, remove alias marking * update sample code * fix sample code * fix docstring for sequence_mask
Feiyu Chan authored Apr 22, 2021
Commit f12c943
-
Commit d03b0b1
-
Commit c481570
-
Commit a1a527f
-
support int32 and int64 kernel for clip operator (#32373)
support int32 and int64 kernel for clip operator
Commit c332828
-
Commit f8ca5a9
Commits on Apr 23, 2021
-
Fix seven error message (#32397)
* fix two error message * fix two error message * fix error * fix error * fix error * fix error * fix some error message * fix some error * fix error * fix some error * fix some error * fix some error * fix one error * fix some error * fix seven error message * fix error * fix error * fix error * fix error
Commit 203ac4f
-
Commit 49773f3
-
Commit 7879477
-
Commit 1dc8393
-
Commit 51bcd97
-
Commit b6f8ccd
-
Commit 2b108a0
-
solve hccl communicate conflict (#32447)
Commit 0e74eea
-
fix Windows CI MP compile and environment install script and openblas CI (#32378)
* fix Windows CI MP compile and environment install script * clear Windows CI environment * clear Windows CI environment * clear Windows CI environment
Commit 7a681f0
-
Commit 1b83de2
-
move semantic checks to op_teller (#32279)
* move semantic checks to op_teller * more ops * more ops * revert block related change * part1 * revert activation * remove if * remove const_cast * resolve conflict * remove const_cast * delete useless var * replace vlog(1) with vlog(3), replace assert with PADDLE_ENFORCE * down to 19 files
Commit 7c38114
-
Commit a01b510
-
[NPU] refactor check_finite_and_scale npu kernel (#32407)
* refactor_check_finite_and_scale_npu_kernel * fix compile * add alloc_float_status op * add alloc_float_status op * add FloatStatus for check_finite_and_unscale * refine code * remove unnecessary logic * refine for fleet
Commit 39a59dc
-
Polish ParallelExecutor constructor into small functions (#32191)
* Refine Constructor logic of ParallelExecutor * refine function name * refine code comment
Commit faa8c70
-
Ut test conv3d op timeout (#32216)
* remove ut from parallel_ut_rule caused by timeout * remove timeout ut from parallel_ut_rule file * move convert_model2dot_ernie to TWO_PARALLEL_JOB list
Commit de94743
-
Commit 8fa8a37
-
[CustomOp] Remove useless extension headers for old custom op (#32463)
* remove useless ext headers * fix boost header compile failed
Commit 7d4998a
Commits on Apr 24, 2021
-
Commit 8beb170
-
Fix test_yolov3 Random Failure (#32496)
Reduce max iter size to fix windows openblas test_yolov3 random failure. Decrease batch size to fix pe related unittest random failure.
Commit 9bf9092
-
Commit 18d3e2c
-
Commit f8caa58
-
print the real name for Functions instead of the ArgSpec (#32379)
* print the real name for Functions instead of the ArgSpec class function method * some APIs' names are not __module__ + __name__, so we discard them temporarily. * update the logging format for console * omit the top level of paddle package. * these APIs have been removed. test=document_fix * Another error occurred * print_signatures.py's stdout is redirected to the spec file, so it should not print any other info. so sad. * print the error msg to stderr * disable the __init__ magic method * update unittest for sampcd_processor.py update unittest for sampcd_processor.py * PR-CI-APPROVAL's python interpreter name is not 'python3'. it's a python3.9; it does not have paddle installed yet. Surprisingly, this statement cannot be used in the CI pipeline; it reports that python3 cannot be found. Surprisingly, this statement cannot be used in the CI pipeline, because the environment does not have paddle installed. * testing only extract api from __all__ paddle module (the top module) does not have __add__ test=document_fix * should import paddle here * update the mechanism of extracting and executing for the sample-codes test. Update the logic for extracting and executing code; improve the printed output * good code style
Commit ef8671e
Commits on Apr 25, 2021
-
* Add dlnne engine runtime * Fix log * Remove <const_cast> and remove unrelated modify with dlnne, +clang-format * Fix CMakeList format error * Add copyright message * Fix dlnne CMakeList.txt * Add some paddlepaddle_pass to support more networks * Fix some format bug
Commit feb2e47
-
Commit 83580ee
-
Commit 136ef09
-
[NPU] refine lookup_table_v2_grad npu_kernel (#32497)
* use ZerosLike instead of NPUMemsetAsync * fix compile
Commit fb7590d
-
Commit 4db2cc9
-
Commit 3b61d06
-
let paddle.utils.install_check support CPU package with GPU device (#32428)
* let paddle.utils.install_check support CPU package with GPU device * use use_cuda in dygraph checking * add unittest for install_check
Commit 06276f4
-
* fix tc trt shape * fix fc dynamic shape * add fc shape assert * update
Commit f272e59
-
[BUG FIX] when x.dim < y.dim, the result of compare_op is inverse (#32470)
* fix bug: when x.dim < y.dim, the result of compare_op is inverse to expected result * support the cuda for fix the compare broadcast bug
Commit 78eff52
-
Commit 976fe6f
-
[HybridParallel] Add pipeline layer in dygraph (#32449)
* add pipeline layer
Commit 7ef1de6
-
Commit 2f351ed
-
Commit 3b4dcad
-
Commit 7a4cbb3
-
Commit 486946a
-
Commit 92dc9b2
-
Add hub Module for easy to use pre-trained models. (#31873)
* add Hub Module for easy to use pre-trained models. * support list, load, help functions. * support load models by github, gitee, local Co-authored-by: LielinJiang <jianglielin@baidu.com>
Commit 4e460d7
-
Commit 74824fd
-
paddle.save/load support nested structure and layer (#32446)
* support save/load binary format tensor * Fix error when create cudaplace * Fix error when create cudaplace * Fix error when create cudaplace * get device context from pool. * move define of 'SerializeToStream' and 'DeserializeFromStream' to 'lod_tensor.cc' and 'selected_rows.cc'. * support complex object * improve coverage. * improve coverage * improve coverage. * fix a bug. * polish API * save/load program * paddle.save/load: layer * deal with conflict * if PY2, block test_paddle_save_load.TestSaveLoadLayer * polish code. * polish code * edit unittest * The condition for an object to be identified as state_dict becomes stricter * use 'core._cuda_synchronize'
Commit 727b28d
-
Commit 1896c77
-
Commit 541d702
-
[Paddle-TRT] Add trt runtime version check (#32443)
* add trt runtime version check * use different wrap, and change to major version check
Commit b055676
-
Commit 5943ff7
-
Commit 25e723e
-
Commit aceec7f
-
support python39 and delete python35 in Dockerfile (#32385)
* support python39 and delete python35 * support python39 in Dockerfile.centos * fix ubuntu18 bug * update Dockerfile.ubuntu setuptools * fix centos py39 errors * fix centos py39 error2
Commit 78fc74b
-
Commit 29e081b
-
Commit 5468de0
-
Cleanup the unused codes for samplecode testing (#32525)
* update testcases * remove unused codes * update the docstring for samcd_processor * no need to import the six module * I don't know why there was a leading space, but now that there are unit tests, removing it causes no problems * add unittests for print_signatures; add the first case for 'required' mechanism when executing sample code testing * there is no paddle installed in PR-CI-APPROVAL test=document_fix
Commit 2328921
-
[Paddle-TRT] Fix AI-Rank BERT emb_eltwise_layernorm input order (#32482)
* fix airank bert emb order * move input num check to converter * add input num check * add unused var check white list
Commit fba46ea
-
Make range API set its out shape when possible (#32472)
The `range` API sets its output shape in dygraph but not in static graph, which can cause Dy2stat errors. This PR sets the output shape of the `range` API when possible.
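"When possible" presumably means when `start`, `end` and `step` are known constants at graph-build time. A minimal sketch of that shape inference follows; `infer_range_shape` is a hypothetical helper, and using `None` for a value known only at run time and `-1` for an unknown dimension are assumptions for illustration, not Paddle's actual implementation.

```python
import math


def infer_range_shape(start, end, step):
    """Return the 1-D output shape of a range op, or [-1] if it is unknown.

    A value of None stands for an argument only known at run time
    (for example, a tensor input); this convention is an assumption.
    """
    if start is None or end is None or step is None:
        return [-1]  # shape cannot be decided at build time
    if step == 0:
        raise ValueError("step must be non-zero")
    # Same element count as Python's range, extended to float arguments.
    return [max(0, math.ceil((end - start) / step))]


print(infer_range_shape(0, 10, 3))     # [4]
print(infer_range_shape(10, 0, -3))    # [4]
print(infer_range_shape(0, None, 1))   # [-1]
```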
Commit f16981b
-
* Dygraph recompute * unittest for Dygraph recompute * dy recompute remove unittest for win and mac
Commit 583ebab
-
add pipeline for dynamic graph (#32511)
* add pp dygraph, test=develop
lilong12 authored Apr 25, 2021
Commit 561dc71
Commits on Apr 26, 2021
-
[DOC] Clarify the difference of paddle.norm and np.linalg.norm (#32530)
* [DOC] Clarify the difference between paddle.norm and np.linalg.norm
Commit 33ca455
-
Commit d0751d0
-
[Dy2stat] Support paddle.to_tensor with int, float, bool. (#32420)
paddle.to_tensor is translated to paddle.assign in Dy2stat; however, paddle.assign doesn't support int, float, or bool. This PR adds that support.
Commit 1b9a3bf
-
add norm_by_times param to ctc_loss (#32490)
* add norm_by_times param to ctc_loss * fix doc,test=develop
Commit 6c03ea5
-
Commit 756f463
-
[AMP] Autocast to fp32 for op has no fp16 kernel (#32543)
* skip op has no fp16 kernel * add ut
Commit d2b31a1
-
fix acc typo and shape error, and remove 'users' subjects in amp doc, test=document_fix (#32476)
Commit ab3d2bf
-
optimize slice op and slice grad op (#32266)
* optimize slice op and slice grad op, test=develop * optimize variable name and annotation information, test=develop
Commit 5161f71
-
Commit fd85a4a
-
support backward return None, when corresponding input tensor without gradient (#32494)
* support backward return None. * edit unittest. * edit code according to CI * Improve error information
Commit 8e66046
-
Commit 40e51b2
-
[HybridParallel]Fix model parallel bug by using C++ op (#32536)
* fix model parallel * rm parallel_help.py * add embedding
Commit ea465fa
-
change prepend_op to append_op in initializer (#32177)
* change prepend to append * fix ut * add testcase * fix ut * fix test_custom_relu_model
Commit 8fec3c6
-
Commit 41bfec8
-
Unset ReserveSpace of batch_norm for inference program. (#32493)
* Unset ReserveSpace for inference program. * Support training from an inference program.
Commit 202b0ea
-
fix dataloader exit error (#32550)
* fix dataloader exit error if the user exits the program while the dataloader is still iterating. test=develop
Commit eae3405
-
Modified the return value of tensor.grad from numpy to tensor. (#32142)
* Modified the return value of tensor.grad from numpy to tensor. * Modify unittests. * fixed bugs. * Add warning info for x.grad * fixed unittests which used x.grad * fixed bug.
Commit c40c16a
-
[2.1 API] Modified params of some APIs to support tuple and list. (#32528)
* Modified params of some APIs to support tuple and list. * fixed bug.
Commit 400c3aa
-
Commit 78908b4
-
Make assign Doc Same for creation.py and layers/tensor.py, test=docum…
Commit 7f162b5
-
Commit 913317f
-
[PsCore] optimize performance of large kv (#32535)
* optimize pull sparse * optimize pull sparse * change macro * format
Commit 4b7242b
-
Optimize where_index_op(prefix sum) (#30601)
* new optimization for where_index_op with a prefix sum version. * write a scan prefix sum kernel with stream for where index op. * optimize where_index by using cub::DeviceScan::InclusiveSum instead of the imperfect self-written kernel. * remove CheckTrue struct and rename stide_array for readability. * optimize variable names for readability. * optimize function names and annotations.
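The prefix-sum idea behind this optimization can be sketched sequentially. The version below is a plain-Python stand-in, not the CUDA kernel from the PR: `where_index` here is a hypothetical 1-D helper, and `itertools.accumulate` plays the role that `cub::DeviceScan::InclusiveSum` plays on the GPU.

```python
from itertools import accumulate


def where_index(flags):
    """Return the indices of truthy entries using an inclusive prefix sum."""
    marks = [1 if f else 0 for f in flags]
    scan = list(accumulate(marks))       # inclusive prefix sum of the marks
    total = scan[-1] if scan else 0      # number of truthy entries
    out = [0] * total
    for i, m in enumerate(marks):
        if m:
            # scan[i] - 1 is the output slot reserved for element i, so on a
            # GPU each thread could write its index without coordination.
            out[scan[i] - 1] = i
    return out


print(where_index([0, 1, 1, 0, 1]))  # [1, 2, 4]
```

The scan gives every selected element a unique output position up front, which is what lets the write step run in parallel.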
Commit 6ec4e64
-
Fix OPENBLAS ci and fix windows CPU CI to parallel compile (#32548)
* clear CUDA compile environment on windows * fix Windows CI * fix Windows CI * fix Windows CI
Commit 1ec9525
-
Commit fcd18ef
-
Commit 4ba49af
-
Commit a7be32c
-
Commit c47bafc
Commits on Apr 27, 2021
-
Commit 0bc97e9
-
Commit f1bc322
-
Commit 9930a58
-
[HybridParallel] Fix amp bug in ModelParallel (#32579)
* fix amp bug * fix name of wordsize
Commit c1db7e3
-
Check for cuda errors immediately after kernel launch (#32557)
Co-authored-by: Yang Zhang <yangzhang@live.com>
Commit 19eefef
-
Commit 6579432
-
Commit 809ac03
-
Commit 85e697d
-
Support list and tuple for args. (#32344)
* Support list and tuple for parameters of layer_norm, multiprocess_reader, DatasetFolder and ImageFolder. * add unittest for layer_norm. * add require gpu for example.
Commit a08a118
-
str in python2 is different to python3's, it make mistakes for some api's docstring (#32588)
* UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 1788: ordinal not in range(128) test=document_fix str(doc) in python2 test=document_fix * update md5 function in count_api_without_core_ops.py str in py2 is different. test=document_fix
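One common way to make a docstring digest independent of the interpreter's `str` type is to hash explicit UTF-8 bytes. The sketch below assumes that approach; `doc_md5` is a hypothetical name, and the actual function in `count_api_without_core_ops.py` may differ.

```python
import hashlib


def doc_md5(doc):
    """MD5 hex digest of a docstring, always hashing its UTF-8 bytes."""
    if doc is None:
        doc = ""
    if not isinstance(doc, bytes):
        # Encode explicitly rather than relying on an implicit ASCII codec,
        # which fails on bytes such as 0xe2 (the lead byte of U+2019).
        doc = doc.encode("utf-8")
    return hashlib.md5(doc).hexdigest()


text = "it\u2019s a docstring"   # contains a non-ASCII apostrophe
print(doc_md5(text) == doc_md5(text.encode("utf-8")))  # True
```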
Commit 97794ec
-
fix cross_entropy calculation error (#32545)
* fix cross_entropy calculation error * add unittest and fix static
Commit 23d3e36
-
[Docs] Modified the docs of some api for supporting list/tuple args. (#32360)
* fixed docs. * Fixed docs. test=document_fix code bak. fixed docs. test=document_fix * Revert to previous version of python/paddle/fluid/backward.py * fixed bugs. * test=document_fix. Fixed examples.
Commit 1515892
-
'jit.save/load' support save/load function without parameters. (#32430)
* jit.save/load support function. * delete unittest test_jit_load_model_incomplete. * edit code according to CI * Modify the documentation. * add note to doc.
Commit 0372f1d
-
Commit 6f6e159
-
Commit eca8dcc
-
[OPs] Bug fix, fix the segment mean for illegal syncthreads usage. (#32596)
* [OPs] Bug fix, fix the segment mean for illegal syncthreads usage.
Commit 1afe1ac
-
Commit f285f4c
-
clear 'BasicEngine' when an exception occurs in the backward. (#32546)
* clear 'BasicEngine' when an exception occurs in the backward. * deal with conflict. * deal with conflict.
Commit 797b2df
-
edit paddle.save/load API (#32532)
* edit paddle.save/load API * Update io.py edit doc * delete cpython-37.pyc * Update io.py edit doc * Update io.py recommit * Update io.py recommit * Update io.py recommit * Update io.py recommit
Commit 79f7ba6
-
update 2.0 public api in paddle.init (#32034)
Co-authored-by: XiaoguangHu <46782768+XiaoguangHu01@users.noreply.github.com>
Commit 125e481
-
update 2.0 public api in nn (#31912)
* update 2.0 public api in nn * replace Chinese character cause error in ci;synchronization with pr:#32588 to avoid 'ascii' codec in python2 * numbers used in paddle.nn.functional.norm but not imported
Commit 3b81f2b
-
[Docker] support cuda11.2 and using gcc5.4 in cuda10.1 (#32531)
* support cuda11.2 and using gcc5.4 in cuda10.1 * fix manylinux py36 bug * support cuda11.2 * fix python36 pip version problem in ubuntu * save cuda11.0
Commit 3132695
-
Commit db41b74
Commits on Apr 28, 2021
-
Optimize update_loss_scaling_op (#32554)
* optimize update_loss_scaling_op by fusing the for loop into one kernel, test=develop * remove useless while loop and optimize variable name, test=develop * optimize variable name from out_addrs_tensor to out_addrs_mem, test=develop * optimize variable names for readability by changing the prefix identifier from t_ to local_
Commit 0dc02dc
-
[oneDNN] Added clearing oneDNN cache per executor (#32499)
* - Added clearing oneDNN per executor * - Executor does not always have FLAGS_use_mkldnn set to true
Commit ba61076
-
Reduce the time cost for the elementwise_add test case (#32628)
Commit 6d3eb3d
-
Commit 7a245b7
-
Fix some error message (#32614)
* fix two error message * fix two error message * fix error * fix error * fix error * fix error * fix some error message * fix some error * fix error * fix some error * fix some error * fix some error * fix one error * fix some error * fix seven error message * fix error * fix error * fix error * fix error * fix some error message * fix error * fix some error * fix some error
Commit 9ee709f
-
Commit 4ead9a5
-
Commit bda0e60
-
* Add dlnne engine runtime * Fix log * Remove <const_cast> and remove unrelated modify with dlnne, +clang-format * Fix CMakeList format error * Add copyright message * Fix dlnne CMakeList.txt * Add some paddlepaddle_pass to support more networks * Fix some format bug * Add delete dropout_op pass * Fix some format bug * Fix format bug
Commit abcb3f5
-
Add fake interface for register_hook in static mode (#32642)
* add fake interface for hook in static mode * add unittests * fix failed unittests
Commit 9aad752
-
Commit bc379ca
-
[NPU] add input EpsilonTensor for adam (#32605)
* add input EpsilonTensor for adam * update python api * add unit test * add npu test * add more ut
Commit 119cda3
Commits on Apr 29, 2021
-
Commit 243b432
-
[Paddle-TRT] Implement MHA fp16 order same as training (#32629)
* implement MHA order same as training * fix fp16 compile issue on old architecture * fix format * fix format
Commit 75282e7
-
Commit dec8ab8
-
Add BF16 uniform random initializer (#32468)
* Add bf16 uniform random initializer * Remove duplicated section * Change UT to CPU place only * Put detail functions into anonymous namespace
Commit f46f15a
-
Commit 8ccf549
-
Commit b7ddd7d
-
Commit b6ca6a5
-
Commit 10c493a
-
Commit 7a73692
-
Add op read_file and decode_jpeg (#32564)
* add op read_file and decode_jpeg
Commit b22f6d6
-
add __all__=[] to python files not in API public list; import * only support in API public list files (#32643)
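The effect of an empty `__all__` on `import *` is standard Python behaviour and can be checked in isolation. In the sketch below, `demo_internal_mod`, `helper` and `public_api` are made-up names, and the module is built in memory purely for illustration.

```python
import sys
import types


def star_import_names(all_value):
    """Build a throwaway module and report which names `import *` exposes."""
    mod = types.ModuleType("demo_internal_mod")  # hypothetical module name
    mod.helper = lambda: "internal"
    mod.public_api = lambda: "public"
    if all_value is not None:
        mod.__all__ = all_value
    sys.modules[mod.__name__] = mod
    try:
        namespace = {}
        exec("from demo_internal_mod import *", namespace)
    finally:
        del sys.modules[mod.__name__]
    # Drop names such as __builtins__ that exec adds to the namespace.
    return sorted(n for n in namespace if not n.startswith("__"))


print(star_import_names(None))            # ['helper', 'public_api']
print(star_import_names([]))              # []
print(star_import_names(["public_api"]))  # ['public_api']
```

With `__all__ = []`, a star import brings in nothing, so only files that list their public names take part in `import *`.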
Commit 69d237c
-
Commit 0f578db
-
Commit a3e7719
Commits on Apr 30, 2021
-
Commit 43527a2
-
Commit 8fd724a
-
Commit 5ada032
-
Commit bd8d35a
-
* Update the full list of unit tests * Update the full list of unit tests * Remove the failing Windows unit tests * Remove the failing Windows unit tests
Commit 9b4fabf
-
Add 12 inplace APIs including auto generated (#32573)
* add relu6_ hardsigmoid_ leaky_relu_ Inplace APIs * add softmax_with_cross_entropy_ Inplace API * add clip_ scale_ add_ subtract_ Inplace APIs * add wlist * fix parameter of scale api * add add_n_ Inplace API and remove log_ Inplace API * fix elementwise_add_ and elementwise_sub_ broadcast problem * elementwise inplace api give error message before run the op * use broadcast_shape in elementwise inplace op * add 8 inplace apis that is auto generated * add unittest for all inplace apis * add decorator for inplace apis in static mode * fix windows blas fail of exp inplace api, change array_equal to allclose * add flatten inplace api * add flatten unittest * fix flatten unittest * add decorator * fix grad.numpy in test_pylayer_op * unsupport softmax_with_cross_entropy_ * add test_inplace_softmax_with_cross_entropy to static_mode_white_list * delete __all__ in inplace_utils * delete activation inplace function and add Tensor.inplace_func * change paddle.inplace_ to Tensor.inplace_ * fix little problem * add paddle in inplace_utils
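The broadcast restriction mentioned for the elementwise in-place APIs can be sketched as a shape check that runs before the op. This is an illustration under assumptions, not Paddle's code: `broadcast_shape` below is a local pure-Python helper, `check_inplace_add` is hypothetical, and the rule assumed is that the broadcast result must equal the shape of `x`, since `x` is overwritten.

```python
from itertools import zip_longest


def broadcast_shape(x_shape, y_shape):
    """Broadcast two shapes, aligning them from the trailing dimension."""
    out = []
    for a, b in zip_longest(reversed(x_shape), reversed(y_shape), fillvalue=1):
        if a != b and a != 1 and b != 1:
            raise ValueError(f"shapes {x_shape} and {y_shape} cannot be broadcast")
        out.append(max(a, b))
    return list(reversed(out))


def check_inplace_add(x_shape, y_shape):
    """Raise before running the op if the result would not fit into x."""
    out_shape = broadcast_shape(x_shape, y_shape)
    if out_shape != list(x_shape):
        raise ValueError(
            f"in-place result shape {out_shape} differs from x shape {list(x_shape)}"
        )
    return out_shape


print(check_inplace_add([2, 3], [3]))   # [2, 3]
try:
    check_inplace_add([3], [2, 3])      # x would have to grow to [2, 3]
except ValueError as err:
    print("rejected:", err)
```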
Commit 308073d
-
revert data_generator __init__.py (#32670)
* revert data_generator * test * add setup.py
Commit eb13c19
-
Commit 7e2b60a
-
Commit c6713bc
-
Support transforms for paddle tensor image (#31970)
* add to_grayscale, normalize * add rotate * add vflip and hflip * add crop center_crop * add padding, support constant, reflect, replicate, circular same as paddle.pad * add get-image-[n,c,w,h] axis utils
Commit 6ab43f7
-
Commit 109fdf1
-
avoid polluting logging's root logger (#32673)
avoid polluting logging's root logger
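A common way for a library to stay out of the root logger is to configure its own named logger and stop propagation. The sketch below assumes that pattern; `get_library_logger` and the logger name `demo_lib` are hypothetical, and the commit itself does not say which mechanism it uses.

```python
import logging


def get_library_logger(name="demo_lib", level=logging.INFO):
    """Return a named logger with its own handler, leaving the root untouched."""
    logger = logging.getLogger(name)
    logger.setLevel(level)
    if not logger.handlers:  # avoid stacking handlers on repeated calls
        handler = logging.StreamHandler()
        handler.setFormatter(
            logging.Formatter("%(levelname)s %(name)s: %(message)s")
        )
        logger.addHandler(handler)
    # Do not forward records to the root logger's handlers.
    logger.propagate = False
    return logger


root_handlers_before = list(logging.getLogger().handlers)
log = get_library_logger()
log.info("configured without touching the root logger")
root_handlers_after = list(logging.getLogger().handlers)
```

By contrast, calling `logging.basicConfig()` inside a library attaches a handler to the root logger and changes logging behaviour for the whole application.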
Feiyu Chan authored Apr 30, 2021
Commit 4d95c8c
-
Commit 0a0f324
-
[Dy2stat] Fix to_tensor Bug Reported from QA (#32701)
Dy2stat failed when the user wrote `return paddle.to_tensor(xxx)` because visit_Expr is not invoked when the expression sits inside a return statement. Other statements may trigger the same bug. To fix it, we rewrote the transformer so that it converts paddle.to_tensor to paddle.assign for all Call nodes.
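The difference between visiting `Expr` statements and visiting `Call` nodes can be reproduced with the standard `ast` module. The sketch below is not Paddle's transformer: `ToTensorToAssign` and `rewrite` are hypothetical names, the rewrite only renames the attribute, and `ast.unparse` requires Python 3.9 or later.

```python
import ast


class ToTensorToAssign(ast.NodeTransformer):
    """Rename paddle.to_tensor(...) to paddle.assign(...) on every Call node."""

    def visit_Call(self, node):
        self.generic_visit(node)  # rewrite nested calls first
        func = node.func
        if (
            isinstance(func, ast.Attribute)
            and func.attr == "to_tensor"
            and isinstance(func.value, ast.Name)
            and func.value.id == "paddle"
        ):
            new_func = ast.Attribute(value=func.value, attr="assign", ctx=ast.Load())
            node.func = ast.copy_location(new_func, func)
        return node


def rewrite(source):
    """Parse source, apply the transformer, and return the new source text."""
    tree = ToTensorToAssign().visit(ast.parse(source))
    ast.fix_missing_locations(tree)
    return ast.unparse(tree)


rewritten = rewrite("def f(x):\n    return paddle.to_tensor(x)\n")
print(rewritten)
```

A `return paddle.to_tensor(x)` statement is an `ast.Return` wrapping an `ast.Call`, with no `ast.Expr` node involved, which is why a `visit_Expr` hook never sees it while `visit_Call` does.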
Commit 0026819
-
Commit 3cc11a3
-
Commit f4a3f85
Commits on May 3, 2021
-
Fix the bug in pipeline for dygraph mode (#32716)
* update, test=develop
lilong12 authored May 3, 2021
Commit a0f4ac5
-
fix the bug in processing subblock in pipeline (#32727)
lilong12 authored May 3, 2021
Commit d0de2d8
Commits on May 5, 2021
-
lilong12 authored May 5, 2021
Commit a259076
Commits on May 6, 2021
-
Commit 8b1b214
-
Commit 9599c3b
-
Commit f1c68a0
-
Commit c5ae21f
-
fix l1 decay for inplace (#32717)
littletomatodonkey authored May 6, 2021
Commit efdb0a7
-
[ROCM] bugfix for unittest (#32392)
* fix test_unpool_op * fix test_inplace_addto_strategy * fix test_conv2d_fusion_op * fix test_imperative_lod_tensor_to_selected_rows, test_imperative_selected_rows_to_lod_tensor * fix test_dot_op * fix test_correlation_op * fix tracer * fix test_memcpy_op
Commit 3139262
-
* [Rocm] fix test_expand_as_op * [Rocm] fix test_expand_as_op * [Rocm] fix test_expand_as_op * [Rocm] fix test_expand_as_op * [Rocm] fix test_expand_as_op * [Rocm] fix test_expand_as_op
Commit 2fe4580
-
Commit 28d42a9
-
Commit 70eb435
-
[Rocm] fix tests of inplace_abn_op & grid_sampler_op (#32703)
* [Rocm] fix tests of inplace_abn_op & grid_sampler_op * [Rocm] fix tests of inplace_abn_op & grid_sampler_op
Commit 7c27541
-
[2.1 API] Enable printing deprecated warning info. (#32712)
* Add deprecated warning info. * Add unittest for deprecated decorator. * Add warning info for tensor.grad
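A deprecation decorator of the kind this commit describes can be sketched with the standard `warnings` module. The names below (`deprecated`, `since`, `update_to`, `old_api`) are illustrative assumptions; Paddle's own decorator may take different parameters and use a different warning category.

```python
import functools
import warnings


def deprecated(since, update_to):
    """Wrap a function so each call emits a DeprecationWarning."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            warnings.warn(
                f"{func.__name__} is deprecated since {since}; "
                f"use {update_to} instead.",
                category=DeprecationWarning,
                stacklevel=2,  # point the warning at the caller's line
            )
            return func(*args, **kwargs)
        return wrapper
    return decorator


@deprecated(since="2.1", update_to="new_api")
def old_api(x):
    return x * 2


with warnings.catch_warnings(record=True) as caught:
    # DeprecationWarning is hidden by default in many contexts;
    # "always" makes every occurrence visible.
    warnings.simplefilter("always")
    result = old_api(3)

print(result, caught[0].message)
```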
Commit 51b39a9
Commits on May 7, 2021
-
Mechanism that converts startup_program initializers to BF16 (#32720)
* Add casting initializers for bf16 training * Changes after review * Correct test and add comment
Commit ce2bdb0
-
Refactor `dot` op's CPU kernel for better performance (#32589)
* OP dot: refactor CPU kernels and get better loop performance. * Minor fix on code format. * Fixed minor errors.
Commit 97a9552
-
bug fix, test=develop (#32752)
lilong12 authored May 7, 2021
Commit 9b65d4c
-
Remove paddle_custom_op dynamic libraries, and link to FLUID_CORE on Windows (#32583)
* Remove paddle_custom_op dynamic libraries, change link to FLUID_CORE on windows, and check copy_to * fix CI
Commit 7610c2b