-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
develop #1
develop #1
Commits on Jul 27, 2021
-
Configuration menu - View commit details
-
Copy full SHA for ede001f - Browse repository at this point
Copy the full SHA ede001fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6ab0a6a - Browse repository at this point
Copy the full SHA 6ab0a6aView commit details -
This PR add some primitives api related files. Those files are almost empty for the moment, but will be filled later.
Configuration menu - View commit details
-
Copy full SHA for 563a669 - Browse repository at this point
Copy the full SHA 563a669View commit details
Commits on Jul 28, 2021
-
apply pass strategy to sub graph (#34158)
When Graph has sub-graph, apply pass to it and all sub-graph. And add single test script .
Configuration menu - View commit details
-
Copy full SHA for 5e27d16 - Browse repository at this point
Copy the full SHA 5e27d16View commit details -
[NPU] add NPU ops of compare, test=develop (#34365)
* [NPU] add NPU ops&uts of compare, test=develop * testing * try style-format * [NPU] update compare_op_npu uts * [NPU] fix code sytle of test_compare_op_npu.py
Configuration menu - View commit details
-
Copy full SHA for 68b4a2c - Browse repository at this point
Copy the full SHA 68b4a2cView commit details -
Configuration menu - View commit details
-
Copy full SHA for a59f215 - Browse repository at this point
Copy the full SHA a59f215View commit details -
graph_to_program save parameter and stop_gradient information (#33771)
This PR added optional boolean is_parameter and stop_gradient in the VarDesc proto, and remove them during save_inference_model
Configuration menu - View commit details
-
Copy full SHA for 8a7dee3 - Browse repository at this point
Copy the full SHA 8a7dee3View commit details -
[CPU-PSLIB] Fix bug for consistency insepection of op's embedding nam…
…e and sparse table name in config_fleet.py (#34441)
Configuration menu - View commit details
-
Copy full SHA for f1654de - Browse repository at this point
Copy the full SHA f1654deView commit details -
Configuration menu - View commit details
-
Copy full SHA for 167523e - Browse repository at this point
Copy the full SHA 167523eView commit details -
[Paddle-TRT] Fix TRT8 cuda error before program exit (#34403)
* trt buildEngineWithConfig is deprecated * add trt version control
Configuration menu - View commit details
-
Copy full SHA for 995195f - Browse repository at this point
Copy the full SHA 995195fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0fb15d9 - Browse repository at this point
Copy the full SHA 0fb15d9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9f60492 - Browse repository at this point
Copy the full SHA 9f60492View commit details -
[NPU] Support ScaleTensor for scale npu kernel (#34418)
* support ScaleTensor for scale npu kernel * add more tests for adam npu * fix compile * fix unittest * refine adam optimizer
Configuration menu - View commit details
-
Copy full SHA for f17ba93 - Browse repository at this point
Copy the full SHA f17ba93View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0b2e510 - Browse repository at this point
Copy the full SHA 0b2e510View commit details -
Configuration menu - View commit details
-
Copy full SHA for a83a368 - Browse repository at this point
Copy the full SHA a83a368View commit details -
Configuration menu - View commit details
-
Copy full SHA for 54cc065 - Browse repository at this point
Copy the full SHA 54cc065View commit details -
Support C++ import python on windows for paddle (#34312)
* Support C++ import python on windows for paddle * Support C++ import python on windows for paddle
Configuration menu - View commit details
-
Copy full SHA for cf12ea5 - Browse repository at this point
Copy the full SHA cf12ea5View commit details -
[Dy2Stat]Add build_strategy in @to_static to support open pass (#34347)
* Add build_strategy in @to_static to support open pass * fix os.environ * add timeout * disable test_build_strategy on openblas
Configuration menu - View commit details
-
Copy full SHA for eb27d8b - Browse repository at this point
Copy the full SHA eb27d8bView commit details
Commits on Jul 29, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 92d8fed - Browse repository at this point
Copy the full SHA 92d8fedView commit details -
Configuration menu - View commit details
-
Copy full SHA for e958316 - Browse repository at this point
Copy the full SHA e958316View commit details -
* tile op * more uts * disable tile if trt6.0 * typo * fix timeout issue * opteller * opteller remove duplicate code * comments. test=document_fix * modify PADDLE_ENFORCE. * fix reduce_mean issue
Configuration menu - View commit details
-
Copy full SHA for cffa15c - Browse repository at this point
Copy the full SHA cffa15cView commit details -
Configuration menu - View commit details
-
Copy full SHA for d3dae0c - Browse repository at this point
Copy the full SHA d3dae0cView commit details -
add parameter of input in model.summary (#34165)
* add input option in model.summary
Configuration menu - View commit details
-
Copy full SHA for 40bd7a7 - Browse repository at this point
Copy the full SHA 40bd7a7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 104c82b - Browse repository at this point
Copy the full SHA 104c82bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 9e1af38 - Browse repository at this point
Copy the full SHA 9e1af38View commit details -
Configuration menu - View commit details
-
Copy full SHA for b451ff2 - Browse repository at this point
Copy the full SHA b451ff2View commit details -
add persistent_workers (#34017)
* add persistent_workers. test=develop
Configuration menu - View commit details
-
Copy full SHA for 76710e5 - Browse repository at this point
Copy the full SHA 76710e5View commit details -
Configuration menu - View commit details
-
Copy full SHA for d71b9ba - Browse repository at this point
Copy the full SHA d71b9baView commit details -
Configuration menu - View commit details
-
Copy full SHA for 76f94f8 - Browse repository at this point
Copy the full SHA 76f94f8View commit details -
Configuration menu - View commit details
-
Copy full SHA for b56dbe0 - Browse repository at this point
Copy the full SHA b56dbe0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 02cc3c5 - Browse repository at this point
Copy the full SHA 02cc3c5View commit details -
fix paddle.summary's bug when outputs contains non-tensor (#34160)
* fix paddle.summary's bug when output contains non-tensor
Configuration menu - View commit details
-
Copy full SHA for b7fac0f - Browse repository at this point
Copy the full SHA b7fac0fView commit details -
recover capacity check (#34478)
* graph engine demo * upload unsaved changes * fix dependency error * fix shard_num problem * py client * remove lock and graph-type * add load direct graph * add load direct graph * add load direct graph * batch random_sample * batch_sample_k * fix num_nodes size * batch brpc * batch brpc * add test * add test * add load_nodes; change add_node function * change sample return type to pair * resolve conflict * resolved conflict * resolved conflict * separate server and client * merge pair type * fix * resolved conflict * fixed segment fault; high-level VLOG for load edges and load nodes * random_sample return 0 * rm useless loop * test:load edge * fix ret -1 * test: rm sample * rm sample * random_sample return future * random_sample return int * test fake node * fixed here * memory leak * remove test code * fix return problem * add common_graph_table * random sample node &test & change data-structure from linkedList to vector * add common_graph_table * sample with srand * add node_types * optimize nodes sample * recover test * random sample * destruct weighted sampler * GraphEdgeBlob * WeightedGraphEdgeBlob to GraphEdgeBlob * WeightedGraphEdgeBlob to GraphEdgeBlob * pybind sample nodes api * pull nodes with step * fixed pull_graph_list bug; add test for pull_graph_list by step * add graph table;name * add graph table;name * add pybind * add pybind * add FeatureNode * add FeatureNode * add FeatureNode Serialize * add FeatureNode Serialize * get_feat_node * avoid local rpc * fix get_node_feat * fix get_node_feat * remove log * get_node_feat return py:bytes * merge develop with graph_engine * fix threadpool.h head * fix * fix typo * resolve conflict * fix conflict * recover lost content * fix pybind of FeatureNode * recover cmake * recover tools * resolve conflict * resolve linking problem * code style * change test_server port * fix code problems * remove shard_num config * remove redundent threads * optimize start server * remove logs * fix code problems by reviewers' suggestions * move graph files into a folder * code style change * remove graph operations from base table * optimize get_feat function of graph engine * fix long long count problem * remove redandunt graph files * remove unused shell * recover dropout_op_pass.h * fix potential stack overflow when request number is too large & node add & node clear & node remove * when sample k is larger than neigbor num, return directly * using random seed generator of paddle to speed up * fix bug of random sample k * fix code style * fix code style * fix blocking_queue problem * fix style * fix * recover capacity check Co-authored-by: Huang Zhengjie <270018958@qq.com> Co-authored-by: Weiyue Su <weiyue.su@gmail.com> Co-authored-by: suweiyue <suweiyue@baidu.com> Co-authored-by: luobin06 <luobin06@baidu.com> Co-authored-by: liweibin02 <liweibin02@baidu.com> Co-authored-by: tangwei12 <tangwei12@baidu.com>
Configuration menu - View commit details
-
Copy full SHA for b9d6c98 - Browse repository at this point
Copy the full SHA b9d6c98View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9d985ca - Browse repository at this point
Copy the full SHA 9d985caView commit details -
add fix op run order pass (#34427)
* add fix op run order pass * add ut for fix_op_run_order * fix ci error * improve coverage * improve coverge again and fix cpu test case * follow some comments
Configuration menu - View commit details
-
Copy full SHA for 79e758c - Browse repository at this point
Copy the full SHA 79e758cView commit details
Commits on Jul 30, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 423ea97 - Browse repository at this point
Copy the full SHA 423ea97View commit details -
fix lr in param group (#34468)
* fix lr in param group * add unittest for adamw
Configuration menu - View commit details
-
Copy full SHA for 3041605 - Browse repository at this point
Copy the full SHA 3041605View commit details -
Support setitem by None index (#34442)
* Support setitem by None index * remove unreachable code * Add Checkpoint for set_value_op because add a new attribute
Configuration menu - View commit details
-
Copy full SHA for f775bfc - Browse repository at this point
Copy the full SHA f775bfcView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6c09496 - Browse repository at this point
Copy the full SHA 6c09496View commit details -
Configuration menu - View commit details
-
Copy full SHA for ba19398 - Browse repository at this point
Copy the full SHA ba19398View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5571c98 - Browse repository at this point
Copy the full SHA 5571c98View commit details -
[NPU] add clip and clip_grad on NPU, test=develop (#34429)
* [NPU] add clip and clip_grad on NPU, test=develop * address review comments, test=develop * update, test=develop
Configuration menu - View commit details
-
Copy full SHA for b68e36d - Browse repository at this point
Copy the full SHA b68e36dView commit details -
Added expand_v2 BF16/FP32 FWD/BWD kernels (#34284)
* added expand_v2 bf16/fp32 kernel * minor change * CI fix * added missing test file * added formatting * reduced binary size * CI fix
Configuration menu - View commit details
-
Copy full SHA for 41c4f72 - Browse repository at this point
Copy the full SHA 41c4f72View commit details -
add trainer desc config to distributed strategy (#34457)
* add trainer desc config to distributed strategy * code style modified
Configuration menu - View commit details
-
Copy full SHA for e6aacd1 - Browse repository at this point
Copy the full SHA e6aacd1View commit details -
Added reshape, reshape2, squeeze and squeeze2 BF16/FP32 FWD/BWD kerne…
…ls (#34219) * test version of matmul_v2 * added matmul_v2 grad kernel * minor changes * minor changes * minor change for CI approval * CI fix * CI fix * added squeeze and squeeze2 kernels * CI fix * CI fix * CI fix * disabled tests when compiled with cuda * added setting format_tag by strides * added sigmoid BF16 FWD/BWD and gelu BF16 BWD * changes after review * Revert "added sigmoid BF16 FWD/BWD and gelu BF16 BWD" This reverts commit 6e3f767. * Revert "Merge branch 'matmul_v2_grad' into squeeze2_op" This reverts commit 06fcf67, reversing changes made to 6e3f767. * minor change * added reshape1/2 kernels * moved some functions into private block * CI fix * CI fix * CI fix
Configuration menu - View commit details
-
Copy full SHA for 22c4c18 - Browse repository at this point
Copy the full SHA 22c4c18View commit details -
Configuration menu - View commit details
-
Copy full SHA for 87148a5 - Browse repository at this point
Copy the full SHA 87148a5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2ad1e4c - Browse repository at this point
Copy the full SHA 2ad1e4cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 06b55ea - Browse repository at this point
Copy the full SHA 06b55eaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 44e4d57 - Browse repository at this point
Copy the full SHA 44e4d57View commit details -
Added matmul_v2 BF16/FP32 BWD kernel (#34192)
* test version of matmul_v2 * added matmul_v2 grad kernel * minor changes * minor changes * minor change for CI approval * CI fix * CI fix * trigger CI * changes after review, not working yet * moved ops to anonymous namespaces * changes after review
Configuration menu - View commit details
-
Copy full SHA for 0be7157 - Browse repository at this point
Copy the full SHA 0be7157View commit details -
Configuration menu - View commit details
-
Copy full SHA for 72a9c8f - Browse repository at this point
Copy the full SHA 72a9c8fView commit details
Commits on Jul 31, 2021
-
add resnet50 trt tests in pr-ci-inference (#34465)
* add resnet50 trt test in pr-ci-inference test
Configuration menu - View commit details
-
Copy full SHA for 8b72a1a - Browse repository at this point
Copy the full SHA 8b72a1aView commit details
Commits on Aug 1, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 41e2d41 - Browse repository at this point
Copy the full SHA 41e2d41View commit details
Commits on Aug 2, 2021
-
Configuration menu - View commit details
-
Copy full SHA for a6f55e4 - Browse repository at this point
Copy the full SHA a6f55e4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 393a0b1 - Browse repository at this point
Copy the full SHA 393a0b1View commit details -
Fix Inference CE Error by Topo Order (#34521)
The comment background message is too long, see details at #34521
Configuration menu - View commit details
-
Copy full SHA for 508b40e - Browse repository at this point
Copy the full SHA 508b40eView commit details -
* [NPU] add reduce_max * [NPU] delete skipIf * [NPU] add atrrs support or check * [NPU] add attr out_dtype * [NPU] delete debug codes
Configuration menu - View commit details
-
Copy full SHA for de53f2b - Browse repository at this point
Copy the full SHA de53f2bView commit details -
Change Bbenchmark-CI CPU/GPU (#34305)
* notest;test=cpu-benchmark * benchmark-cpu * notest;test=cpu-benchmark * notest;benchmark-cpu * notest;benchmark-cpu * notest;benchmark-cpu * notest;benchmark-cpu * notest;benchmark-cpu * notest;benchmark-cpu * fix * fix * add test_ci_model_benchmark.sh
Configuration menu - View commit details
-
Copy full SHA for 9d54a53 - Browse repository at this point
Copy the full SHA 9d54a53View commit details -
* test=develop * update identity * add unittest * notest,test=mac_py3 * modify comment & testname * test=document_fix * update comment * test=document_fix * activate all of the CI
Configuration menu - View commit details
-
Copy full SHA for 80f7f7e - Browse repository at this point
Copy the full SHA 80f7f7eView commit details -
Configuration menu - View commit details
-
Copy full SHA for c7cc5ac - Browse repository at this point
Copy the full SHA c7cc5acView commit details -
Configuration menu - View commit details
-
Copy full SHA for af88699 - Browse repository at this point
Copy the full SHA af88699View commit details -
Add basic functions of Program Pass (#34524)
* add basic APIs * add attr_types * follow comments * change pass attr types * add set pass attribute codes * refine PADDLE_THROW
Configuration menu - View commit details
-
Copy full SHA for 145cdb5 - Browse repository at this point
Copy the full SHA 145cdb5View commit details -
Change formula error in paddle.optimizer (#34539)
* fix paddle.optimizer test=document_fix * fix paddle.optimizer test=document_fix
Configuration menu - View commit details
-
Copy full SHA for 3b5fc2a - Browse repository at this point
Copy the full SHA 3b5fc2aView commit details -
[HybridParallel]Support 1f1b for PipelineParallel (#34483)
* support 1f1b for pipeline * add utest * add send_partial/recv_partial * support amp for pp * fix logger
Configuration menu - View commit details
-
Copy full SHA for 9e0bb91 - Browse repository at this point
Copy the full SHA 9e0bb91View commit details
Commits on Aug 3, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 61e51c1 - Browse repository at this point
Copy the full SHA 61e51c1View commit details -
Configuration menu - View commit details
-
Copy full SHA for e7dcdb7 - Browse repository at this point
Copy the full SHA e7dcdb7View commit details -
support more dim for mul op npu (#34546)
* support more dim for mul op npu * update unit test according to reviewer's comment.
Configuration menu - View commit details
-
Copy full SHA for 2aedf16 - Browse repository at this point
Copy the full SHA 2aedf16View commit details -
* support Kunlun2 * support KL2 * support KL2
Configuration menu - View commit details
-
Copy full SHA for 2d0f3d9 - Browse repository at this point
Copy the full SHA 2d0f3d9View commit details -
Configuration menu - View commit details
-
Copy full SHA for d7493df - Browse repository at this point
Copy the full SHA d7493dfView commit details -
Configuration menu - View commit details
-
Copy full SHA for 45fa14f - Browse repository at this point
Copy the full SHA 45fa14fView commit details -
[docker] delete install_trt in release docker (#34266)
* delete install_trt in release docker * fix whl package path
Configuration menu - View commit details
-
Copy full SHA for 2714fc7 - Browse repository at this point
Copy the full SHA 2714fc7View commit details -
[HybridParallel] Support segment for PipelineParallel (#34529)
* add layer segment * add segement for transformer * add utest
Configuration menu - View commit details
-
Copy full SHA for 9b6c7eb - Browse repository at this point
Copy the full SHA 9b6c7ebView commit details -
Configuration menu - View commit details
-
Copy full SHA for 56b7ebb - Browse repository at this point
Copy the full SHA 56b7ebbView commit details
Commits on Aug 4, 2021
-
Configuration menu - View commit details
-
Copy full SHA for c79fa1c - Browse repository at this point
Copy the full SHA c79fa1cView commit details -
supplement the function of slice. (#34172)
* supplement the function of slice * edit unittest * strided_slice_op support . * polish error message. * polish error message. * polish code. * polish unittest. * polish code. * polish code * polish error message.
Configuration menu - View commit details
-
Copy full SHA for 1f0f5d3 - Browse repository at this point
Copy the full SHA 1f0f5d3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 54b6c39 - Browse repository at this point
Copy the full SHA 54b6c39View commit details -
* fix backward bug * format code style * add test case for grad tensor accumulator
Configuration menu - View commit details
-
Copy full SHA for a7c3836 - Browse repository at this point
Copy the full SHA a7c3836View commit details -
Support npu kernel for Crop op (#34542)
* add crop npu op * code style check * update code to remove usless headers * add testcase and upsate code
Configuration menu - View commit details
-
Copy full SHA for 3ce14a3 - Browse repository at this point
Copy the full SHA 3ce14a3View commit details -
[NPU] Support npu kernel for assign_value op (#34568)
* [NPU] Support npu kernel for assign_value op * move test_assign_value_op_npu.py into unittests/npu folder * correce copyright year; add TestAssignApi class using NPUplace in test files
Configuration menu - View commit details
-
Copy full SHA for f39c3a5 - Browse repository at this point
Copy the full SHA f39c3a5View commit details -
* notest;benchmark-cpu * notest;benchmark-cpu * notest;benchmark-cpu * notest;benchmark-cpu * notest;benchmark-cpu * notest;benchmark-cpu * notest;benchmark-cpu * notest;benchmark-cpu * notest;benchmark-cpu * notest;benchmark-cpu * notest;benchmark-cpu * notest;benchmark-cpu * fix
Configuration menu - View commit details
-
Copy full SHA for ee60e82 - Browse repository at this point
Copy the full SHA ee60e82View commit details -
Configuration menu - View commit details
-
Copy full SHA for 91be876 - Browse repository at this point
Copy the full SHA 91be876View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1f76a2f - Browse repository at this point
Copy the full SHA 1f76a2fView commit details -
Add gradient with optimizer API (#34395)
* add gradients_with_optimizer api * modify gradients_with_optimizer * add gradients_with_optimizer api into paddle.auto.backward_mode * add gradients_with_optimizer test case * add doc for gradients_with_optimizer * add doc for gradients_with_optimizer
Configuration menu - View commit details
-
Copy full SHA for d9e63a8 - Browse repository at this point
Copy the full SHA d9e63a8View commit details -
paddle/nn/functional docs' bug fix (#34580)
* fix paddle.optimizer test=document_fix * fix paddle.optimizer test=document_fix * fix bugs in paddle.nn.functional document test=document_fix * fix bugs in paddle.nn.functional document test=document_fix * fix bugs in paddle.nn.functional document test=document_fix * fix bugs in paddle.nn.functional document test=document_fix
Configuration menu - View commit details
-
Copy full SHA for 420570c - Browse repository at this point
Copy the full SHA 420570cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0989211 - Browse repository at this point
Copy the full SHA 0989211View commit details
Commits on Aug 5, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 090c863 - Browse repository at this point
Copy the full SHA 090c863View commit details -
[HybridParallel]Fix bug of p2p for partial_send/recv (#34615)
* fix bug of p2p for partial * fix error
Configuration menu - View commit details
-
Copy full SHA for 4cc3d9a - Browse repository at this point
Copy the full SHA 4cc3d9aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8144a73 - Browse repository at this point
Copy the full SHA 8144a73View commit details -
* add not_equal NPU op * add not_equal NPU op * add not_equal NPU op * add not_equal NPU op
Configuration menu - View commit details
-
Copy full SHA for 7e707ce - Browse repository at this point
Copy the full SHA 7e707ceView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4d6f8f2 - Browse repository at this point
Copy the full SHA 4d6f8f2View commit details -
Configuration menu - View commit details
-
Copy full SHA for e47d8a5 - Browse repository at this point
Copy the full SHA e47d8a5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7a38b76 - Browse repository at this point
Copy the full SHA 7a38b76View commit details -
add NPU support for zero_copy_tensor. (#34629)
* add NPU support for zero_copy_tensor. * revert unnesessary codes. * revert unnesessary codes.
Configuration menu - View commit details
-
Copy full SHA for a68709d - Browse repository at this point
Copy the full SHA a68709dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1d7b75d - Browse repository at this point
Copy the full SHA 1d7b75dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 911c859 - Browse repository at this point
Copy the full SHA 911c859View commit details -
remove boost::algorithm::ends_with ,boost macro and boost::lexical_ca…
…st apis (#34310) * replace boost::algorithm::ends_with with self define ends_with function * remove BOOST macro in certain operators * remove boost::lexical_cast * add test for string_helper * add more test case for string_helper * modify join_string func and test case * fix build_strategy_test failed bug * remove string_helper_test from parallel_UT_rule.py
Configuration menu - View commit details
-
Copy full SHA for bb7b4c0 - Browse repository at this point
Copy the full SHA bb7b4c0View commit details -
[Dy2Stat]Support Mixed Precision training in @to_static (#34562)
* Support Mixed Precision training in @to_static * fix block.vars logic * fix GPU training loss diff * remove unused code
Configuration menu - View commit details
-
Copy full SHA for a842828 - Browse repository at this point
Copy the full SHA a842828View commit details -
fix output dtype for paddle.sum (#34313)
* support bool dtype for paddle.sum
Configuration menu - View commit details
-
Copy full SHA for ff062a4 - Browse repository at this point
Copy the full SHA ff062a4View commit details -
[Dy2st]Integrated gast library to fix compatibility problem permanent…
…ly (#34556) * integrated gast library * integrated gast library * fix unittest and remove ast2.py * remove 'gast' from __all__ in __init__.py * add copyright in other files * fix copyright
Configuration menu - View commit details
-
Copy full SHA for a9ee383 - Browse repository at this point
Copy the full SHA a9ee383View commit details -
* first test version * add test exec; * add data transfer; test=develop * add new exec head; * add memcpy; test=develop * add python fetch * add new test * add graph node; test=develop * remove useless new executor test; test=develop * remove gperf dependency; test=develop * fix compile bugs; test=develop * remove useless code; test=develop * remove useless code; test=develop * add uni test; test=develop * polish code; test=develop * polish code; test=develop * add interpreter cmakefile; test=develop * remove useless code; test=develop
Configuration menu - View commit details
-
Copy full SHA for 012d12b - Browse repository at this point
Copy the full SHA 012d12bView commit details -
[NPU] Add relu6 and relu6_grad npu op (#34596)
* Add relu6 and relu6_grad npu op * fixed pre-commit-config.yaml * fixed for CI
Configuration menu - View commit details
-
Copy full SHA for 6839994 - Browse repository at this point
Copy the full SHA 6839994View commit details -
[NPU] Support npu op: (1) cos (2) cos_grad (#34573)
* [NPU] Support npu op: (1) cos (2) cos_grad * Update test_cos_op_npu.py * Update activation_op_npu.cc * rm redundant {1}
Configuration menu - View commit details
-
Copy full SHA for 6151ccd - Browse repository at this point
Copy the full SHA 6151ccdView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6c8a10a - Browse repository at this point
Copy the full SHA 6c8a10aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4a52c0c - Browse repository at this point
Copy the full SHA 4a52c0cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 68377b4 - Browse repository at this point
Copy the full SHA 68377b4View commit details
Commits on Aug 6, 2021
-
Fix ut test_pe_fix_op_run_order by using smaller model and batch size (…
…#34603) * fix ut * decrease gpu memory consumption * remove exclusive
Configuration menu - View commit details
-
Copy full SHA for 06651c4 - Browse repository at this point
Copy the full SHA 06651c4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 436a9f1 - Browse repository at this point
Copy the full SHA 436a9f1View commit details -
Configuration menu - View commit details
-
Copy full SHA for c91b1e0 - Browse repository at this point
Copy the full SHA c91b1e0View commit details -
Configuration menu - View commit details
-
Copy full SHA for ce73349 - Browse repository at this point
Copy the full SHA ce73349View commit details -
Configuration menu - View commit details
-
Copy full SHA for c16421c - Browse repository at this point
Copy the full SHA c16421cView commit details -
Support npu kernel for eye op (#34543)
* add eye npu op * remove useless headers * code style * Update eye_op_npu.cc * Update eye_op_npu.cc * remove useless code in test file * code style check * change Copyright to 2021 * add test case and do some fix * fix * update code * fix for CI * return * fix
Configuration menu - View commit details
-
Copy full SHA for 6e442e6 - Browse repository at this point
Copy the full SHA 6e442e6View commit details -
paddle/nn fix formula bugs (#34643)
* fix paddle.optimizer test=document_fix * fix paddle.optimizer test=document_fix * fix bugs in paddle.nn.functional document test=document_fix * fix bugs in paddle.nn.functional document test=document_fix * fix bugs in paddle.nn.functional document test=document_fix * fix bugs in paddle.nn.functional document test=document_fix * fix nn formula bugs test=document_fix * fix nn formula bugs test=document_fix * fix nn formula bugs test=document_fix
Configuration menu - View commit details
-
Copy full SHA for 0f19ac7 - Browse repository at this point
Copy the full SHA 0f19ac7View commit details -
* [NPU] add reduce_prod * [NPU] delete check_dygraph=False * [NPU] delete skipIf * add attrs support or check * [NPU] delete extra codes for test_reduce_max_op_npu * [NPU] add attr out_dtype
Configuration menu - View commit details
-
Copy full SHA for 47d81b0 - Browse repository at this point
Copy the full SHA 47d81b0View commit details -
[NPU] Support npu kernel for atan and atan_grad op, test=develop (#34658
Configuration menu - View commit details
-
Copy full SHA for cabfb4a - Browse repository at this point
Copy the full SHA cabfb4aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8a9dc5d - Browse repository at this point
Copy the full SHA 8a9dc5dView commit details -
Configuration menu - View commit details
-
Copy full SHA for fa16c21 - Browse repository at this point
Copy the full SHA fa16c21View commit details -
support kunlun black list and add kl1 op (#34605)
* support kunlun black list and add kl1 op * xpu_op_list add device_context dependence
Configuration menu - View commit details
-
Copy full SHA for 21beef9 - Browse repository at this point
Copy the full SHA 21beef9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4caf60d - Browse repository at this point
Copy the full SHA 4caf60dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 52e38a0 - Browse repository at this point
Copy the full SHA 52e38a0View commit details
Commits on Aug 7, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 46808af - Browse repository at this point
Copy the full SHA 46808afView commit details
Commits on Aug 8, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 338f9e0 - Browse repository at this point
Copy the full SHA 338f9e0View commit details
Commits on Aug 9, 2021
-
[NPU] add broadcast supporting for elementwise_add_op_npu (#34057)
* add broadcast supporting for elementwise_add * add broadcast supporting for elementwise_add * add more tests * remove the redundant code * update * fix place error in unittest * remove skip.If
Configuration menu - View commit details
-
Copy full SHA for b7355d8 - Browse repository at this point
Copy the full SHA b7355d8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0dff82c - Browse repository at this point
Copy the full SHA 0dff82cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 898acb1 - Browse repository at this point
Copy the full SHA 898acb1View commit details -
Configuration menu - View commit details
-
Copy full SHA for a3cc2d0 - Browse repository at this point
Copy the full SHA a3cc2d0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 56759ff - Browse repository at this point
Copy the full SHA 56759ffView commit details -
[NPU] add one_hot_op_npu and tests (#34258)
* add one_hot_op and tests * update * make code clear
Configuration menu - View commit details
-
Copy full SHA for 4c1ba73 - Browse repository at this point
Copy the full SHA 4c1ba73View commit details -
Configuration menu - View commit details
-
Copy full SHA for aab4d6e - Browse repository at this point
Copy the full SHA aab4d6eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3380778 - Browse repository at this point
Copy the full SHA 3380778View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7afd31b - Browse repository at this point
Copy the full SHA 7afd31bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8009257 - Browse repository at this point
Copy the full SHA 8009257View commit details -
Configuration menu - View commit details
-
Copy full SHA for e285258 - Browse repository at this point
Copy the full SHA e285258View commit details -
Configuration menu - View commit details
-
Copy full SHA for bf54534 - Browse repository at this point
Copy the full SHA bf54534View commit details
Commits on Aug 10, 2021
-
Fix error of HSigmoidLoss (#34719)
* Fix error of HSigmoidLoss * update unittest * update unittest
Configuration menu - View commit details
-
Copy full SHA for 3f32b73 - Browse repository at this point
Copy the full SHA 3f32b73View commit details -
Support npu kernel for expand_as_v2 op (#34620)
* Support npu kernel for expand_as_v2 op * mofify the registry data type name * fix test unit * fix npu compile error, test=develop * fix compute function Co-authored-by: qili93 <qili93@qq.com>
Configuration menu - View commit details
-
Copy full SHA for 202c240 - Browse repository at this point
Copy the full SHA 202c240View commit details -
Support npu kernel for tile op (#34606)
* Support npu kernel for tile op * modify according to the comments * fix compute function
Configuration menu - View commit details
-
Copy full SHA for 8a6aa59 - Browse repository at this point
Copy the full SHA 8a6aa59View commit details -
Configuration menu - View commit details
-
Copy full SHA for 84eb675 - Browse repository at this point
Copy the full SHA 84eb675View commit details -
* fix for div zero * fix err;test=develop * fix lod
Configuration menu - View commit details
-
Copy full SHA for d86c26d - Browse repository at this point
Copy the full SHA d86c26dView commit details -
copy boost/any.hpp to utils and replace boost::any with self defined …
…any (#34613) * add any.hpp to utils and replace boost::any with self defined paddle::any * add copy any.hpp to custom op depends * modify any.hpp include path * remove boost from setup.py.in * add copy any.hpp to custom op depends * move any.hpp to paddle/utils/ dirs * move any.h to extension/include direction * copy utils to right directions
Configuration menu - View commit details
-
Copy full SHA for 1289292 - Browse repository at this point
Copy the full SHA 1289292View commit details -
Configuration menu - View commit details
-
Copy full SHA for f30a5c4 - Browse repository at this point
Copy the full SHA f30a5c4View commit details -
Configuration menu - View commit details
-
Copy full SHA for a160379 - Browse repository at this point
Copy the full SHA a160379View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4f4662b - Browse repository at this point
Copy the full SHA 4f4662bView commit details -
Configuration menu - View commit details
-
Copy full SHA for cfd49ac - Browse repository at this point
Copy the full SHA cfd49acView commit details -
[NPU] Support op kernel for Fill constant batch size like op (#34721)
* fix npu compile error, test=develop * add fill constant batch size lilke op npu,test=develop Co-authored-by: qili93 <qili93@qq.com>
Configuration menu - View commit details
-
Copy full SHA for ed2641c - Browse repository at this point
Copy the full SHA ed2641cView commit details -
Support npu op fill_any_like (#34518)
* Support npu kernel for fill_any_like op * modify the description of exception * remove useless template element * remove useless decorator * fix the code format error
Configuration menu - View commit details
-
Copy full SHA for e8df322 - Browse repository at this point
Copy the full SHA e8df322View commit details -
[NPU] add squared_l2_norm squared_l2_norm_grad and tests (#34708)
* [NPU] add squared_l2_norm squared_l2_norm and tests * [NPU] replace Square&ReduceSumD with SquareSumV1
Configuration menu - View commit details
-
Copy full SHA for b64312f - Browse repository at this point
Copy the full SHA b64312fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8b9bd16 - Browse repository at this point
Copy the full SHA 8b9bd16View commit details -
Kernel primitives api (#34672)
添加Kernel primitives api: ReadData, WriteData ComputeFunctor
Configuration menu - View commit details
-
Copy full SHA for 8f9d573 - Browse repository at this point
Copy the full SHA 8f9d573View commit details -
[NPU] Support npu kernel for flatten_contiguous_range op, test=develop (
#34642) * fix npu compile error, test=develop * [NPU] Support npu kernel for flatten_contiguous_range op, test=develop * [NPU] Support npu kernel for flatten_contiguous_range op, test=develop * [NPU] Support npu kernel for flatten_contiguous_range op, test=develop * [NPU] Support npu kernel for flatten_contiguous_range op, test=develop * [NPU] Support npu kernel for flatten_contiguous_range op, test=develop * [NPU] Support npu kernel for flatten_contiguous_range op, test=develop * [NPU] Support npu kernel for flatten_contiguous_range op, test=develop * Update flatten_op_npu.cc * Update flatten_op_npu.cc Co-authored-by: qili93 <qili93@qq.com>
Configuration menu - View commit details
-
Copy full SHA for 79be842 - Browse repository at this point
Copy the full SHA 79be842View commit details
Commits on Aug 11, 2021
-
Add no need output to gc check list (#34754)
* add not used output var to gc_check_list; test=develop * add useless output to gc check list; test=develop
Configuration menu - View commit details
-
Copy full SHA for 17c1dae - Browse repository at this point
Copy the full SHA 17c1daeView commit details -
[NPU] Support NPU kernel for TopKV2 op (#34599)
* Add NPU kernel for TopKV2 op * deleted unnecessary cache file static_mode_white_list.cpython-37.pyc * A draft for error checking * A commit with accuracy error for float32 data * Modify codes according to the review comments * Modify codes according to the review comments
Configuration menu - View commit details
-
Copy full SHA for bb01b12 - Browse repository at this point
Copy the full SHA bb01b12View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6a9fac1 - Browse repository at this point
Copy the full SHA 6a9fac1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4d2994c - Browse repository at this point
Copy the full SHA 4d2994cView commit details -
Configuration menu - View commit details
-
Copy full SHA for addd5fc - Browse repository at this point
Copy the full SHA addd5fcView commit details -
Configuration menu - View commit details
-
Copy full SHA for 45af4f2 - Browse repository at this point
Copy the full SHA 45af4f2View commit details -
[NPU] Add exp and exp_grad npu op (#34612)
* add exp and exp_grad npu op * modify support register type * remove empty line and remove exp_grad support data type int/int64 * move exp and epx_grad kernel to activation_op_npu.cc, delete attrs * move code to activation_op_npu.cc
Configuration menu - View commit details
-
Copy full SHA for b5ec65e - Browse repository at this point
Copy the full SHA b5ec65eView commit details -
[HybridParallel] Support save/load for PipeLineParallel (#34768)
* add save/load for pipelineparallel * add save/load
Configuration menu - View commit details
-
Copy full SHA for 88f2f4a - Browse repository at this point
Copy the full SHA 88f2f4aView commit details -
add the basic apis for auto_parallel (#33804)
* add auto_parallel apis
lilong12 authoredAug 11, 2021 Configuration menu - View commit details
-
Copy full SHA for 3f962e7 - Browse repository at this point
Copy the full SHA 3f962e7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4d7af37 - Browse repository at this point
Copy the full SHA 4d7af37View commit details -
Add ext_tensor.slice() API (#34227)
* Add ext_tensor.slice() API, test=develop * Call Tensor::mutable_data first to fix bugs and add test for writing to sliced tensor * Fix unit test bug * Fix code format problem, test=develop * Fix code format problem * Fix code format problem * strengthen unit test * Use CustomTensorUtils::ShareDataFrom to simplify codes
Hao Lin authoredAug 11, 2021 Configuration menu - View commit details
-
Copy full SHA for 3f011d8 - Browse repository at this point
Copy the full SHA 3f011d8View commit details -
[NPU] add batch_norm_op_npu and test (#34056)
* add batch_norm_op_npu and tests * remove skip.If * fix bug
Configuration menu - View commit details
-
Copy full SHA for 9ed5db2 - Browse repository at this point
Copy the full SHA 9ed5db2View commit details -
[NPU] add reduce_mean_op_npu and test (#34053)
* add reduce_mean_op_npu and test * remove skip.If * update
Configuration menu - View commit details
-
Copy full SHA for f6fab55 - Browse repository at this point
Copy the full SHA f6fab55View commit details -
[NPU] add momentum_op_npu and test (#34082)
* add momentum_op_npu and test * update * fix hang
Configuration menu - View commit details
-
Copy full SHA for 9e3e08f - Browse repository at this point
Copy the full SHA 9e3e08fView commit details -
Configuration menu - View commit details
-
Copy full SHA for d45d311 - Browse repository at this point
Copy the full SHA d45d311View commit details -
[NPU] add while, read_from_array and write_to_array npu op (#34755)
* add while read_from_array write_to_array npu op * optimize unittest
Configuration menu - View commit details
-
Copy full SHA for 234c21a - Browse repository at this point
Copy the full SHA 234c21aView commit details -
Configuration menu - View commit details
-
Copy full SHA for fc537d4 - Browse repository at this point
Copy the full SHA fc537d4View commit details -
[Paddle TRT]fix_fc_int8_convert; fix_reshape_convert (#34787)
* fix_fc_reshape_convert * fix
Configuration menu - View commit details
-
Copy full SHA for 3429c04 - Browse repository at this point
Copy the full SHA 3429c04View commit details -
set_value_grad
propagate gradients toInput
andTensorValue
(#3……4304) * add set_value_grad op * add unittest. * polish unittest. * polish code. * support cuda kernel * polish code according to CI * polish code. * polish code * remove *.pyc * polish code. * add unittest to improve coverage. * polish code.
Configuration menu - View commit details
-
Copy full SHA for 9d02313 - Browse repository at this point
Copy the full SHA 9d02313View commit details -
[AMP] add state_dict and load_state_dict and unittest for class GradS…
…caler (#34300) * add state_dict and load_state_dict and unittest for class GradScaler * refine unittest for coverage of load_state_dict * refine comments of code-block * refine some comments * refine state_dict code and unittest * add #require gpu, xpu for GradScaler get/set example code * add #require gpu, xpu for GradScaler get/set example code * refine example code * refine unittest for state_dict * refine unittest for state_dict * fix bug of DataLoader in TestGradScalerStateDict * add flag FLAGS_cudnn_deterministic
Configuration menu - View commit details
-
Copy full SHA for 99f8f5c - Browse repository at this point
Copy the full SHA 99f8f5cView commit details -
[oneDNN] Fix to issue #34554 (#34623)
* - Added softmax without caching * - Binary is no longer manually cached * - Activation onednn caching removed * - Removed manual caching of activation * - modified UT * - fix * - fix * - fixes to building * - fix * - fix * - fix to UT * - Faulty UT workaround * - approval workaround * - Fixes after review * - compilation fixes * - more lint fixes * - more fixes after review * - fixes after another round of review
Configuration menu - View commit details
-
Copy full SHA for 0a5c99e - Browse repository at this point
Copy the full SHA 0a5c99eView commit details
Commits on Aug 12, 2021
-
add det_mv3_db & LeViT test case in pr-ci-inference (#34803)
* add det_mv3_db & LeViT test case in pr-ci-inference * fix LeViT model dir bugs * fix grammar error
Configuration menu - View commit details
-
Copy full SHA for 1c31d9d - Browse repository at this point
Copy the full SHA 1c31d9dView commit details -
[NPU] Support npu op expand_v2 and expand_v2_grad (#34764)
* [NPU] Support npu op expand_v2 and expand_v2_grad * [NPU] Support npu op expand_v2 and expand_v2_grad * [NPU] Support npu op expand_v2 and expand_v2_grad * update test_expand_v2_op_npu.py * update test_expand_v2_op_npu.py * modify expand_v2_op_npu.cc * modify expand_v2_op_npu.cc
Configuration menu - View commit details
-
Copy full SHA for bc543e3 - Browse repository at this point
Copy the full SHA bc543e3View commit details -
Configuration menu - View commit details
-
Copy full SHA for cfa6913 - Browse repository at this point
Copy the full SHA cfa6913View commit details -
[HybridParallel]Add Recompute for PipeLineParallel (#34607)
* add recompute for pp * add recompute offload * add recompute partition
Configuration menu - View commit details
-
Copy full SHA for 589d13c - Browse repository at this point
Copy the full SHA 589d13cView commit details -
Fix safety-bug of functional.linear (#34696)
* Fix safety-bug of functional.linear * Fix safety-bug of functional.linear * Fix safety-bug of functional.linear * Fix safety-bug of functional.linear
Configuration menu - View commit details
-
Copy full SHA for 0e28c8b - Browse repository at this point
Copy the full SHA 0e28c8bView commit details -
This PR adds fused transformer related files defining c interface including class, function etc..
Configuration menu - View commit details
-
Copy full SHA for 016cc56 - Browse repository at this point
Copy the full SHA 016cc56View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6326c3e - Browse repository at this point
Copy the full SHA 6326c3eView commit details -
Configuration menu - View commit details
-
Copy full SHA for dffb0b2 - Browse repository at this point
Copy the full SHA dffb0b2View commit details -
Configuration menu - View commit details
-
Copy full SHA for dc62a22 - Browse repository at this point
Copy the full SHA dc62a22View commit details -
Remove incorrect signal error stack trace (#34842)
* remove unmatched signal error stack * fix error writing for cond
Configuration menu - View commit details
-
Copy full SHA for 572adcc - Browse repository at this point
Copy the full SHA 572adccView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3f71e8d - Browse repository at this point
Copy the full SHA 3f71e8dView commit details
Commits on Aug 13, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 2164ad6 - Browse repository at this point
Copy the full SHA 2164ad6View commit details -
Configuration menu - View commit details
-
Copy full SHA for e92f038 - Browse repository at this point
Copy the full SHA e92f038View commit details -
[Bug-Fix]fix bug of py36 import utils (#34873)
* fix bug of py36 import
Configuration menu - View commit details
-
Copy full SHA for 507ea06 - Browse repository at this point
Copy the full SHA 507ea06View commit details -
Configuration menu - View commit details
-
Copy full SHA for 17a9976 - Browse repository at this point
Copy the full SHA 17a9976View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5b86b99 - Browse repository at this point
Copy the full SHA 5b86b99View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8bc4d85 - Browse repository at this point
Copy the full SHA 8bc4d85View commit details -
Add EmptyGradOpMaker CI Approval (#34810)
* Add EmptyGradOpMaker CI Approval, test=develop * Fix typo in echo_line
Hao Lin authoredAug 13, 2021 Configuration menu - View commit details
-
Copy full SHA for ac56d54 - Browse repository at this point
Copy the full SHA ac56d54View commit details -
Configuration menu - View commit details
-
Copy full SHA for f421741 - Browse repository at this point
Copy the full SHA f421741View commit details -
Bug fix : Can't load multiple modules of custom c++ op (#34505)
* Fix a bug : can't load more than one custom op module * Fix a bug : can't load more than one custom op module * add test for load multiple modules of custom c++ op * add config for Coverage CI
Configuration menu - View commit details
-
Copy full SHA for fc6b4a5 - Browse repository at this point
Copy the full SHA fc6b4a5View commit details -
Configuration menu - View commit details
-
Copy full SHA for ff4bdac - Browse repository at this point
Copy the full SHA ff4bdacView commit details -
* OP dot: refactor CPU kernels and get better loop performance. * Minor fix on code format. * Fixed minor errors. * Add new API: einsum * Update the Einsum unit test. One case failed with matmul_v2, where the dtype is int64: a = np.arange(2 * 3 * 1).reshape(2, 3, 1) b = np.arange(1) paddle.einsum("...i, ...i", a, b) * Test cases in test_einsum test floating point dtypes only. As of now Paddle only supports float/double dtypes in matmul, which is one of building blocks of this Einsum implementation. We decide not to test einsum against other dtypes. * Polish format. * More formatting. * Format... * Einsum: improve test coverage. * Einsum: bug fixes and more testcases for testing error messages * Einsum: fix format.. * Einsum: fixed typo and format. * Einsum: format again... * Einsum: applied suggested changes. * Einsum API: improve API documentation. * Einsum API: apply suggested changes. * Einsum API: Add dygraph only note. * Einsum API: Add dygraph only note. * Einsum API: fixed unittest.
Configuration menu - View commit details
-
Copy full SHA for 8c8667f - Browse repository at this point
Copy the full SHA 8c8667fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2cd05d5 - Browse repository at this point
Copy the full SHA 2cd05d5View commit details
Commits on Aug 16, 2021
-
[NPU] add p_norm_op_npu (#34695)
* add p_norm_op_npu * remove p_norm_grad op * update
Configuration menu - View commit details
-
Copy full SHA for 7316018 - Browse repository at this point
Copy the full SHA 7316018View commit details -
Op-benchmark CI cpu and gpu (#34631)
* notest;pm-op-benchmark * notest;pm-op-benchmark * notest;pm-op-benchmark * notest;pm-op-benchmark * notest;pm-op-benchmark * notest;pm-op-benchmark * notest;test=op_benchmark * notest;test=op_benchmark * notest;op_benchmark * notest;op_benchmark * notest;op_benchmark * notest;op_benchmark * notest;op_benchmark * notest;op_benchmark * notest;test=op_benchmark * notest;op_benchmark * notest;op_benchmark * notest;op_benchmark * fix * fix
Configuration menu - View commit details
-
Copy full SHA for 8fb17fc - Browse repository at this point
Copy the full SHA 8fb17fcView commit details -
Configuration menu - View commit details
-
Copy full SHA for 34d188b - Browse repository at this point
Copy the full SHA 34d188bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 28279f6 - Browse repository at this point
Copy the full SHA 28279f6View commit details -
Configuration menu - View commit details
-
Copy full SHA for e84b2e9 - Browse repository at this point
Copy the full SHA e84b2e9View commit details -
Configuration menu - View commit details
-
Copy full SHA for ad6c3b9 - Browse repository at this point
Copy the full SHA ad6c3b9View commit details -
Support npu op hard_swish and hard_swish_grad (#34608)
* Support NPU OP hard_swish and hard_swish_grad * Support NPU OP hard_swish and hard_swish_grad * add the unittest to compare the result between npu ans cpu * format the prompt of exception * replace Min and Max op by ClipByValue op * fix the precision problem for fp16 * Using HardtanhGrad to improve performace
Configuration menu - View commit details
-
Copy full SHA for fd92d94 - Browse repository at this point
Copy the full SHA fd92d94View commit details -
Configuration menu - View commit details
-
Copy full SHA for dc439a1 - Browse repository at this point
Copy the full SHA dc439a1View commit details -
support margin loss (arcface, cosface, sphereface) for single GPU and…
… cross GPUs (#34247) * support margin loss (arcface, cosface, sphereface)
Configuration menu - View commit details
-
Copy full SHA for b0cb414 - Browse repository at this point
Copy the full SHA b0cb414View commit details -
[amp] dygraph amp support param_group (#34899)
* dygraph amp support param_group * remove unused code * fix doc
Configuration menu - View commit details
-
Copy full SHA for e29c2d1 - Browse repository at this point
Copy the full SHA e29c2d1View commit details -
add unique_consecutive_op (#34334)
* add unique_consecutive_op * add unique_consecutive_op * add unique_consecutive_op * add unique_consecutive_op * add unique_consecutive_op * add unique_consecutive_op * add unique_consecutive_op * add unique_consecutive_op * remove unity build * add unique_consecutive op * add unique_consecutive op * add enable static * add noqa * add space line * add default case. * add comma * add space line * modify unique_consecutive unittest * optimize ut coverage * rebase develop * improve coverage * update en docs * update en docs * update en docs * update en docs * update en docs * update en doc
Configuration menu - View commit details
-
Copy full SHA for 875cfd5 - Browse repository at this point
Copy the full SHA 875cfd5View commit details -
[NPU] Support NPU kernel for nearest_interp and nearest_interp_grad op (
#34881) * Add NPU kernel for nearest_interp op * Add grad op * Modify codes according to the review comments * Modify codes according to the review comments
Configuration menu - View commit details
-
Copy full SHA for e4e8cc9 - Browse repository at this point
Copy the full SHA e4e8cc9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3b9f040 - Browse repository at this point
Copy the full SHA 3b9f040View commit details -
Configuration menu - View commit details
-
Copy full SHA for f6d8ab5 - Browse repository at this point
Copy the full SHA f6d8ab5View commit details -
[oneDNN] Fix to 34554 (same as previous PR but should build with GPU) (…
…#34859) * - Added softmax without caching * - Binary is no longer manually cached * - Activation onednn caching removed * - Removed manual caching of activation * - modified UT * - fix * - fix * - fixes to building * - fix * - fix * - fix to UT * - Faulty UT workaround * - approval workaround * - Fixes after review * - compilation fixes * - more lint fixes * - more fixes after review * - fixes after another round of review * - hopefully compilation fix - compilation fix
Configuration menu - View commit details
-
Copy full SHA for 9cb6565 - Browse repository at this point
Copy the full SHA 9cb6565View commit details -
Fix elementwise_add quantization (#34820)
* Remove force_fp32_output from elementwise_add quantization * Fix cpu_quantize_placement test * Review related changes
Configuration menu - View commit details
-
Copy full SHA for ae80df9 - Browse repository at this point
Copy the full SHA ae80df9View commit details -
Configuration menu - View commit details
-
Copy full SHA for d028214 - Browse repository at this point
Copy the full SHA d028214View commit details -
Change the invoking method of settiem by Ellipsis and None index from…
… numpy to set_value op (#34911) * Change invoking mathod of the settiem by Ellipsis and None index from numpy to set_value op * add none_axes into attr of set_value_op in dygraph mode
Configuration menu - View commit details
-
Copy full SHA for 2e30134 - Browse repository at this point
Copy the full SHA 2e30134View commit details -
[NPU] Add size npu op (#34636)
* add size npu op * modify support data type * no longer use NPU size OP * remove useless comments, add test case * fix copyright, remove useless include
Configuration menu - View commit details
-
Copy full SHA for 4981894 - Browse repository at this point
Copy the full SHA 4981894View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6b4b9fe - Browse repository at this point
Copy the full SHA 6b4b9feView commit details -
* change bilinear thread for nano and tx2 * change bilinear thread for nano and tx2
feng_shuai authoredAug 16, 2021 Configuration menu - View commit details
-
Copy full SHA for 2a4ed08 - Browse repository at this point
Copy the full SHA 2a4ed08View commit details -
[NPU] Support npu op:(1)arg_min (2)arg_max (#34867)
* [NPU] Support npu op:(1)arg_min (2)arg_max * Modify and add unit test cases * Modify unit test cases
Configuration menu - View commit details
-
Copy full SHA for b1cc4a4 - Browse repository at this point
Copy the full SHA b1cc4a4View commit details -
Fix typos in English docs for diag and diagflat. (#34869)
* Fix typos in english docs for diag and diagflat.
Configuration menu - View commit details
-
Copy full SHA for 35ef418 - Browse repository at this point
Copy the full SHA 35ef418View commit details -
Configuration menu - View commit details
-
Copy full SHA for ed6624a - Browse repository at this point
Copy the full SHA ed6624aView commit details
Commits on Aug 17, 2021
-
fix a bug in nlp: text_matching/sentence_transformers when last dim i…
…s 1 and reduce mid dim (#34941)
Configuration menu - View commit details
-
Copy full SHA for 181f7ce - Browse repository at this point
Copy the full SHA 181f7ceView commit details -
fix drop_last not work on IterableDataset (#34801)
* fix drop_last not work in IterableDataset. test=develop
Configuration menu - View commit details
-
Copy full SHA for 1614608 - Browse repository at this point
Copy the full SHA 1614608View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5de576b - Browse repository at this point
Copy the full SHA 5de576bView commit details -
Add some passes which can be applied to Program (#34730)
* add inplace passes and tests * update * fix use_cuda undefined fix compile error of op compat * add more ut * fix CPU CI error * check adam unique * fix mac/windows ci, improve coverage * fix ci error * follow weihang's comment * fix BlockDesc::MoveFrom * follow qiuliang's comment * update * follow huihuang's comments
Configuration menu - View commit details
-
Copy full SHA for 8046e33 - Browse repository at this point
Copy the full SHA 8046e33View commit details -
Align CTC grad scale same with ESPNet (#34729)
* dygraph support more ctc grad scale * scale for 1.x * fix unitest * fix unitest * format code * fix unittest * fix log info * unittest cov * fix format;notest,test=cpu,coverage * skip ctc_loss egs;test=cpu * warpctc grad cov;test=coverage * add dygraph test;test=coverage * format;test=cpu,coverage * format;test=cpu * add api compat;test=cpu * add cpu test * rename * rename * fix * fix test * format * eigen cpu * eigen gpu grad pass * cuda gpu pass * format * fix ci
Configuration menu - View commit details
-
Copy full SHA for 10f9644 - Browse repository at this point
Copy the full SHA 10f9644View commit details -
add mkl multi-thread test cases in PR-CI-INFERENCE (#34946)
* add mkl multi-thread test cases * fix codestyle * fix codestyle & enable ernie mkl test
Configuration menu - View commit details
-
Copy full SHA for 9d4f00b - Browse repository at this point
Copy the full SHA 9d4f00bView commit details -
[bug fix] fix unfold negative_size_param (#34943)
* [bug fix] fix unfold negative_size_param
Configuration menu - View commit details
-
Copy full SHA for 8ef1bf8 - Browse repository at this point
Copy the full SHA 8ef1bf8View commit details -
Modify the name of class in unittest with the same name (#34952)
* polish unittest. * polish code * polish code
Configuration menu - View commit details
-
Copy full SHA for 01a3a2e - Browse repository at this point
Copy the full SHA 01a3a2eView commit details -
add exclude rules of pre-commit for paddle/utils and third_party (#34880
Configuration menu - View commit details
-
Copy full SHA for 7b3295a - Browse repository at this point
Copy the full SHA 7b3295aView commit details -
[oneDNN ] disabling more ops caching (#34830)
* - disabled caching of layer norm - fix in compilation - compilation fix - transpose caching disabled - compilation fix - more compilation fixes - sum caching disabled - compilation fix * - LRN with disabled cache * lint fixes
Configuration menu - View commit details
-
Copy full SHA for f1c1d9e - Browse repository at this point
Copy the full SHA f1c1d9eView commit details -
Copy boost optional to Paddle (#34780)
* copy boost optional.hpp to paddle * copy boost optional.hpp to paddle * move directions * del fluid/utils * modify .hpp to .h * move directions * modify to paddle::optional * add modification description * format code stype for the files in paddle/utils * format code stype
Configuration menu - View commit details
-
Copy full SHA for 9be4144 - Browse repository at this point
Copy the full SHA 9be4144View commit details -
Update op-benchmark CI (#34962)
* fix op-benchmark * test=document_fix
Configuration menu - View commit details
-
Copy full SHA for 690f583 - Browse repository at this point
Copy the full SHA 690f583View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1ef2185 - Browse repository at this point
Copy the full SHA 1ef2185View commit details -
Configuration menu - View commit details
-
Copy full SHA for b4474fb - Browse repository at this point
Copy the full SHA b4474fbView commit details
Commits on Aug 18, 2021
-
[Paddle-TRT] unitest_quant_dequant (#34929)
* unitest_quant_dequant * fix * fix * deleted: test_trt_quant_conv2d_dequant_fuse_pass.py * fix
Configuration menu - View commit details
-
Copy full SHA for c7070cb - Browse repository at this point
Copy the full SHA c7070cbView commit details -
support class center sample of PartialFC (#34106)
* support class center sample of PartialFC
Configuration menu - View commit details
-
Copy full SHA for 100db44 - Browse repository at this point
Copy the full SHA 100db44View commit details -
[NPU] add retry on HcclGetRootInfo to fix "bind fail" (#34977)
* add retry for HcclGetRootInfo * refine code * reduce retry interval
Configuration menu - View commit details
-
Copy full SHA for 52a7b0c - Browse repository at this point
Copy the full SHA 52a7b0cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 12bf046 - Browse repository at this point
Copy the full SHA 12bf046View commit details -
Configuration menu - View commit details
-
Copy full SHA for a137371 - Browse repository at this point
Copy the full SHA a137371View commit details -
lilong12 authored
Aug 18, 2021 Configuration menu - View commit details
-
Copy full SHA for 2e9a31e - Browse repository at this point
Copy the full SHA 2e9a31eView commit details -
Add function to disable paddle signal handler (#34577)
* Add function to disable paddle signal handler Paddle used google::InstallFaultSignalHandler to handle selected system signals, mainly for debugging and bug report purposes. However, this can be conflicted with other python packages whoever captures similar signals. Such python package involves tvm and more To resolve this issue, we support a function to disable signal handler * Remove signal test from WIN32 platform * Remove redundant return from disable_signal_handler() function * Add detailed messages to en_doc
Configuration menu - View commit details
-
Copy full SHA for dd533dd - Browse repository at this point
Copy the full SHA dd533ddView commit details -
[CustomOp] Fix ext_tensor.cast failed bug (#34884)
* fix ext_tensor.cast failed bug * remove useless deps * fix windows cmake failed * try to fix windows make failed * fix make error on windwos
Configuration menu - View commit details
-
Copy full SHA for 4d88cdb - Browse repository at this point
Copy the full SHA 4d88cdbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8967a66 - Browse repository at this point
Copy the full SHA 8967a66View commit details -
[CPU-PSLIB] Add consistency insepection of use_var_list and data_gene…
…rator data, test=develop (#34463)
Configuration menu - View commit details
-
Copy full SHA for 209075a - Browse repository at this point
Copy the full SHA 209075aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 51939c8 - Browse repository at this point
Copy the full SHA 51939c8View commit details -
[Hybrid Performance] Move the cast op of AMP which cast fp32 param to…
… fp16 param to the optimizer (#34965)
Configuration menu - View commit details
-
Copy full SHA for a9673b4 - Browse repository at this point
Copy the full SHA a9673b4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 40f6273 - Browse repository at this point
Copy the full SHA 40f6273View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1b71a71 - Browse repository at this point
Copy the full SHA 1b71a71View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1b747de - Browse repository at this point
Copy the full SHA 1b747deView commit details -
code refactoring for new executor (#34970)
* code refactoring, test=develop * refine, test=develop * refine, test=develop * refine, test=develop
Configuration menu - View commit details
-
Copy full SHA for 40d4d83 - Browse repository at this point
Copy the full SHA 40d4d83View commit details -
* fix pad outliers err * fix pad api input type and doc * fix example of pad * add unittest for pad3d * fix unittest * fix error format * fix pad doc
Configuration menu - View commit details
-
Copy full SHA for 248e27b - Browse repository at this point
Copy the full SHA 248e27bView commit details -
Add NPU kernel for norm Op: float16 and float32 (#34609)
* Add NPU kernel for norm Op: float16 and float32 * fix code for code review * fix for code review * add type for paddle_throw * remove unnecessary head file.\nAdd more testcase * remove a broadcast
Configuration menu - View commit details
-
Copy full SHA for 755c8a1 - Browse repository at this point
Copy the full SHA 755c8a1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9cbba97 - Browse repository at this point
Copy the full SHA 9cbba97View commit details