-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
merge develop to gpugraph #77
Commits on Jul 15, 2022
-
Configuration menu - View commit details
-
Copy full SHA for 1f7f719 - Browse repository at this point
Copy the full SHA 1f7f719View commit details -
Fix random seed for several unit tests (PaddlePaddle#44135)
* Fix test_functional_conv2d_transpose random seed * Fix random seed and use np.testing * Fix random seed for test_lu_unpack_op * Fix test_autograd_functional_dynamic random seed
Configuration menu - View commit details
-
Copy full SHA for f913083 - Browse repository at this point
Copy the full SHA f913083View commit details -
Configuration menu - View commit details
-
Copy full SHA for d2e59e1 - Browse repository at this point
Copy the full SHA d2e59e1View commit details -
add fused token prune op and plugin (PaddlePaddle#44281)
* add fused token prune op and plugin
Configuration menu - View commit details
-
Copy full SHA for d881d69 - Browse repository at this point
Copy the full SHA d881d69View commit details -
Configuration menu - View commit details
-
Copy full SHA for 676d0b4 - Browse repository at this point
Copy the full SHA 676d0b4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9181a99 - Browse repository at this point
Copy the full SHA 9181a99View commit details -
[IPU] add custom-op UTs 0/N (PaddlePaddle#44328)
* add custom-op UTs 0 * add authors Co-authored-by: Allen Guo <alleng@graphcore.ai> Co-authored-by: Zhixin Yao <zhixiny@graphcore.ai> Co-authored-by: Zhaorui Chen <zhaoruic@graphcore.ai> Co-authored-by: Zhixin Yao <zhixiny@graphcore.ai> Co-authored-by: Zhaorui Chen <zhaoruic@graphcore.ai>
Configuration menu - View commit details
-
Copy full SHA for c8e26fe - Browse repository at this point
Copy the full SHA c8e26feView commit details -
[IPU] add custom-op UTs 1/N (PaddlePaddle#44329)
* add custom-op UTs 1 * add authors Co-authored-by: Allen Guo <alleng@graphcore.ai> Co-authored-by: Zhixin Yao <zhixiny@graphcore.ai> Co-authored-by: Zhaorui Chen <zhaoruic@graphcore.ai> * update url Co-authored-by: Zhixin Yao <zhixiny@graphcore.ai> Co-authored-by: Zhaorui Chen <zhaoruic@graphcore.ai>
Configuration menu - View commit details
-
Copy full SHA for 2c8c841 - Browse repository at this point
Copy the full SHA 2c8c841View commit details -
support KL2 multi-card training, *test=kunlun (PaddlePaddle#43889)
* update xccl lib * use separate streams for compute/comm on XPU * add broadcast op to xpu2_op_list
Configuration menu - View commit details
-
Copy full SHA for 270f25e - Browse repository at this point
Copy the full SHA 270f25eView commit details -
Remove auto to_pascal_case for args in op generator (PaddlePaddle#44350)
* remove auto to_pascal_case for args in op generator * fix yaml config
Configuration menu - View commit details
-
Copy full SHA for 0dafbb0 - Browse repository at this point
Copy the full SHA 0dafbb0View commit details -
Standard sparse conv name (PaddlePaddle#44353)
zhangkaihuo authoredJul 15, 2022 Configuration menu - View commit details
-
Copy full SHA for 8744383 - Browse repository at this point
Copy the full SHA 8744383View commit details -
[Eager] eager variable back sync (PaddlePaddle#44343)
* eager variable back sync
Configuration menu - View commit details
-
Copy full SHA for 13d01e6 - Browse repository at this point
Copy the full SHA 13d01e6View commit details -
[ Phi Kernel ] Transfer as_real to phi. (PaddlePaddle#44263)
* transfer as_real to phi * fix erros * blocking: True -> False
Configuration menu - View commit details
-
Copy full SHA for 068f48d - Browse repository at this point
Copy the full SHA 068f48dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4a4a036 - Browse repository at this point
Copy the full SHA 4a4a036View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6f7550e - Browse repository at this point
Copy the full SHA 6f7550eView commit details
Commits on Jul 16, 2022
-
[Phi] Migrate solve kernel to phi (PaddlePaddle#44363)
* draft version * draft version * draft version * migrate solve kernel to phi * polish * polish * re useless header file, fix a bug in grad_kernel_impl * add header file in need
Configuration menu - View commit details
-
Copy full SHA for c0a7830 - Browse repository at this point
Copy the full SHA c0a7830View commit details
Commits on Jul 18, 2022
-
Configuration menu - View commit details
-
Copy full SHA for 876e2ff - Browse repository at this point
Copy the full SHA 876e2ffView commit details -
Configuration menu - View commit details
-
Copy full SHA for fd6dcdf - Browse repository at this point
Copy the full SHA fd6dcdfView commit details -
[Paddle-TRT] reshape fill_constant (PaddlePaddle#44314)
* reshape fill_constant * commit * commit
Configuration menu - View commit details
-
Copy full SHA for b7db845 - Browse repository at this point
Copy the full SHA b7db845View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0fd974b - Browse repository at this point
Copy the full SHA 0fd974bView commit details -
[Paddle-TRT] remove useless code in fc (PaddlePaddle#44382)
* remove useless code in fc
Configuration menu - View commit details
-
Copy full SHA for 5c29173 - Browse repository at this point
Copy the full SHA 5c29173View commit details -
[Paddle-TRT] Fix cast (PaddlePaddle#44312)
* fix_cast * fix_cast * commit
Configuration menu - View commit details
-
Copy full SHA for 7a85ced - Browse repository at this point
Copy the full SHA 7a85cedView commit details -
Configuration menu - View commit details
-
Copy full SHA for 39e5dd2 - Browse repository at this point
Copy the full SHA 39e5dd2View commit details -
Enable inference multi stream ci test (PaddlePaddle#44275)
* test * update
Configuration menu - View commit details
-
Copy full SHA for 3c074de - Browse repository at this point
Copy the full SHA 3c074deView commit details -
Configuration menu - View commit details
-
Copy full SHA for 74412df - Browse repository at this point
Copy the full SHA 74412dfView commit details -
add xpu resnet_unit (PaddlePaddle#44297)
* add xpu resnet_unit *test=kunlun * tmp *test=kunlun
Configuration menu - View commit details
-
Copy full SHA for 02e9453 - Browse repository at this point
Copy the full SHA 02e9453View commit details -
Configuration menu - View commit details
-
Copy full SHA for b83138d - Browse repository at this point
Copy the full SHA b83138dView commit details -
[Plugin] Fix Custom device in eager mode, test=develop (PaddlePaddle#…
…43952) * [Plugin] Fix Custom device in eager mode, test=develop * update test case, test=develop * update ut for coverage, test=develop
Configuration menu - View commit details
-
Copy full SHA for 04e5558 - Browse repository at this point
Copy the full SHA 04e5558View commit details -
Configuration menu - View commit details
-
Copy full SHA for fbedf77 - Browse repository at this point
Copy the full SHA fbedf77View commit details -
fix typos in template for codegen of operators (PaddlePaddle#44364)
Feiyu Chan authoredJul 18, 2022 Configuration menu - View commit details
-
Copy full SHA for 4c1e77d - Browse repository at this point
Copy the full SHA 4c1e77dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1d12832 - Browse repository at this point
Copy the full SHA 1d12832View commit details -
Configuration menu - View commit details
-
Copy full SHA for b2224e6 - Browse repository at this point
Copy the full SHA b2224e6View commit details -
Configuration menu - View commit details
-
Copy full SHA for c6bf881 - Browse repository at this point
Copy the full SHA c6bf881View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3f70b1d - Browse repository at this point
Copy the full SHA 3f70b1dView commit details -
Configuration menu - View commit details
-
Copy full SHA for dd0a07f - Browse repository at this point
Copy the full SHA dd0a07fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 08cada9 - Browse repository at this point
Copy the full SHA 08cada9View commit details
Commits on Jul 19, 2022
-
[new api] add new api paddle.vision.ops.distribute_fpn_proposals (Pad…
…dlePaddle#43736) * add distribute_fpn_proposals * change to new dygraph * fix doc and example code * change fluid impl to current version
Configuration menu - View commit details
-
Copy full SHA for 130c108 - Browse repository at this point
Copy the full SHA 130c108View commit details -
Configuration menu - View commit details
-
Copy full SHA for d5f0ed4 - Browse repository at this point
Copy the full SHA d5f0ed4View commit details -
[Paddle-TRT] Shape sum fix scale (PaddlePaddle#44394)
* shape sum * add shape, sum trt layer
Configuration menu - View commit details
-
Copy full SHA for 6fb2958 - Browse repository at this point
Copy the full SHA 6fb2958View commit details -
[Phi] Migrate infermeta and add yaml for solve op (PaddlePaddle#44379)
* migrate solve kernel to phi * re useless header file, fix a bug in grad_kernel_impl * add header file in need * add yaml for solve op * fix solve_sig.cc ArgumentMapping and update tests case * disable legacy dygraph check in op_test * rm solve_op.cc / solve_sig.cc and migrate yaml config * Update op_test.py disable legacy dygraph check when check_eager is True
Configuration menu - View commit details
-
Copy full SHA for 5dfb87d - Browse repository at this point
Copy the full SHA 5dfb87dView commit details -
add labels for infer ut (PaddlePaddle#44279)
* add labels for infer ut * add RUN_TYPE=INFER for cpp ut * fix formaterror * update
Configuration menu - View commit details
-
Copy full SHA for fea05f1 - Browse repository at this point
Copy the full SHA fea05f1View commit details -
Configuration menu - View commit details
-
Copy full SHA for a8148ce - Browse repository at this point
Copy the full SHA a8148ceView commit details -
Configuration menu - View commit details
-
Copy full SHA for d4bb2ad - Browse repository at this point
Copy the full SHA d4bb2adView commit details -
Rename BOOST_GET macros (PaddlePaddle#44368)
* Rename BOOST_GET macros * Fix conflicts
Configuration menu - View commit details
-
Copy full SHA for 4b085c5 - Browse repository at this point
Copy the full SHA 4b085c5View commit details -
[new API] add paddle.vision.ops.generate_proposals (PaddlePaddle#43611)
* add generate_proposals into paddle.vision * remove class api * im_info -> img_size * change fluid impl to current version
Configuration menu - View commit details
-
Copy full SHA for 2a2bc0b - Browse repository at this point
Copy the full SHA 2a2bc0bView commit details -
Configuration menu - View commit details
-
Copy full SHA for a8680f5 - Browse repository at this point
Copy the full SHA a8680f5View commit details -
Added pad3d and pad2d FP32 FWD oneDNN kernels (PaddlePaddle#43990)
* Piotrek's changes for pad3d * my changes * first version of pad3d, single copy, unnecessary reads * optimized pad3d kernel * test upadte * removed magic numbers * added support for pad2d * reverted two files * reverted one old change * added support for Paddings tensor * CI fix * CI fix * fixed timeout of tests * fixed typo * changes to GetKernelTypeForVar * Revert "changes to GetKernelTypeForVar" This reverts commit 4691061. * added AsExtra() to pad2d Co-authored-by: Piotr Paturej <piotr.paturej@intel.com>
Configuration menu - View commit details
-
Copy full SHA for 2792b8d - Browse repository at this point
Copy the full SHA 2792b8dView commit details -
add save_cache/patch (PaddlePaddle#44420)
* add save_cache/patch * add pybind * remove pybind * remove const_cast * add fleet
Configuration menu - View commit details
-
Copy full SHA for f382eb0 - Browse repository at this point
Copy the full SHA f382eb0View commit details -
Standard name of sparse pool (PaddlePaddle#44344)
zhangkaihuo authoredJul 19, 2022 Configuration menu - View commit details
-
Copy full SHA for 9e30722 - Browse repository at this point
Copy the full SHA 9e30722View commit details -
move eig operator from fluid to phi (PaddlePaddle#44398)
* move eig operator from fluid to phi * add eig_grad unitest, upgrade IsComplexType() from fluid to phi
Configuration menu - View commit details
-
Copy full SHA for 3788f5e - Browse repository at this point
Copy the full SHA 3788f5eView commit details -
[Phi]Move angle op to phi (PaddlePaddle#44393)
* Move angle op to phi * Replace mutable_data using Alloc * Remove some include * Try to fix windows ci error * include math.h to fix windows ci error * Fix kernel name * Move angle_grad infershape
Configuration menu - View commit details
-
Copy full SHA for 547075e - Browse repository at this point
Copy the full SHA 547075eView commit details -
[Eager]release gil when run backward (PaddlePaddle#44433)
* release gil when run backward
Configuration menu - View commit details
-
Copy full SHA for 4e1f769 - Browse repository at this point
Copy the full SHA 4e1f769View commit details -
compile phi/backends into one static library (PaddlePaddle#44373)
* compile into one static library * fix xpu compile * fix xpu compile * fix inference compile * fix inference compile * add custom test * revert one file
Configuration menu - View commit details
-
Copy full SHA for 1047cb1 - Browse repository at this point
Copy the full SHA 1047cb1View commit details
Commits on Jul 20, 2022
-
[IPU] Add more Ops (PaddlePaddle#44414)
* [IPU] Add more Ops * update boost API
yaozhixin authoredJul 20, 2022 Configuration menu - View commit details
-
Copy full SHA for 7daae98 - Browse repository at this point
Copy the full SHA 7daae98View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3ed5328 - Browse repository at this point
Copy the full SHA 3ed5328View commit details -
Add dependency for read op in standalone executor (PaddlePaddle#44362)
* Add dependency for read op in standalone executor * Fix CI errors * Add UT * add_dependency -> dependency_utils * Fix CI errors
Configuration menu - View commit details
-
Copy full SHA for 2ee3202 - Browse repository at this point
Copy the full SHA 2ee3202View commit details -
Add distro in ci docker (PaddlePaddle#44332)
* add distro zstd * test * test * add pip3.8
Configuration menu - View commit details
-
Copy full SHA for 55427f1 - Browse repository at this point
Copy the full SHA 55427f1View commit details -
[Phi] migrate as_complex kernel to phi (PaddlePaddle#44438)
* migrate as_complex kernel to phi * support as_complex and as_real in phi * rm GetExpectedKernelType for AsRealOp
Configuration menu - View commit details
-
Copy full SHA for 0e2dd2f - Browse repository at this point
Copy the full SHA 0e2dd2fView commit details -
[GPUPS]FleetWrapper initialize (PaddlePaddle#44441)
* fix FleetWrapper initialize
Configuration menu - View commit details
-
Copy full SHA for 28cb006 - Browse repository at this point
Copy the full SHA 28cb006View commit details -
[XPU][NPU] (1) add device_guard. (2) add support for LoDTensorArray o…
…f sum op. (PaddlePaddle#44367) * device_guard support xpu. test=kunlun * sum op of xpu support LoDTensorArray. add test for while op of xpu. test=kunlun.
Configuration menu - View commit details
-
Copy full SHA for 8753a2b - Browse repository at this point
Copy the full SHA 8753a2bView commit details -
[IPU] add Op uts (PaddlePaddle#44415)
yaozhixin authoredJul 20, 2022 Configuration menu - View commit details
-
Copy full SHA for 54c7dfa - Browse repository at this point
Copy the full SHA 54c7dfaView commit details -
transfer block_id to CreateVarNode in multi_devices_graph_pass (Paddl…
…ePaddle#44366) * fix CreateVarNode in multi_devices_graph_pass * Revert "Fix var duplication bug for graph_to_program_pass (PaddlePaddle#44278)" This reverts commit a2c4c86.
Configuration menu - View commit details
-
Copy full SHA for 1882ffd - Browse repository at this point
Copy the full SHA 1882ffdView commit details -
【GPUPS】Adam accessor (PaddlePaddle#43919)
* add adam/sharedadam optimzier for gpups;edit optimizer struct;test=develop
Configuration menu - View commit details
-
Copy full SHA for b8d106e - Browse repository at this point
Copy the full SHA b8d106eView commit details -
Configuration menu - View commit details
-
Copy full SHA for c99c70c - Browse repository at this point
Copy the full SHA c99c70cView commit details -
[GPUPS]Fix psgpuwrapper initialization (PaddlePaddle#44468)
* Update ps_gpu_wrapper.h * Update ps_gpu_wrapper.h * Update ps_gpu_wrapper.cc
Configuration menu - View commit details
-
Copy full SHA for 99bf700 - Browse repository at this point
Copy the full SHA 99bf700View commit details -
[Phi] migrate exponential kernel to phi (PaddlePaddle#44376)
* [Phi] migrate exponential kernel to phi * fix comment * fix CI
Configuration menu - View commit details
-
Copy full SHA for 889bdde - Browse repository at this point
Copy the full SHA 889bddeView commit details -
[PHI] move diag_embed op to phi. (PaddlePaddle#44408)
* move diag_embed to phi.
Configuration menu - View commit details
-
Copy full SHA for 41f11d2 - Browse repository at this point
Copy the full SHA 41f11d2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1e1a4b9 - Browse repository at this point
Copy the full SHA 1e1a4b9View commit details -
Update api changing approve members (PaddlePaddle#44463)
* update api approve members, test=document_fix * add qingqnig into list, test=document_fix
Configuration menu - View commit details
-
Copy full SHA for e0b4efa - Browse repository at this point
Copy the full SHA e0b4efaView commit details -
Configuration menu - View commit details
-
Copy full SHA for dafe855 - Browse repository at this point
Copy the full SHA dafe855View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2883e4b - Browse repository at this point
Copy the full SHA 2883e4bView commit details -
Configuration menu - View commit details
-
Copy full SHA for fbfdea5 - Browse repository at this point
Copy the full SHA fbfdea5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 15dd94a - Browse repository at this point
Copy the full SHA 15dd94aView commit details -
[PHI]Seperate xshape kernel from normal kernel (PaddlePaddle#44315)
* seperate xshape kernel from normal kernel * fix bugs in infermeta * fix compile bugs * fix compile bugs
Configuration menu - View commit details
-
Copy full SHA for 98e9685 - Browse repository at this point
Copy the full SHA 98e9685View commit details
Commits on Jul 21, 2022
-
[AutoParallel] fix unittest with paddle.distributed.launch (PaddlePad…
…dle#44439) * fix unittest * fix log_dir * _enable_legacy_dygraph
Configuration menu - View commit details
-
Copy full SHA for 438ca7f - Browse repository at this point
Copy the full SHA 438ca7fView commit details -
[Phi] add temporal_shift yaml (PaddlePaddle#44409)
* add temporal_shift yaml and unittest
Configuration menu - View commit details
-
Copy full SHA for 0243c6c - Browse repository at this point
Copy the full SHA 0243c6cView commit details -
[Paddle inference] Add conv_fusion_fp16 (PaddlePaddle#44435)
* convfusionfp16 * convfusionfp16 * convfusionfp16
Configuration menu - View commit details
-
Copy full SHA for 3745571 - Browse repository at this point
Copy the full SHA 3745571View commit details -
fix some convert error found in tipc. (PaddlePaddle#44457)
* fix some error found in tipc. * update
Configuration menu - View commit details
-
Copy full SHA for d373f4f - Browse repository at this point
Copy the full SHA d373f4fView commit details -
[BugFix]Fix randint_like bugs when save program that don't need use t…
…ensor's value (PaddlePaddle#44446) * fix bugs of random * fix unittest error * fix unittest bugs
Configuration menu - View commit details
-
Copy full SHA for 5414694 - Browse repository at this point
Copy the full SHA 5414694View commit details -
add adaptive pool and softmax with cross entropy supports different a…
…xis, * test = kunlun (PaddlePaddle#44428) * add xpu pnorm op and fix pool op, *test=kunlun * add adaptive pool, and softmax with cross entropy supports different axis, *test=kunlun
Configuration menu - View commit details
-
Copy full SHA for 1a7f2de - Browse repository at this point
Copy the full SHA 1a7f2deView commit details -
add slot attr for push sparse op (PaddlePaddle#44422)
* add slot attr for push sparse op * add pybind * remove fleet * add unittest * fix
Configuration menu - View commit details
-
Copy full SHA for 85c6937 - Browse repository at this point
Copy the full SHA 85c6937View commit details -
[Dy2Sta]Fix Segment Fault while training multi-card if params have no…
… grad (PaddlePaddle#44485) * [Dy2Sta]Fix Segment Fault while training multi-card if params have no grad * fix unittest
Configuration menu - View commit details
-
Copy full SHA for 32c97a9 - Browse repository at this point
Copy the full SHA 32c97a9View commit details -
Configuration menu - View commit details
-
Copy full SHA for 32b3469 - Browse repository at this point
Copy the full SHA 32b3469View commit details -
Replace with dygraph op calling method. (PaddlePaddle#44331)
* Replace with dygraph op calling method.
Configuration menu - View commit details
-
Copy full SHA for 5b3f91d - Browse repository at this point
Copy the full SHA 5b3f91dView commit details -
[JitLayer]Pybind PEFunction and call phi api in layer_test (PaddlePad…
…dle#44465) * Support predictor function in JitLayer * Pybind PEFunction * Pybind PEFunction and call phi api in layer_test * Call sqrt phi API * Polish flags * Fix comments
Configuration menu - View commit details
-
Copy full SHA for a0bccd9 - Browse repository at this point
Copy the full SHA a0bccd9View commit details -
[Sparse] Add sparse addmm kernel (dense+coo*dense->dense,dense+csr*de…
…nse->dense) (PaddlePaddle#44451)
Configuration menu - View commit details
-
Copy full SHA for 78b5c10 - Browse repository at this point
Copy the full SHA 78b5c10View commit details -
[Eager] bilinear_tensor_product yaml (PaddlePaddle#44459)
* bilinear_tensor_product yaml
Configuration menu - View commit details
-
Copy full SHA for 55e5ab8 - Browse repository at this point
Copy the full SHA 55e5ab8View commit details -
[ Phi ] svd transfer (PaddlePaddle#44392)
* svd cpu forward * svd gpu forward * transfer the backward of svd * remove cusolver in svd_grad * svd kernel bug fix * fix bugs * fix bugs. * fix bug
Configuration menu - View commit details
-
Copy full SHA for ba89a3d - Browse repository at this point
Copy the full SHA ba89a3dView commit details -
[Paddle-TRT] fix_fill_constant (PaddlePaddle#44481)
* fix_fill_constant * fix_fill_constant * fix_ernie
Configuration menu - View commit details
-
Copy full SHA for c3ba805 - Browse repository at this point
Copy the full SHA c3ba805View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7ab0e33 - Browse repository at this point
Copy the full SHA 7ab0e33View commit details -
[jit] jit support property.proto (PaddlePaddle#44337)
* add property.proto, can compiled * property get and deserilize * support get float * format code * format code * add unittest * add more set method * fix grammar error * Update paddle/fluid/jit/property.h Co-authored-by: Aurelius84 <zhangliujie@baidu.com> * Update paddle/fluid/jit/property.cc Co-authored-by: Aurelius84 <zhangliujie@baidu.com> * Update paddle/fluid/jit/property.cc Co-authored-by: Aurelius84 <zhangliujie@baidu.com> * Update paddle/fluid/jit/property.cc Co-authored-by: Aurelius84 <zhangliujie@baidu.com> * fix comment * fix error throw * fix property save unit test * fix error info * fix copyright and header import * reorder jit property tensor datatype Co-authored-by: Aurelius84 <zhangliujie@baidu.com>
Configuration menu - View commit details
-
Copy full SHA for 0aa344f - Browse repository at this point
Copy the full SHA 0aa344fView commit details -
[ Dy2static ] infer_program may be incorrect in amp mode. (PaddlePadd…
…le#44487) * fix the outputs of net is x,x * add unittest for duplicate output * fix * fix _infer_program use the original program not the amp program. * get _***program_id back and avoid duplicate cache ing * fix
Configuration menu - View commit details
-
Copy full SHA for 185a900 - Browse repository at this point
Copy the full SHA 185a900View commit details -
* fc support fp16 * add a ‘,’ on paddle_pass_builder.cc * fc support fp16 on non-cuda.
Configuration menu - View commit details
-
Copy full SHA for 3e1280e - Browse repository at this point
Copy the full SHA 3e1280eView commit details
Commits on Jul 22, 2022
-
Configuration menu - View commit details
-
Copy full SHA for 4f86092 - Browse repository at this point
Copy the full SHA 4f86092View commit details -
Configuration menu - View commit details
-
Copy full SHA for a2b3932 - Browse repository at this point
Copy the full SHA a2b3932View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5ee4a21 - Browse repository at this point
Copy the full SHA 5ee4a21View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1c0120e - Browse repository at this point
Copy the full SHA 1c0120eView commit details -
Configuration menu - View commit details
-
Copy full SHA for db864f0 - Browse repository at this point
Copy the full SHA db864f0View commit details -
[CustomDevice] register Copy for custom device (PaddlePaddle#44200)
* [CustomDevice] register Copy for custom device * [CustomDevice] register Copy for custom device * [CustomDevice] register Copy for custom device * merge and add uts * merge and add uts * fix for blocking and unittests coverage
Configuration menu - View commit details
-
Copy full SHA for 3b0aa75 - Browse repository at this point
Copy the full SHA 3b0aa75View commit details -
Configuration menu - View commit details
-
Copy full SHA for fcfaa10 - Browse repository at this point
Copy the full SHA fcfaa10View commit details -
Add code of occupancy computing on DCU and avoid threadID bug for DCU…
… profiler (PaddlePaddle#44520)
Configuration menu - View commit details
-
Copy full SHA for 8037901 - Browse repository at this point
Copy the full SHA 8037901View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8ccbb86 - Browse repository at this point
Copy the full SHA 8ccbb86View commit details -
[phi] move inverse op from fluid to phi (PaddlePaddle#44471)
* move inverse from fluid to phi with unitest bug * fix bug, add eager op yaml
Configuration menu - View commit details
-
Copy full SHA for ea4b2c5 - Browse repository at this point
Copy the full SHA ea4b2c5View commit details -
support send_partial, recv_partial and allgather_partial in ProcessGr…
…oupNCCL (PaddlePaddle#44444)
Configuration menu - View commit details
-
Copy full SHA for 18c7732 - Browse repository at this point
Copy the full SHA 18c7732View commit details -
Configuration menu - View commit details
-
Copy full SHA for 19d9c73 - Browse repository at this point
Copy the full SHA 19d9c73View commit details -
* (modified) fc support fp16 * __CUDA_ARCH__ version * delete half * delete half
Configuration menu - View commit details
-
Copy full SHA for 6b6f7a2 - Browse repository at this point
Copy the full SHA 6b6f7a2View commit details
Commits on Jul 25, 2022
-
Fix bug of amp code-gen (PaddlePaddle#44570)
* fix bug of amp code_gen * fix bug
Configuration menu - View commit details
-
Copy full SHA for e32e4a1 - Browse repository at this point
Copy the full SHA e32e4a1View commit details -
[JitLayer]Fix jit.save error when save params combined (PaddlePaddle#…
…44504) * Fix jit.save error when save params combined * Change dict_value to list
Configuration menu - View commit details
-
Copy full SHA for c0a29d2 - Browse repository at this point
Copy the full SHA c0a29d2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3e17016 - Browse repository at this point
Copy the full SHA 3e17016View commit details -
add swish using TensorRT layer (PaddlePaddle#44561)
* update * empty commit * update * update * update
Configuration menu - View commit details
-
Copy full SHA for c5a1e49 - Browse repository at this point
Copy the full SHA c5a1e49View commit details -
Phi gird sampler migration (PaddlePaddle#44562)
* add_ymal_utest for phi grid_sampler op
Configuration menu - View commit details
-
Copy full SHA for b470bd1 - Browse repository at this point
Copy the full SHA b470bd1View commit details -
Configuration menu - View commit details
-
Copy full SHA for ead696a - Browse repository at this point
Copy the full SHA ead696aView commit details -
[dy2st]Add ProgramHelper to polish build program logic in autoparalle…
…l.Engine (PaddlePaddle#44513) * [dy2st]Add ProgramHelper to polish build program logic in autoparallel.Engine * refine code
Configuration menu - View commit details
-
Copy full SHA for 243acdb - Browse repository at this point
Copy the full SHA 243acdbView commit details -
【Hackathon No.21】为 Paddle 新增 SoftMarginLoss (PaddlePaddle#42364)
* 2022-04-28 * 2022-04-28_V2 * 2022-04-30 * 2022-04-30_V2 * 2022-05-01 * 2022-05-02 * 2022-05-02_V2 * 2022-05-05_V1 * 2022-05-06_V1 * 2022-05-07_V1 * Update loss.py * 2022-05-07_V2 * 2022-05-13_V1 * Update test_soft_margin_loss.py * Update loss.py * Update loss.py * 2022-05-16_V1 * 2022-05-19_V1 * 2022-05-20_V1 * Update test_soft_margin_loss.py * 2022-06-01_V1 * 2022-06-05 * 2022-06-07 * 2022-06-07 * 2022-06-08 * 2022-06-08_V2 * 2022-06-17-code_style * Modify python * 2022-06-20 * for * for CI;test=document_fix Co-authored-by: Ligoml <39876205+Ligoml@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for f9cd526 - Browse repository at this point
Copy the full SHA f9cd526View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3244a9d - Browse repository at this point
Copy the full SHA 3244a9dView commit details -
* (modified) fc support fp16 * __CUDA_ARCH__ version * delete half * delete half * add half support * add half support * add half support
Configuration menu - View commit details
-
Copy full SHA for a54c695 - Browse repository at this point
Copy the full SHA a54c695View commit details -
[Auto Parallel] Add dist op cost (PaddlePaddle#44146)
* update comp cost * add dist default op cost * add dist fill constant batch size like op cost * add elewise op cost * add fill_constant_batch_size_like op cost unittest * add unittest and remove fill_constant_batch_size_like grad op cost * add to cmakelist * fix unittest bug
Configuration menu - View commit details
-
Copy full SHA for d0f4465 - Browse repository at this point
Copy the full SHA d0f4465View commit details
Commits on Jul 26, 2022
-
Improve CI unittest parallel execution strategy (PaddlePaddle#44334)
* paralle_test_daily * test=paralle_test_daily * test=paralle_test_daily * test=parallel_test_daily * test=paralle_test_daily * test=paralle_test_daily * test=paralle_test_daily * test=paralle_test_daily * test=paralle_test_daily * test=paralle_test_daily * test=paralle_test_daily * test=paralle_test_daily * test pre_test_bak * test cfs * test_cfs,test=paralle_test_daily * test_cfs,test=paralle_test_daily * fix nightly test name,test=paralle_test_daily * fix nightly test name,test=paralle_test_daily * test ci parallel speed * refine parallel rule,test=paralle_test_daily
Configuration menu - View commit details
-
Copy full SHA for ff216f1 - Browse repository at this point
Copy the full SHA ff216f1View commit details -
Configuration menu - View commit details
-
Copy full SHA for fb80048 - Browse repository at this point
Copy the full SHA fb80048View commit details -
[PHI]Move slogdeterminant op to phi (PaddlePaddle#44547)
* Move slogdeterminant op to phi * Add yaml and unit test for slogdeterminant
Configuration menu - View commit details
-
Copy full SHA for 9bc54c8 - Browse repository at this point
Copy the full SHA 9bc54c8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1a22226 - Browse repository at this point
Copy the full SHA 1a22226View commit details -
Configuration menu - View commit details
-
Copy full SHA for 65ad58b - Browse repository at this point
Copy the full SHA 65ad58bView commit details -
inference multi stream support handle lazy init. (PaddlePaddle#44563)
* multi stream support handle lazy init. * support eigen lazy init * update * fix ci problem
Configuration menu - View commit details
-
Copy full SHA for 1892a44 - Browse repository at this point
Copy the full SHA 1892a44View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8d3672f - Browse repository at this point
Copy the full SHA 8d3672fView commit details -
transfer the svd infer into phi infermeta (PaddlePaddle#44528)
* transfer the svd infer into phi infermeta * remove the svd.h * modify svd api * fix svd error by insert optional
Configuration menu - View commit details
-
Copy full SHA for 25d3dce - Browse repository at this point
Copy the full SHA 25d3dceView commit details -
Einsum grad complex (PaddlePaddle#44598)
* add complex for einsum grad kernel * pass the ci
Configuration menu - View commit details
-
Copy full SHA for e0dd7f3 - Browse repository at this point
Copy the full SHA e0dd7f3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6198ff2 - Browse repository at this point
Copy the full SHA 6198ff2View commit details -
Set more attrs in ReplaceScaleLossGradOp (PaddlePaddle#44576)
* Set more attrs in ReplaceScaleLossGradOp * Fix typos * Fix CI errors * Add UT
Configuration menu - View commit details
-
Copy full SHA for ab198b4 - Browse repository at this point
Copy the full SHA ab198b4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 98f8fa4 - Browse repository at this point
Copy the full SHA 98f8fa4View commit details -
fix behavior of device_id=None in Tensor.cuda (PaddlePaddle#44515)
* fix behavior of device_id=None in Tensor.cuda * fix CI
Configuration menu - View commit details
-
Copy full SHA for 50de8a4 - Browse repository at this point
Copy the full SHA 50de8a4View commit details -
Configuration menu - View commit details
-
Copy full SHA for e3ee510 - Browse repository at this point
Copy the full SHA e3ee510View commit details -
add horizontal federation learning ps feature (PaddlePaddle#44327)
* back fl * delete ssl cert * . * make warning * . * unittest paral degree * solve unittest * heter & multi cloud commm ready * . * . * fl-ps v1.0 * . * support N + N mode * . * . * . * . * delete print * . * . * . * . * fix bug * . * . * fl-ps with coordinator ready * merge dev * update message parse only * update fl client scheduler * fix bug * update multithreads sync * fix ci errors * update role_maker.py * update role_maker.py * fix ci error: windows py import error * fix ci error: windows py import error * fix windows ci pylib import error * add dump fields & params * try to fix windows import fleet error * fix ps FLAGS error
Configuration menu - View commit details
-
Copy full SHA for 4bc22b6 - Browse repository at this point
Copy the full SHA 4bc22b6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 356ff43 - Browse repository at this point
Copy the full SHA 356ff43View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0d51fcf - Browse repository at this point
Copy the full SHA 0d51fcfView commit details -
Configuration menu - View commit details
-
Copy full SHA for 22342d5 - Browse repository at this point
Copy the full SHA 22342d5View commit details -
Optimize sparse convolution (PaddlePaddle#43576)
zhangkaihuo authoredJul 26, 2022 Configuration menu - View commit details
-
Copy full SHA for 9841b30 - Browse repository at this point
Copy the full SHA 9841b30View commit details -
Configuration menu - View commit details
-
Copy full SHA for b6e8480 - Browse repository at this point
Copy the full SHA b6e8480View commit details -
Configuration menu - View commit details
-
Copy full SHA for 33cc0f7 - Browse repository at this point
Copy the full SHA 33cc0f7View commit details -
Add a feed op before each input parameter var. (PaddlePaddle#44499)
* Add a feed op before each input parameter var. * Fix some issues about the unit test build_cinn_pass_test.
Configuration menu - View commit details
-
Copy full SHA for 9b662be - Browse repository at this point
Copy the full SHA 9b662beView commit details -
fix record event for operator type in new dygraph (PaddlePaddle#44582)
* fix new dygraph record event for op * update unit test
Configuration menu - View commit details
-
Copy full SHA for 963163e - Browse repository at this point
Copy the full SHA 963163eView commit details
Commits on Jul 27, 2022
-
fix bug of elementwise_add_grad, *test=kunlun (PaddlePaddle#44545)
* fix bug of elementwise_add_grad, *test=kunlun * fix bug, *test=kunlun * rm pooling_t, *test=kunlun * fix bug of ew_add_grad when inplace, *test=kunlun
Configuration menu - View commit details
-
Copy full SHA for 35ca1ce - Browse repository at this point
Copy the full SHA 35ca1ceView commit details -
[IPU] small bug fix (PaddlePaddle#44473)
* sync misc changes * add authors Co-authored-by: Zhaorui Chen <zhaoruic@graphcore.ai> * up x * Revert "up x" This reverts commit f3fde45. * add guarg for ipu Co-authored-by: Zhaorui Chen <zhaoruic@graphcore.ai>
Configuration menu - View commit details
-
Copy full SHA for 42d58dd - Browse repository at this point
Copy the full SHA 42d58ddView commit details -
Configuration menu - View commit details
-
Copy full SHA for 15c0c9d - Browse repository at this point
Copy the full SHA 15c0c9dView commit details -
Configuration menu - View commit details
-
Copy full SHA for a71cfd8 - Browse repository at this point
Copy the full SHA a71cfd8View commit details -
Configuration menu - View commit details
-
Copy full SHA for ce7c799 - Browse repository at this point
Copy the full SHA ce7c799View commit details -
Configuration menu - View commit details
-
Copy full SHA for d62af8b - Browse repository at this point
Copy the full SHA d62af8bView commit details -
[PHI]Add yaml and unittest for bmm op (PaddlePaddle#44625)
Add yaml and unittest for bmm op
Configuration menu - View commit details
-
Copy full SHA for 122fff4 - Browse repository at this point
Copy the full SHA 122fff4View commit details -
Phi average accumulates migration (PaddlePaddle#44554)
* move average_accumulates op to phi kernel
Configuration menu - View commit details
-
Copy full SHA for eafd428 - Browse repository at this point
Copy the full SHA eafd428View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7ee442c - Browse repository at this point
Copy the full SHA 7ee442cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2a5437a - Browse repository at this point
Copy the full SHA 2a5437aView commit details -
[CustomDevice] add process_group_xccl ut (PaddlePaddle#44632)
* [CustomDevice] add process_group_xccl ut * update
Configuration menu - View commit details
-
Copy full SHA for efb4d5c - Browse repository at this point
Copy the full SHA efb4d5cView commit details -
Fix conv api name (PaddlePaddle#44636)
zhangkaihuo authoredJul 27, 2022 Configuration menu - View commit details
-
Copy full SHA for e7c7280 - Browse repository at this point
Copy the full SHA e7c7280View commit details -
Configuration menu - View commit details
-
Copy full SHA for 28aa0c6 - Browse repository at this point
Copy the full SHA 28aa0c6View commit details -
[JitLayer]Remove include fluid head files in JitLayer (PaddlePaddle#4…
…4597) * Remove include fluid head files in JitLayer * Format code * Remove const to fix ci error * Fix param error * Polish jit layer include and cp some headers to python/include * Fix comment
Configuration menu - View commit details
-
Copy full SHA for 0dae79a - Browse repository at this point
Copy the full SHA 0dae79aView commit details -
[jit] jit.save support property serialization (PaddlePaddle#44581)
* jit.save support peropty serilization * extract set property function * fix property test file name * fix typing error * fix typing error * fix test coverage
Configuration menu - View commit details
-
Copy full SHA for 2bf5745 - Browse repository at this point
Copy the full SHA 2bf5745View commit details -
Replaced add_custom_command with add_custom_target in xpu_kp_cmake (P…
…addlePaddle#44619) * Replaced add_custom_command with add_custom_target in xpu_kp_cmake
Configuration menu - View commit details
-
Copy full SHA for 16506d8 - Browse repository at this point
Copy the full SHA 16506d8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4b7fe61 - Browse repository at this point
Copy the full SHA 4b7fe61View commit details -
[phi] move crop_tensor kernel from fluid to phi (PaddlePaddle#44574)
* move crop_tensor from fluid to phi * delete fluid header files * fix crop_tensor_op dygraph_mode bug * modify header files, add out tensor check
Configuration menu - View commit details
-
Copy full SHA for b20f771 - Browse repository at this point
Copy the full SHA b20f771View commit details -
fix RemoveIntermediateOut in fuse_elewise_add_act_pass while converti…
…ng graph to program (PaddlePaddle#44593) * fix RemoveNode in fuse_elewise_add_act_pass * fix * change pointer to share_ptr * fix * fix * fix format * fix * fix graph_safe_remove_nodes
Configuration menu - View commit details
-
Copy full SHA for be13271 - Browse repository at this point
Copy the full SHA be13271View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8a07d02 - Browse repository at this point
Copy the full SHA 8a07d02View commit details -
[IPU] add more loss ops (PaddlePaddle#44646)
* add more loss ops * add authors Co-authored-by: Zhaorui Chen <zhaoruic@graphcore.ai> Co-authored-by: Zhaorui Chen <zhaoruic@graphcore.ai>
Configuration menu - View commit details
-
Copy full SHA for 8bf7cd8 - Browse repository at this point
Copy the full SHA 8bf7cd8View commit details -
Configuration menu - View commit details
-
Copy full SHA for a6d0577 - Browse repository at this point
Copy the full SHA a6d0577View commit details -
Configuration menu - View commit details
-
Copy full SHA for 84d595f - Browse repository at this point
Copy the full SHA 84d595fView commit details -
[MLU]fix sync_batch_norm and concat_grad op (PaddlePaddle#44586)
qipengh authoredJul 27, 2022 Configuration menu - View commit details
-
Copy full SHA for f49b0cb - Browse repository at this point
Copy the full SHA f49b0cbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5be7a1f - Browse repository at this point
Copy the full SHA 5be7a1fView commit details -
Configuration menu - View commit details
-
Copy full SHA for ae25ab5 - Browse repository at this point
Copy the full SHA ae25ab5View commit details -
Configuration menu - View commit details
-
Copy full SHA for ea91ca2 - Browse repository at this point
Copy the full SHA ea91ca2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8fc1cf6 - Browse repository at this point
Copy the full SHA 8fc1cf6View commit details
Commits on Jul 28, 2022
-
Configuration menu - View commit details
-
Copy full SHA for 822e42d - Browse repository at this point
Copy the full SHA 822e42dView commit details -
Change the way to set attributes for grad op maker (PaddlePaddle#44514)
* fix typos in template for codegen of operators * change the way to set attributes for grad op maker
Feiyu Chan authoredJul 28, 2022 Configuration menu - View commit details
-
Copy full SHA for 8ee9140 - Browse repository at this point
Copy the full SHA 8ee9140View commit details -
[XPU] add top_k op (PaddlePaddle#44656)
* [XPU] add top_k op. test=kunlun * [XPU] add top_k op. test=kunlun * use PADDLE_ENFORCE_XDNN_NOT_NULL to check pointer. test=kunlun
Configuration menu - View commit details
-
Copy full SHA for acf07c7 - Browse repository at this point
Copy the full SHA acf07c7View commit details -
Configuration menu - View commit details
-
Copy full SHA for a90b8dc - Browse repository at this point
Copy the full SHA a90b8dcView commit details -
[PHI] Move spectral_norm to phi (PaddlePaddle#44577)
* Add kernel declarations * Copy kernel implementation code * Transfer implementation code * Fix: Move out_grad to first * Register new kernels * Remove old kernels * Move out_grad to last * Fix bugs * Transfer infermeta * Add yaml files * Add blank line * Fix code style * Optimize directory structure Co-authored-by: Bobholamovic <linmanhui@baidu.com>
Configuration menu - View commit details
-
Copy full SHA for 768e50c - Browse repository at this point
Copy the full SHA 768e50cView commit details -
Configuration menu - View commit details
-
Copy full SHA for d4cf02b - Browse repository at this point
Copy the full SHA d4cf02bView commit details -
[Eager] refactor general_grad and fix some bugs (PaddlePaddle#44611)
* refactor general_grad and fix some bugs * add TODO: support prune logic deeper
Configuration menu - View commit details
-
Copy full SHA for acde295 - Browse repository at this point
Copy the full SHA acde295View commit details -
Configuration menu - View commit details
-
Copy full SHA for 067107a - Browse repository at this point
Copy the full SHA 067107aView commit details -
[LAUNCH] add distributed launch check tools (PaddlePaddle#44495)
* add launch test * launch test for cpu * bs 1
Configuration menu - View commit details
-
Copy full SHA for 9a3e1bc - Browse repository at this point
Copy the full SHA 9a3e1bcView commit details -
Move api(lgamma) from legacy_api.yaml to api.yaml (PaddlePaddle#44355)
* Move api(lgamma) from legacy_api.yaml to api.yaml * Move api(lgamma) from legacy_api.yaml to api.yaml * Move api(lgamma) from legacy_api.yaml to api.yaml * modify code style * add x to X mapping * add definition of lgamma * delete redundant lgamma definitions * Modify code comments * Modify ops.py code format * add lgamma single test and lgamma api in fluid * Optimized lgamma unittest
Configuration menu - View commit details
-
Copy full SHA for 511a2c1 - Browse repository at this point
Copy the full SHA 511a2c1View commit details -
Move frame kernel to phi (PaddlePaddle#44615)
* Move frame OP to phi、add frame OP yaml config and supplement single test * add Header file of in_dygraph_mode * Modify variable name and FrameGradInferMeta multiplex UnchangedInferMeta * move seq2col to phi
Configuration menu - View commit details
-
Copy full SHA for 28b4b2f - Browse repository at this point
Copy the full SHA 28b4b2fView commit details -
Configuration menu - View commit details
-
Copy full SHA for dfeb194 - Browse repository at this point
Copy the full SHA dfeb194View commit details -
Configuration menu - View commit details
-
Copy full SHA for a9f76d0 - Browse repository at this point
Copy the full SHA a9f76d0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2cec4c8 - Browse repository at this point
Copy the full SHA 2cec4c8View commit details -
Fix some problem of kernel fallback in C++ API (PaddlePaddle#44681)
* support auto fallback to cpu kernel for cusom device * fix some problem of kernel fallback
Configuration menu - View commit details
-
Copy full SHA for 55aaeb3 - Browse repository at this point
Copy the full SHA 55aaeb3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2781740 - Browse repository at this point
Copy the full SHA 2781740View commit details -
migrate dirichlet kernel to phi (PaddlePaddle#44434)
* migrate dirichlet op kernel to phi * fix dirichlet sample memory leak
Configuration menu - View commit details
-
Copy full SHA for 798a4ea - Browse repository at this point
Copy the full SHA 798a4eaView commit details -
[phi]move softsign from fluid to phi (PaddlePaddle#44616)
* test_activation_op unitest error, yaml & activation.py in_dygraph_mode incomplete * fix test_activation_op unitest error, add yaml and dygraph test * fix code style with pre-commit * try to fix namespace error of abs in activation_functor.h * fix namespace error of abs
Configuration menu - View commit details
-
Copy full SHA for 20759c3 - Browse repository at this point
Copy the full SHA 20759c3View commit details -
[Paddle Inference] Support depthwise_conv2d fp16. (PaddlePaddle#44642)
* depthwise_fp16 * depthwise_fp16 * depthwise_fp16 * depthwise_fp16
Configuration menu - View commit details
-
Copy full SHA for ed85758 - Browse repository at this point
Copy the full SHA ed85758View commit details -
fix logging debug level (PaddlePaddle#44684)
* back fl * delete ssl cert * . * make warning * . * unittest paral degree * solve unittest * heter & multi cloud commm ready * . * . * fl-ps v1.0 * . * support N + N mode * . * . * . * . * delete print * . * . * . * . * fix bug * . * . * fl-ps with coordinator ready * merge dev * update message parse only * update fl client scheduler * fix bug * update multithreads sync * fix ci errors * update role_maker.py * update role_maker.py * fix ci error: windows py import error * fix ci error: windows py import error * fix windows ci pylib import error * add dump fields & params * try to fix windows import fleet error * fix ps FLAGS error * fix logging risk * fix logging possible risk
Configuration menu - View commit details
-
Copy full SHA for 8aa286d - Browse repository at this point
Copy the full SHA 8aa286dView commit details -
Configuration menu - View commit details
-
Copy full SHA for e9b9201 - Browse repository at this point
Copy the full SHA e9b9201View commit details -
Configuration menu - View commit details
-
Copy full SHA for bd813d3 - Browse repository at this point
Copy the full SHA bd813d3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 72b65d6 - Browse repository at this point
Copy the full SHA 72b65d6View commit details
Commits on Jul 29, 2022
-
Configuration menu - View commit details
-
Copy full SHA for e61f48c - Browse repository at this point
Copy the full SHA e61f48cView commit details -
fused_fc_elementwise_layernorm_op support fp16 (PaddlePaddle#44710)
* fused_fc_elementwise_layernorm support fp16 * fused_fc_elementwise_layernorm support double
Configuration menu - View commit details
-
Copy full SHA for 856f741 - Browse repository at this point
Copy the full SHA 856f741View commit details -
[Phi] Add yaml for assign_value (PaddlePaddle#44596)
* [Phi] Add yaml for assign_value * [Phi] Fix the bug of the assign api and modify the unittest * [Phi] Fix the bug when the tensor does not have the backend info * [Phi] Replace the functional-style cast init by the brace-init * [Phi] Cast the data explicitly
Configuration menu - View commit details
-
Copy full SHA for 8858439 - Browse repository at this point
Copy the full SHA 8858439View commit details -
[PHI] Move lu to phi (PaddlePaddle#44605)
* Add kernel declarations * Copy kernel implementation code * Transfer implementation code * Register new kernels * Remove old kernels * Fix code style * Fix bugs * mutable_data->HostAlloc * Transfer infermeta * Add yaml and update python api * Add PADDLE_WITH_HIP check * Update unittests * Fix bugs * Fix bugs * Optimize directory structure * Add output checks * lu_impl.h->lu_kernel_impl.h Co-authored-by: Bobholamovic <linmanhui@baidu.com>
Configuration menu - View commit details
-
Copy full SHA for 3d88816 - Browse repository at this point
Copy the full SHA 3d88816View commit details -
Configuration menu - View commit details
-
Copy full SHA for b7496bc - Browse repository at this point
Copy the full SHA b7496bcView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8c43c0f - Browse repository at this point
Copy the full SHA 8c43c0fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 23ad0cc - Browse repository at this point
Copy the full SHA 23ad0ccView commit details -
move CUDAStream to phi (PaddlePaddle#44529)
* init * move CUDAStream to phi * fix compilation * merge develop * add stream_owned_ member * split cuda_stream.h * fix cpu compile * fix constructor * fix bug * fix windows compile * fix inference test_levit * fix windows tests
Configuration menu - View commit details
-
Copy full SHA for da3743f - Browse repository at this point
Copy the full SHA da3743fView commit details -
[Auto parallel] Optimization Tuning (PaddlePaddle#43782)
* fixed bug for pass & engine * fixed bug for benchmark GPT-3 * add tuner & profiler * add algorithms & config
Configuration menu - View commit details
-
Copy full SHA for 72f2ed4 - Browse repository at this point
Copy the full SHA 72f2ed4View commit details -
skip cast trt convert when input dtype is bool (PaddlePaddle#44716)
* skip cast trt convert when input dtype is bool
Configuration menu - View commit details
-
Copy full SHA for 5d94618 - Browse repository at this point
Copy the full SHA 5d94618View commit details -
Configuration menu - View commit details
-
Copy full SHA for e3766da - Browse repository at this point
Copy the full SHA e3766daView commit details -
Phi softplus migration (PaddlePaddle#44542)
* add yaml and utests of phi softplus add yaml of softplus fix softplus bug in phi * update utests * bug fix * bug fix for test_layers * layer api match * match def and doc in ops.py * doc polish * fix unwanted modified of thresholded_relu * style imporve
Configuration menu - View commit details
-
Copy full SHA for 0551566 - Browse repository at this point
Copy the full SHA 0551566View commit details -
【PaddlePaddle Hackathon 3 No.15】为 Paddle 新增 count_nonzero (PaddlePadd…
…le#44169) * add count_nonzero api * remove grad test
Configuration menu - View commit details
-
Copy full SHA for a6c50a6 - Browse repository at this point
Copy the full SHA a6c50a6View commit details -
[WIP] Matmul v1 & v2 unification -- part 1 (PaddlePaddle#44640)
* - Unit tests to be debugged - fix - refactor - diagnostic - more diagnostic - fix - Fix number two - fix - fix - fix - alpha added - more fixes - compilation fix - removed diagnostic code - cosmetic fixes * lint
Configuration menu - View commit details
-
Copy full SHA for 653885a - Browse repository at this point
Copy the full SHA 653885aView commit details -
add FLAGS_enable_api_kernel_fallback (PaddlePaddle#44706)
* add FLAGS_enable_api_kernel_fallback * deal with more cases * add ut for coverage
Configuration menu - View commit details
-
Copy full SHA for e439d73 - Browse repository at this point
Copy the full SHA e439d73View commit details -
Configuration menu - View commit details
-
Copy full SHA for a991990 - Browse repository at this point
Copy the full SHA a991990View commit details -
add some fp16 op for kunlun resnet50 model (PaddlePaddle#44672)
* add some fp16 op for kunlun resnet50 model *test=kunlun * tmp *test=kunlun
Configuration menu - View commit details
-
Copy full SHA for fecbc95 - Browse repository at this point
Copy the full SHA fecbc95View commit details -
Configuration menu - View commit details
-
Copy full SHA for ec1e0d5 - Browse repository at this point
Copy the full SHA ec1e0d5View commit details -
[API/OP] Migrate Lstsq op into phi (PaddlePaddle#44318)
* migrate lstsq op * update * fix bugs for CIs * update * fix bugs * add uts * update * update * update * fix bugs of jip * fix bugs of hip * update * update according to review * update * update * update * update
Configuration menu - View commit details
-
Copy full SHA for ab2aaf8 - Browse repository at this point
Copy the full SHA ab2aaf8View commit details -
Add sparse SyncBatchNorm (PaddlePaddle#43520)
* add sparse SyncBatchNorm
zhangkaihuo authoredJul 29, 2022 Configuration menu - View commit details
-
Copy full SHA for 0a2db7c - Browse repository at this point
Copy the full SHA 0a2db7cView commit details -
unify fluid::CUDADeviceContext and phi::GpuContext (PaddlePaddle#44723)
* remove cudaDeviceContext * remove more template * fix rocm compile
Configuration menu - View commit details
-
Copy full SHA for 8849056 - Browse repository at this point
Copy the full SHA 8849056View commit details -
【PaddlePaddle Hackathon 3 No.12】为 Paddle 新增 pairwise_distance (Paddle…
…Paddle#44161) * add paddle.nn.functional.pairwise_distance (cattidea#273) * remove the test case for undefined behavior Co-authored-by: SigureMo <sigure.qaq@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 46be685 - Browse repository at this point
Copy the full SHA 46be685View commit details
Commits on Jul 30, 2022
-
Phi prior box (PaddlePaddle#44431)
* phi_prior_box * add float[] support * phi_prior_box_optest * update
Configuration menu - View commit details
-
Copy full SHA for d92b2f2 - Browse repository at this point
Copy the full SHA d92b2f2View commit details
Commits on Aug 1, 2022
-
Configuration menu - View commit details
-
Copy full SHA for 3948c24 - Browse repository at this point
Copy the full SHA 3948c24View commit details -
[PHI] Move lu_unpack to phi (PaddlePaddle#44674)
* Add kernel declarations * Copy kernel implementation code * Transfer implementation code * Register new kernels * Remove old kernels * Fix code style * Fix bugs * mutable_data->HostAlloc * Transfer infermeta * Add yaml and update python api * Add PADDLE_WITH_HIP check * Update unittests * Add kernel declarations * Copy kernel implementation code * Transfer kernel implementation code * Register new kernels * Remove old kernels * Add lu_unpack_sig * Fix bugs * Fix bugs * Fix bugs * Optimize directory structure * Add output checks * Update include files * lu_impl.h->lu_kernel_impl.h * Transfer infermeta * Add yaml and update python api * Add check_eager Co-authored-by: Bobholamovic <linmanhui@baidu.com>
Configuration menu - View commit details
-
Copy full SHA for c905a9e - Browse repository at this point
Copy the full SHA c905a9eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3a30178 - Browse repository at this point
Copy the full SHA 3a30178View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8482f1a - Browse repository at this point
Copy the full SHA 8482f1aView commit details -
[Paddle Inference] add varlen_token_prune plugin, pass, convert (Padd…
…lePaddle#44733) * add varlen_token_prune plugin, pass, convert
Configuration menu - View commit details
-
Copy full SHA for 24187fc - Browse repository at this point
Copy the full SHA 24187fcView commit details -
support build with Ninja on Linux (PaddlePaddle#44210)
* support ninja * fix mkldnn on windows * fix mkldnn on windows up1 * up2 * up3 * fix gflags * BUILD_BYPRODUCTS_OPTION -> BUILD_BYPRODUCTS_ARGS * use CMAKE_COMMAND * up x
Configuration menu - View commit details
-
Copy full SHA for 1d79f1f - Browse repository at this point
Copy the full SHA 1d79f1fView commit details -
migrate overlap_add and overlap_add_grad op (PaddlePaddle#44739)
* update code format * add ymal and test * update for comments
Configuration menu - View commit details
-
Copy full SHA for 2a8219c - Browse repository at this point
Copy the full SHA 2a8219cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 71f74f5 - Browse repository at this point
Copy the full SHA 71f74f5View commit details -
infer context fix place error. (PaddlePaddle#44726)
* infer context fix place error. * update * update
Configuration menu - View commit details
-
Copy full SHA for 74e46a9 - Browse repository at this point
Copy the full SHA 74e46a9View commit details -
[operator migration] Migrate unstack_op and nms_op (PaddlePaddle#44424)
* update unstack_op * update unstack_op * update unstack_op * fix unstack test * update unstack * update with remote * fix unstack_test.py * temp_save_change_nms_op * add nms test * update nms fix * update unstack_op * temp save change * finish fix nms_op * pass nms test * fix CI * fix ops test * save change * fix code style * fix code style * fix ci and codestyle * fix ci Co-authored-by: ShiningZhang <zhang_liang1991@126.com>
Configuration menu - View commit details
-
Copy full SHA for 9d2e0ec - Browse repository at this point
Copy the full SHA 9d2e0ecView commit details -
Configuration menu - View commit details
-
Copy full SHA for cd94be6 - Browse repository at this point
Copy the full SHA cd94be6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3e8708b - Browse repository at this point
Copy the full SHA 3e8708bView commit details -
Configuration menu - View commit details
-
Copy full SHA for e48cb42 - Browse repository at this point
Copy the full SHA e48cb42View commit details -
Configuration menu - View commit details
-
Copy full SHA for 16c7c96 - Browse repository at this point
Copy the full SHA 16c7c96View commit details -
Configuration menu - View commit details
-
Copy full SHA for f064ead - Browse repository at this point
Copy the full SHA f064eadView commit details -
Configuration menu - View commit details
-
Copy full SHA for 212f015 - Browse repository at this point
Copy the full SHA 212f015View commit details -
[JitLayer]Polish PEFuntion to speed up JitLayer and fix memory leak (P…
…addlePaddle#44738) * Polish PEFuntion to speed up JitLayer * Polish PEFunction code * Fix comments
Configuration menu - View commit details
-
Copy full SHA for 7512231 - Browse repository at this point
Copy the full SHA 7512231View commit details -
Configuration menu - View commit details
-
Copy full SHA for ffb3154 - Browse repository at this point
Copy the full SHA ffb3154View commit details -
set parallel_job according to CUDA memory in Windows CI unittest (Pad…
…dlePaddle#44695) * set parallel_job according to CUDA memory * fix bug: add whitespace between conten and [] or condition wont work
Configuration menu - View commit details
-
Copy full SHA for c28bb98 - Browse repository at this point
Copy the full SHA c28bb98View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1149a37 - Browse repository at this point
Copy the full SHA 1149a37View commit details -
GPUGraph merge to develop (PaddlePaddle#44594)
Co-authored-by: seemingwang <zsasuke@qq.com> Co-authored-by: DesmonDay <908660116@qq.com> Co-authored-by: seemingwang <seemingwang@users.noreply.github.com> Co-authored-by: Thunderbrook <a754913769@163.com> Co-authored-by: xuewujiao <105861147+xuewujiao@users.noreply.github.com> Co-authored-by: root <root@yq01-sys-hic-k8s-v100-box-a225-0693.yq01.baidu.com> Co-authored-by: Thunderbrook <52529258+Thunderbrook@users.noreply.github.com> Co-authored-by: root <root@yq01-inf-hic-k8s-a100-ab2-0009.yq01.baidu.com> Co-authored-by: huwei02 <53012141+huwei02@users.noreply.github.com> Co-authored-by: yaoxuefeng <yaoxuefeng@baidu.com> Co-authored-by: lxsbupt <luoxsbupt@163.com> Co-authored-by: miaoli06 <106585574+miaoli06@users.noreply.github.com> Co-authored-by: root <root@yq01-inf-hic-k8s-a100-ab2-0008.yq01.baidu.com> Co-authored-by: chao9527 <33347532+chao9527@users.noreply.github.com> Co-authored-by: qingshui <qshuihu@gmail.com> Co-authored-by: yangjunchao <yangjunchao@baidu.com>
Configuration menu - View commit details
-
Copy full SHA for 798670b - Browse repository at this point
Copy the full SHA 798670bView commit details -
Configuration menu - View commit details
-
Copy full SHA for f15d930 - Browse repository at this point
Copy the full SHA f15d930View commit details -
unify gpu context (PaddlePaddle#44740)
* remove cudaDeviceContext * remove more template * fix rocm compile * remove alias name CUDADeviceContext * fix compile * fix tests * revert changes
Configuration menu - View commit details
-
Copy full SHA for 8676302 - Browse repository at this point
Copy the full SHA 8676302View commit details -
API doc(en) Bugs fix in 第四期体验评估 (PaddlePaddle#44749)
* fix docs(en) bugs;test=document_fix * update paddle.add docs;test=document_fix * update paddle.where docs;test=document_fix * for ci;test=document_fix * Update manipulation.py * update paddle.where;test=document_fix Co-authored-by: Ligoml <39876205+Ligoml@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 937ea24 - Browse repository at this point
Copy the full SHA 937ea24View commit details
Commits on Aug 2, 2022
-
Configuration menu - View commit details
-
Copy full SHA for d788e72 - Browse repository at this point
Copy the full SHA d788e72View commit details -
Refactor build_op_downstream_map for standalone executor (PaddlePaddl…
…e#44729) * Refactor build_op_downstream_map for standalone executor * Add some comments
Configuration menu - View commit details
-
Copy full SHA for 9b97ac7 - Browse repository at this point
Copy the full SHA 9b97ac7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1bd6e28 - Browse repository at this point
Copy the full SHA 1bd6e28View commit details -
Configuration menu - View commit details
-
Copy full SHA for d8fedcb - Browse repository at this point
Copy the full SHA d8fedcbView commit details -
support beam_search operator on xpu. test=kunlun (PaddlePaddle#44720)
* support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun
Configuration menu - View commit details
-
Copy full SHA for 9bf8077 - Browse repository at this point
Copy the full SHA 9bf8077View commit details -
[phi] add yolov3_loss yaml and unittest (PaddlePaddle#44476)
* add yaml and unittest * update yaml * update backward yaml and unittest * update yaml * add Yolov3LossGradInferMeta * update yolov3_loss_op.cc * fix bug * code format
Configuration menu - View commit details
-
Copy full SHA for c7cf12f - Browse repository at this point
Copy the full SHA c7cf12fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 756f01d - Browse repository at this point
Copy the full SHA 756f01dView commit details -
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…
… dev_merge2graph Conflicts: paddle/fluid/distributed/ps/service/graph_brpc_server.cc paddle/fluid/distributed/ps/service/ps_service/graph_py_service.h paddle/fluid/distributed/ps/table/common_graph_table.cc paddle/fluid/distributed/ps/table/common_graph_table.h paddle/fluid/distributed/ps/table/ctr_dymf_accessor.cc paddle/fluid/distributed/ps/table/ctr_dymf_accessor.h paddle/fluid/distributed/ps/table/graph/graph_node.h paddle/fluid/distributed/ps/table/memory_sparse_table.h paddle/fluid/distributed/ps/table/sparse_sgd_rule.h paddle/fluid/distributed/ps/wrapper/CMakeLists.txt paddle/fluid/distributed/ps/wrapper/fleet.cc paddle/fluid/framework/CMakeLists.txt paddle/fluid/framework/data_feed.cc paddle/fluid/framework/data_feed.cu paddle/fluid/framework/data_feed.h paddle/fluid/framework/data_set.cc paddle/fluid/framework/device_worker.cc paddle/fluid/framework/device_worker.h paddle/fluid/framework/fleet/heter_ps/CMakeLists.txt paddle/fluid/framework/fleet/heter_ps/cudf/concurrent_unordered_map.cuh.h paddle/fluid/framework/fleet/heter_ps/feature_value.cu paddle/fluid/framework/fleet/heter_ps/feature_value.h paddle/fluid/framework/fleet/heter_ps/gpu_graph_node.h paddle/fluid/framework/fleet/heter_ps/gpu_graph_utils.h paddle/fluid/framework/fleet/heter_ps/graph_gpu_ps_table.h paddle/fluid/framework/fleet/heter_ps/graph_gpu_ps_table_inl.cu paddle/fluid/framework/fleet/heter_ps/graph_gpu_wrapper.cu paddle/fluid/framework/fleet/heter_ps/graph_gpu_wrapper.h paddle/fluid/framework/fleet/heter_ps/graph_sampler_inl.h paddle/fluid/framework/fleet/heter_ps/hashtable.h paddle/fluid/framework/fleet/heter_ps/hashtable_kernel.cu paddle/fluid/framework/fleet/heter_ps/heter_comm.h paddle/fluid/framework/fleet/heter_ps/heter_comm_inl.h paddle/fluid/framework/fleet/heter_ps/heter_comm_kernel.cu paddle/fluid/framework/fleet/heter_ps/heter_comm_kernel.h paddle/fluid/framework/fleet/heter_ps/heter_ps.cc paddle/fluid/framework/fleet/heter_ps/heter_ps.cu paddle/fluid/framework/fleet/heter_ps/heter_ps.h paddle/fluid/framework/fleet/heter_ps/heter_ps_base.h paddle/fluid/framework/fleet/heter_ps/mem_pool.h paddle/fluid/framework/fleet/heter_ps/optimizer.cuh.h paddle/fluid/framework/fleet/heter_ps/optimizer_conf.h paddle/fluid/framework/fleet/heter_ps/test_cpu_query.cu paddle/fluid/framework/fleet/ps_gpu_wrapper.cc paddle/fluid/framework/fleet/ps_gpu_wrapper.cu paddle/fluid/framework/fleet/ps_gpu_wrapper.h paddle/fluid/framework/hogwild_worker.cc paddle/fluid/framework/io/fs.cc paddle/fluid/framework/multi_trainer.cc paddle/fluid/framework/new_executor/CMakeLists.txt paddle/fluid/operators/pull_gpups_sparse_op.h paddle/fluid/platform/flags.cc paddle/fluid/pybind/data_set_py.cc paddle/fluid/pybind/fleet_py.cc paddle/utils/string/string_helper.h python/paddle/distributed/fleet/base/distributed_strategy.py python/paddle/distributed/passes/ps_trainer_pass.py python/paddle/distributed/ps/the_one_ps.py python/paddle/fluid/contrib/layers/nn.py python/paddle/fluid/dataset.py python/paddle/fluid/incubate/fleet/parameter_server/ir/trainer_pass.py python/paddle/fluid/layers/nn.py python/paddle/fluid/trainer_factory.py
Configuration menu - View commit details
-
Copy full SHA for 8737440 - Browse repository at this point
Copy the full SHA 8737440View commit details
Commits on Aug 3, 2022
-
Configuration menu - View commit details
-
Copy full SHA for ab5e2db - Browse repository at this point
Copy the full SHA ab5e2dbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 110f88c - Browse repository at this point
Copy the full SHA 110f88cView commit details
Commits on Aug 4, 2022
-
Merge branch 'gpugraph' of https://github.com/xuewujiao/Paddle into d…
…ev_merge2graph Conflicts: paddle/fluid/distributed/ps/table/common_graph_table.cc paddle/fluid/framework/data_set.cc paddle/fluid/framework/fleet/heter_ps/graph_gpu_wrapper.h
Configuration menu - View commit details
-
Copy full SHA for f7e78d5 - Browse repository at this point
Copy the full SHA f7e78d5View commit details -
Configuration menu - View commit details
-
Copy full SHA for c7f414a - Browse repository at this point
Copy the full SHA c7f414aView commit details