optimize conv algo cache #41891
Merged
48 commits
5547efd optimizer conv alog speed (phlrain)
490abce code polish (phlrain)
f06dcad remove useless code (phlrain)
12f8364 fix compile error (phlrain)
d9dfe6c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
1729ba8 fix cpu compile error (phlrain)
b8c05fd Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
0f64787 not use cudnn alog t (phlrain)
5314766 add search cache max number (phlrain)
56eb2c6 polish code (phlrain)
e5bfa67 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
aabc60f fix cache test bug (phlrain)
09a04fb Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
c455f11 add groups data format to conv args (phlrain)
445fe4d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
442a9e1 fix cache test bug (phlrain)
19c59f7 fix cudnn_deterministic bug (phlrain)
4b0a58d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
40c7d23 fix test switch auto tune bug (phlrain)
184cab6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
df57ee6 fix test swith autotune bug; (phlrain)
82c2419 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
6dbeaa5 fix conv cache bug (phlrain)
2cd1c00 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
c8fe9c6 fix cache test error (phlrain)
2be9374 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
7b277f4 fix cache test bug (phlrain)
83e1c8c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
93885d4 fix windows mac compile error (phlrain)
3404862 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
fb53df7 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
4098916 fix workspace search error (phlrain)
b15a4be Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
11b8315 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
ba41e29 update cudnn cache (phlrain)
662dca2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
99a33bf fix cache test bug; test=develop (phlrain)
ecfa2e4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
af7fa80 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
f7afc76 fix autotune swith test error (phlrain)
0101cc4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
4ad71a8 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
a806f93 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
10de962 polish code (phlrain)
7290528 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
a549c20 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
65c5ecc oplish code (phlrain)
f1a0da6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into… (phlrain)
`DnnNode` duplicates much of what `SearchResult` already does, so it would be better to use `SearchResult` instead if that is possible. That said, we will probably extend `DnnNode` into an `AutoTuneResult` type later on.
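I did have a version that used `SearchResult`, but its template parameter `T` is `cudnnConvolutionFwdAlgoPerf_t`, which makes cache.h depend on gpu_info.h. cache.h is also used in CPU-only builds, so that version fails to compile there.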
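
To illustrate the constraint discussed in this thread, here is a minimal sketch of a cache entry that carries no cuDNN types, so the header holding it can compile in a CPU-only build. The field names, the `ConvAlgoCache` class, and the use of a precomputed `size_t` key are assumptions made for illustration; they are not taken from the PR's actual code.

```cpp
#include <cstddef>
#include <cstdint>
#include <unordered_map>

// Hypothetical stand-in for DnnNode: the chosen algorithm is stored as a
// plain integer instead of a cuDNN enum or perf struct, so this header
// needs no GPU includes.
struct DnnNode {
  int64_t algo;           // algorithm id, cast from the backend enum by the caller
  size_t workspace_size;  // workspace bytes required by that algorithm
};

// Hypothetical cache keyed by a precomputed hash of the conv arguments.
class ConvAlgoCache {
 public:
  // Returns true and fills *out when the key is cached.
  bool Find(size_t key, DnnNode* out) const {
    auto it = map_.find(key);
    if (it == map_.end()) return false;
    *out = it->second;
    return true;
  }

  // Inserts or overwrites the entry for this key.
  void Set(size_t key, const DnnNode& node) { map_[key] = node; }

  size_t Size() const { return map_.size(); }

 private:
  std::unordered_map<size_t, DnnNode> map_;
};
```

In a design like this, GPU-side code would convert between the integer `algo` and the cuDNN algorithm enum at the call site, which keeps the cuDNN dependency out of the cache header.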