-
Notifications
You must be signed in to change notification settings - Fork 6.8k
Implement remaining nn_basic ops in opperf #17456
Implement remaining nn_basic ops in opperf #17456
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As discussed offline
Not a good practice to add duplicate (redundant code)
30-40% of the lines can be taken care of with a function. So lets get that across this entire file.
Thanks for the contribution again!
Group of operator test - all NN Basic ops (GPU) Full OpPerf test (GPU) *Note: couldn't run SoftmaxOutput op backwards on GPU - see previously documented issue and couldn't run im2col either forwards or backwards on GPU - see new documented issue. In both cases, simply switching the context from cpu to gpu leads the op to fail. |
I was able to run GPU tests with topk - https://gist.github.com/ChaiBapchya/fac7310f7d167d1361854451e7daa342 That shouldn't be a problem. Please confirm. |
Address the quoted and profiler comment made by @apeforest. Rest LGTM ! |
Updated Full OpPerf GPU Results with topk perf: https://gist.github.com/connorgoggins/2d4d2ff6dca61494eb8151a5106fec6c |
Incorrect way to rebase @connorgoggins First get your master branch updated
Once your fork master is sync'ed with remote master, rebase your branch on master
Let me know if this works! |
fbebf35
to
f5279be
Compare
@mxnet-label-bot add [pr-awaiting-review] |
48d0667
to
3a8fbc4
Compare
9970fb7
to
a5889ae
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm!
bd8041a
to
e0dc303
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Thanks for your contribution
* Added SoftmaxOutput * Added LinearRegressionOutput * Added other regression ops * Added SVMOutput * Added L2, layer and instance norm * gamma and beta to ndarray * Reworked layer/instance norm * Added Embedding * Disabled backward on embedding * Added Correlation * Added data1 and 2 to ndarray * Added SpatialTransformer * Made loc ndarray type * Run backward test * Added IdentityAttachKLSparseReg * Dropping grad * Added sparseness target * Added grad back * Disabling backward for IdentityAttachKLSparseReg * Trying to debug * Print problematic op * Another log * Removing IdentityAttachKLSparseReg test for now * Removed faulty test * Added im2col * Added col2im * Added GroupNorm * Added RNN * Added paramters and state to ndarray * Added LRN * Added preloaded_multi_mp_sgd_mom_update * Added lamb_update_phase1 * Added lamb_update_phase2 * Dropped reversal * Finalized nn basic ops * Cleaned up code for linter * Refactored individual tests into generalized framework * Refined logic, added default params * Fixed LRN param placement * Refactored default params for clarity * Fixed lint errors * Fixed BatchNorm issue * Removed debugging comment * Cleaned up indentation * Added axis param for LayerNorm op * Fixed loc param issues * Linked Embedding backward issue in run_performance_test * Disabling problematic runs on gpu * Added myself to CONTRIBUTORS.md * Addressed PR comments * Fixed DEFAULT_LABEL issue * Tightend up logic, established consistency with master * Fixed indent
* Added SoftmaxOutput * Added LinearRegressionOutput * Added other regression ops * Added SVMOutput * Added L2, layer and instance norm * gamma and beta to ndarray * Reworked layer/instance norm * Added Embedding * Disabled backward on embedding * Added Correlation * Added data1 and 2 to ndarray * Added SpatialTransformer * Made loc ndarray type * Run backward test * Added IdentityAttachKLSparseReg * Dropping grad * Added sparseness target * Added grad back * Disabling backward for IdentityAttachKLSparseReg * Trying to debug * Print problematic op * Another log * Removing IdentityAttachKLSparseReg test for now * Removed faulty test * Added im2col * Added col2im * Added GroupNorm * Added RNN * Added paramters and state to ndarray * Added LRN * Added preloaded_multi_mp_sgd_mom_update * Added lamb_update_phase1 * Added lamb_update_phase2 * Dropped reversal * Finalized nn basic ops * Cleaned up code for linter * Refactored individual tests into generalized framework * Refined logic, added default params * Fixed LRN param placement * Refactored default params for clarity * Fixed lint errors * Fixed BatchNorm issue * Removed debugging comment * Cleaned up indentation * Added axis param for LayerNorm op * Fixed loc param issues * Linked Embedding backward issue in run_performance_test * Disabling problematic runs on gpu * Added myself to CONTRIBUTORS.md * Addressed PR comments * Fixed DEFAULT_LABEL issue * Tightend up logic, established consistency with master * Fixed indent
Description
This PR serves to implement the remaining operators from the nn_basic category in opperf.
Checklist
Essentials
Please feel free to remove inapplicable items for your PR.
Changes
Comments
Tested on c5.18xl-ubuntu 16.04 and Mac OS with:
run_performance_test
on individual opsrun_nn_basic_operators_benchmarks
opperf.py
(full run of all ops)@apeforest @access2rohit