Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Layer Norm x86 SIMD Optimizations #4065

Merged
merged 24 commits into from
Jul 29, 2022
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
24 commits
Select commit Hold shift + click to select a range
f5783b5
A LayerNorm_x86 class mocking LayerNorm for tests;
LinHeLurking Jul 18, 2022
7a94b1a
All SIMD optimizations success wihout support_packing; Maybe there's …
LinHeLurking Jul 20, 2022
b126e0f
Located error about packed layout.
LinHeLurking Jul 21, 2022
1982605
All test passed; Now it supports packing layout
LinHeLurking Jul 21, 2022
0fa8689
Fix runtime cpu dispatch;
LinHeLurking Jul 21, 2022
6a683d3
Use fmadd wrapper in x86_usability.h;
LinHeLurking Jul 22, 2022
bf95312
Merge packed & unpacked code.
LinHeLurking Jul 22, 2022
af97b05
Func rename.
LinHeLurking Jul 22, 2022
a9be63a
Simplify and merge more branches about packed layout;
LinHeLurking Jul 22, 2022
976692a
Code format
LinHeLurking Jul 24, 2022
d7007c3
Replace some member functions with static inline functions.
LinHeLurking Jul 25, 2022
508d143
Add copyright header
LinHeLurking Jul 25, 2022
cf015d8
apply code-format changes
LinHeLurking Jul 25, 2022
5084955
Add more tests with 16 packed for AVX512
LinHeLurking Jul 25, 2022
48fb4ea
Code format
LinHeLurking Jul 25, 2022
8c4ed97
Merge branch 'master' of https://github.com/Tencent/ncnn
LinHeLurking Jul 25, 2022
3c2c1c8
Merge branch 'master' of https://github.com/LinHeLurking/ncnn
LinHeLurking Jul 25, 2022
487568d
Copyright statement year fixed
LinHeLurking Jul 26, 2022
23db5ab
Fix accidentally added corelation of mean/var and SIMD ISA
LinHeLurking Jul 26, 2022
72777b4
Fix accidentally added corelation of fmadd/affine_fmadd and SIMD ISA
LinHeLurking Jul 26, 2022
b20d298
Fix a wrong test param
LinHeLurking Jul 26, 2022
4fddf9e
Fix runtime dispatch
LinHeLurking Jul 26, 2022
2555b3e
apply code-format changes
LinHeLurking Jul 26, 2022
1b118f7
no store duplicates
nihui Jul 29, 2022
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Loading