Skip to content
This repository has been archived by the owner on Dec 1, 2021. It is now read-only.

Tell device environment information to generate_project.py in order to optimization and simplicity #1103

Open
primenumber opened this issue Jun 19, 2020 · 0 comments
Labels
enhancement New feature or request

Comments

@primenumber
Copy link
Contributor

For example, memory layout of weights of kernels will be one of three layouts: OHWI (Arm w/ NEON, , x86 w/ AVX2), HWOI (generic), ChHWBCl (w/ FPGA).
Since converter doesn't know what layouts will be used, it generates and holds all layouts of weights.
Because of that, converter has ad-hoc implementation and make static-analysis more difficult.
Also, that prevents some optimizations (such as omit memory copy, memory buffer reusing).

@primenumber primenumber added the enhancement New feature or request label Jun 19, 2020
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant