-
Notifications
You must be signed in to change notification settings - Fork 3.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[ARM] Fix concat #3061
[ARM] Fix concat #3061
Conversation
I still meet the same error. @hlu1 v0.0.1 |
The root cause of this problem is that the way concat implemented is not vectorizable. It uses |
I think we could use some discussion here. Although for certain systems, OOB read is fine and select could just be used. Ideally, we would like to have a detector that detects if vectorization can be applied, and only apply it when it finds that the compute-pattern is OK. We could move some of the discussion to a new RFC. @hlu1 @FrozenGene please share your thoughts |
ping @hlu1 |
Merge this in for now as it is a strict improvement over previous ones |
Thanks, @hlu1 @FrozenGene |
Fix for https://discuss.tvm.ai/t/relay-build-target-rasp3b-something-wrong/2195
I added arm_cpu schedule for concat with no vectorization.New test case is also added to cover the special case that triggered this bug.
@FrozenGene, please review