Skip to content

Conversation

@s-kanaev
Copy link
Contributor

@s-kanaev s-kanaev commented Sep 7, 2021

This should eliminate some part of overhead on host when running kernels in a loop. Especially small ones.
Another part of improvement is in #4516

Sergey Kanaev added 3 commits September 7, 2021 15:43
Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>
…ally.

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>
Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>
@s-kanaev s-kanaev requested a review from a team as a code owner September 7, 2021 13:32
@s-kanaev s-kanaev requested a review from romanovvlad September 7, 2021 13:32
Sergey Kanaev added 3 commits September 7, 2021 16:33
Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>
Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>
Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>
@romanovvlad romanovvlad merged commit b94f23a into intel:sycl Sep 10, 2021
againull added a commit to againull/llvm that referenced this pull request Jun 17, 2022
1. cuda prefetch issue seems to be fixed by:
intel#5043

2. Performance issues with assert seem to be fixed by:
intel#4505
intel#4516
pvchupin pushed a commit that referenced this pull request Jun 24, 2022
* Release notes for commit range f34ba2c..4043dda
* Update known issues:
1. cuda prefetch issue seems to be fixed by:
#5043
2. Performance issues with assert seem to be fixed by:
#4505
#4516
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants