Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SYCL] Optimize SG group_load via BlockReadINTEL in simple cases #13673

Merged
merged 3 commits into from
May 9, 2024

Conversation

aelovikov-intel
Copy link
Contributor

No description provided.

@aelovikov-intel aelovikov-intel force-pushed the group-load branch 2 times, most recently from 9683503 to 3b225b7 Compare May 7, 2024 18:16
@aelovikov-intel aelovikov-intel marked this pull request as ready for review May 7, 2024 23:50
@aelovikov-intel aelovikov-intel requested a review from a team as a code owner May 7, 2024 23:50
@aelovikov-intel
Copy link
Contributor Author

Failures unrelated, CUDA tests are failing everywhere after the workflow has been fixed to bring its status back working, Windows build failed on the cleanup stage due to zombie processes - I've rebooted the runner.

@aelovikov-intel aelovikov-intel changed the title [SYCL] Optimize sub-group group_load via BlockRead in simple cases [SYCL] Optimize sub-group group_load via BlockReadINTEL in simple cases May 8, 2024
@aelovikov-intel aelovikov-intel changed the title [SYCL] Optimize sub-group group_load via BlockReadINTEL in simple cases [SYCL] Optimize SG group_load via BlockReadINTEL in simple cases May 8, 2024
Copy link
Contributor

@uditagarwal97 uditagarwal97 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@aelovikov-intel aelovikov-intel merged commit 48a0ff5 into intel:sycl May 9, 2024
14 checks passed
@aelovikov-intel aelovikov-intel deleted the group-load branch May 9, 2024 22:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants