Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix GEMMK=1 kernel #543

Merged
merged 2 commits into from
May 28, 2024
Merged

Fix GEMMK=1 kernel #543

merged 2 commits into from
May 28, 2024

Conversation

CNugteren
Copy link
Owner

This solves a bug that was originally introduced in #274 (which introduced the GEMMK=1 kernel), reported in #340, but then incorrectly fixed in #341. It resurfaced again in #542 recently as more tuning results are coming in, in particular for Intel devices that seem to favour the GEMMK=1 kernel in certain cases.

The issue only shows up when MWG != NWG, and even then only in certain situations.

It is very likely also solving the issue with the Intel Arc A770 reported in #533, since that device also uses tuning parameters that were leading to this bug. Furthermore the logs from the CLBlast tests in that issue also correspond with what I reproduced locally. Therefore this PR also re-adds those tuning parameters.

@CNugteren CNugteren merged commit 962f929 into master May 28, 2024
6 checks passed
@CNugteren CNugteren deleted the fix_gemmk_kernel branch May 28, 2024 07:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant