-
Notifications
You must be signed in to change notification settings - Fork 42
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
AllocateSharedMemoryPass
has possibility to allocate SLM size greater than device max share memory
#1716
Comments
Information from @whitneywhtsang
|
This task is still under progress. |
The root cause of this issue is the large 2d load with large
I'm trying to do some changes on the first way, if there are some common ops make the ConvertLayout op not removable. I'll switch to the second way which is add another pass to check and reduce large 2d load size to make it work functionally at first. |
Running gemm kernels like gemm_splitk_benchmark.py with the latest
llvm-target
branch will fail forThe text was updated successfully, but these errors were encountered: