-
Notifications
You must be signed in to change notification settings - Fork 451
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
core dump in the neon-gemm-kernel-benchmark.cc. #138
Comments
I was not able to reproduce this on Pixel 2 but I did find a bug with the 7bit and 4bit kernels. Fix is here: #139 . Let me know if this helps. |
Also, the TX2 has 2 types of CPU cores: NVIDIA Denver2, and ARM Cortex-A57. If the issue persists after Marie's fix, it would be interesting to hear if the issue occurs specifically on one type of core. If the 'taskset' program is available on the TX2, you can use it to run a program on a specific core, e.g. taskset 0f /the/program where 0f can be any hexadecimal mask where each bit represents a CPU core. |
@mariecwhite @bjacob first execution: second time: third time: |
I have the same error on an A53: |
hit the same issue as @hariharan-m on my Lenovo Yoga C630 with Linux on it - built with both top of master clang and gcc 10.2.1 |
Hardware: NVidia TX 2
OS: Ubuntu 16.04
GCC: 5.40
compile flag: -std=c++11 -O3
error message:
:~/workspace/test/test_neon$ ./bench_mm
kernel,Gop/s
Arithmetic error in kernel:
NEON_64bit_GEMM_Int425Operands
Wrong accumulator for depth=32, at l = 1, r = 0
reference value: -47
actual value: -94
Aborted (core dumped)
The text was updated successfully, but these errors were encountered: