sgemm Just a practice. env: gcc version 4.8.5 20150623 (Red Hat 4.8.5-16) mkl 2018.1.1 For now the mm in intel get 53% performance over mkl_blas_sgemm with omp (28 physical cores x 2 sockets).