ggml : add AVX support based on AVX2 code #1376

katsu560 · 2023-05-09T12:48:19Z

I reopen new PR w/o unused function.
I added AVX support code from AVX2 code and modified AVX2 code

added AVX support code
static inline __m256i bytes_from_bits_32(const uint8_t * x)
static inline __m256i bytes_from_nibbles_32(const uint8_t * rsi)
static inline __m256 sum_i16_pairs_float(const __m128i xh, const __m128i xl)
static inline __m256 mul_sum_i8_pairs_float(const __m256i x, const __m256i y)
static void quantize_row_q4_1(const float * restrict x, void * restrict vy, int k)
static void dequantize_row_q4_0(const void * restrict vx, float * restrict y, int k)
static void dequantize_row_q4_1(const void * restrict vx, float * restrict y, int k)
static void ggml_vec_dot_q4_1_q8_1(const int n, float * restrict s, const void * restrict vx, const void * restrict vy)
static void ggml_vec_dot_q4_2_q8_0(const int n, float * restrict s, const void * restrict vx, const void * restrict vy)
static void ggml_vec_dot_q5_0_q8_0(const int n, float * restrict s, const void * restrict vx, const void * restrict vy)
static void ggml_vec_dot_q5_1_q8_1(const int n, float * restrict s, const void * restrict vx, const void * restrict vy)
static void ggml_vec_dot_q8_0_q8_0(const int n, float * restrict s, const void * restrict vx, const void * restrict vy)

modified AVX2 code
static void quantize_row_q4_1(const float * restrict x, void * restrict vy, int k)
static void dequantize_row_q4_1(const void * restrict vx, float * restrict y, int k)

ggerganov and sw, please confirm this PR.

ggerganov · 2023-05-11T21:28:21Z

Please rebase on latest master and resolve conflicts

katsu560 · 2023-05-13T13:49:15Z

Okay, I pushed new PR #1430. Please confirm it.

ggml : add AVX support based on AVX2 code

9266be2

This was referenced May 9, 2023

ggml : add AVX support and modify AVX2 code #1331

Closed

ggml : delete unused function, packNibbles_256 #1353

Closed

katsu560 closed this May 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ggml : add AVX support based on AVX2 code #1376

ggml : add AVX support based on AVX2 code #1376

katsu560 commented May 9, 2023

ggerganov commented May 11, 2023

katsu560 commented May 13, 2023

ggml : add AVX support based on AVX2 code #1376

ggml : add AVX support based on AVX2 code #1376

Conversation

katsu560 commented May 9, 2023

ggerganov commented May 11, 2023

katsu560 commented May 13, 2023