sync : ggml (conv ops + cuda MSVC fixes) #3765

ggerganov · 2023-10-24T18:08:03Z

No description provided.

ggml-ci

* master: (350 commits)
  speculative : ensure draft and target model vocab matches (ggerganov#3812)
  llama : correctly report GGUFv3 format (ggerganov#3818)
  simple : fix batch handling (ggerganov#3803)
  cuda : improve text-generation and batched decoding performance (ggerganov#3776)
  server : do not release slot on image input (ggerganov#3798)
  batched-bench : print params at start
  log : disable pid in log filenames
  server : add parameter -tb N, --threads-batch N (ggerganov#3584) (ggerganov#3768)
  server : do not block system prompt update (ggerganov#3767)
  sync : ggml (conv ops + cuda MSVC fixes) (ggerganov#3765)
  cmake : add missed dependencies (ggerganov#3763)
  cuda : add batched cuBLAS GEMM for faster attention (ggerganov#3749)
  Add more tokenizer tests (ggerganov#3742)
  metal : handle ggml_scale for n%4 != 0 (close ggerganov#3754)
  Revert "make : add optional CUDA_NATIVE_ARCH (ggerganov#2482)"
  issues : separate bug and enhancement template + no default title (ggerganov#3748)
  Update special token handling in conversion scripts for gpt2 derived tokenizers (ggerganov#3746)
  llama : remove token functions with `context` args in favor of `model` (ggerganov#3720)
  Fix baichuan convert script not detecing model (ggerganov#3739)
  make : add optional CUDA_NATIVE_ARCH (ggerganov#2482)
  ...

sync : ggml (conv ops + cuda MSVC fixes)

58f8ddd

ggml-ci

slaren approved these changes Oct 24, 2023

ggerganov merged commit b2f7e04 into master Oct 24, 2023
38 of 39 checks passed

ggerganov deleted the sync branch October 24, 2023 18:51
