sync : ggml #9567

ggerganov · 2024-09-20T16:10:41Z

No description provided.

ggml-ci

* CUDA eval works
* stochastic gradient descent op
* Adam except decay
* CUDA CROSS_ENTROPY_LOSS_BACK
* CUDA mnist-fc training works
* backend CLI arg
* refactor gguf load
* remove sched from opt_step_adam
* implement l1 regularization (weight decay)
* extra call to add optimizer
* initialize gradients with ggml_graph_reset
* gradient accumulation
* increment iter per eval instead of epoch
* adjust backend interfaces
* fix ggml_graph_reset without backend
* fix ggml graph export/import
* fixup
* rename
* revert ggml_opt changes
* more general CUDA repeat_back
* update documentation, fix CNN
* validation split
* add clarifying comment
* optimize PyTorch training
* adjust buffer size, thread count
* fix 0.0f validation split
* Update examples/mnist/mnist-common.cpp
  Co-authored-by: Georgi Gerganov <[email protected]>
* fix gradient accumulation
* tensor flag for accumulators -> tensor hash set
* Update include/ggml.h
  Co-authored-by: slaren <[email protected]>
* Update tests/test-backend-ops.cpp
  Co-authored-by: slaren <[email protected]>
* Update tests/test-backend-ops.cpp
  Co-authored-by: slaren <[email protected]>
* fix test prints
* Update src/ggml-backend.c
  Co-authored-by: Georgi Gerganov <[email protected]>
* better CUDA support for noncontiguous out_prod
* add comment

---------

Co-authored-by: Georgi Gerganov <[email protected]>
Co-authored-by: slaren <[email protected]>

ggml-ci

ggerganov · 2024-09-20T16:32:00Z

@JohannesGaessler @slaren How do we fix the HIP build? It fails with:

[ 11%] Building HIP object ggml/src/CMakeFiles/ggml.dir/ggml-cuda/out-prod.cu.o
In file included from /__w/llama.cpp/llama.cpp/ggml/src/ggml-cuda/out-prod.cu:2:
/__w/llama.cpp/llama.cpp/ggml/src/ggml-cuda/vendors/cuda.h:3:10: fatal error: 'cuda_runtime.h' file not found
#include <cuda_runtime.h>
         ^~~~~~~~~~~~~~~~
1 error generated when compiling for gfx906.
gmake[2]: *** [ggml/src/CMakeFiles/ggml.dir/build.make:404: ggml/src/CMakeFiles/ggml.dir/ggml-cuda/out-prod.cu.o] Error 1

Edit: I think I just have to remove the include from out-prod.cu. Will give it a try in a bit.

slaren · 2024-09-20T16:35:52Z

ggml/src/ggml-cuda/out-prod.cu

@@ -0,0 +1,52 @@
+#include "out-prod.cuh"
+#include "vendors/cuda.h"


Suggested change (delete this line):

-#include "vendors/cuda.h"

This should do it. This file cannot be included directly, and it is already included in common.cuh.

I can confirm that this fixes compilation with GGML_HIPBLAS.
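
For readers following the fix, here is a minimal single-file sketch of the pattern described above: one common header decides which vendor header is pulled in, and source files include only the common header. All `DEMO_*` and `demo_*` names are hypothetical and are not ggml identifiers. The exact preprocessor logic in common.cuh is not shown in this thread, so treat this as an assumption about its general shape rather than a copy of it.

```cpp
// Hypothetical illustration only: DEMO_* / demo_* names are not part of ggml.
#include <cassert>
#include <cstring>

// --- stand-in for a "common" header ---------------------------------------
// In a real tree each branch would #include a different vendor header
// (one wrapping the CUDA runtime, one wrapping the HIP runtime). Here each
// branch only defines a string so the sketch compiles without any GPU SDK.
#if defined(DEMO_USE_HIP)
    #define DEMO_BACKEND_NAME "hip"   // stand-in for the HIP vendor header
#else
    #define DEMO_BACKEND_NAME "cuda"  // stand-in for the CUDA vendor header
#endif

// --- stand-in for a source file such as a kernel .cu file ------------------
// The source file relies on whatever the common header selected. If it also
// included the CUDA vendor header directly, a HIP build would try to find
// the CUDA runtime header and fail, as in the log quoted earlier.
inline const char * demo_backend_name() {
    return DEMO_BACKEND_NAME;
}
```

Compiling with `-DDEMO_USE_HIP` switches the selected branch without touching the source file, which is why the direct include in out-prod.cu was unnecessary as well as harmful.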

ggml-ci

ggerganov and others added 3 commits September 20, 2024 18:57

examples : add null threadpool args where needed (ggml/0)

bb51df5

ggml-ci

sync : ggml

bddc6c6

ggml-ci

ggerganov changed the title ~~examples : add null threadpool args where needed (ggml/0)~~ sync : ggml Sep 20, 2024

ggml : fix trailing whitespace (#0)

ebc359c

ggml-ci

slaren reviewed Sep 20, 2024


ggml : fix builds (#0)

a39be1a

ggml-ci

ggerganov force-pushed the sync branch from ad634e0 to a39be1a Compare September 20, 2024 17:36

ggerganov merged commit d13edb1 into master Sep 20, 2024
57 checks passed

ggerganov deleted the sync branch September 20, 2024 18:15
