
metal : reusing llama.cpp logging #3152

Merged (13 commits, merged into ggerganov:master on Sep 27, 2023)
Conversation

@Ricardicus (Contributor)

I wanted to silence some of the output and found that some of it
came from llama.cpp and some from ggml-metal.m on my Mac. I saw the "TODO" and
thought I might chime in here with this. Perhaps we should also respect the log level,
and allow setting the verbosity via command-line arguments? Let me know what you think.

@cebtenzzre (Collaborator)

The CMake build is failing on macOS:

[ 91%] Linking CXX executable ../../bin/metal
Undefined symbols for architecture x86_64:
  "_llama_log", referenced from:
      _ggml_metal_init in ggml-metal.m.o
      _ggml_metal_free in ggml-metal.m.o
      _ggml_metal_host_malloc in ggml-metal.m.o
      _ggml_metal_add_buffer in ggml-metal.m.o
      _ggml_metal_set_tensor in ggml-metal.m.o
      _ggml_metal_get_tensor in ggml-metal.m.o
      _ggml_metal_graph_find_concurrency in ggml-metal.m.o
      ...
ld: symbol(s) not found for architecture x86_64
clang: error: linker command failed with exit code 1 (use -v to see invocation)
make[2]: *** [bin/metal] Error 1
make[1]: *** [examples/metal/CMakeFiles/metal.dir/all] Error 2

@Ricardicus (Contributor, Author)

OK, sorry, I missed that. Fixed it.

@ggerganov (Owner)

The change in this PR is not OK because it couples ggml with llama.cpp.

I didn't write a detailed explanation in the TODO, but what I meant was to implement a way to pass a log callback, so that llama.cpp and other projects can provide their own callback to the Metal backend.
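
A minimal sketch of that idea, with illustrative names (the merged code may differ in the details): the Metal backend stores a user-provided callback and falls back to stderr when none is set.

#include <stdarg.h>
#include <stdio.h>

typedef void (*ggml_metal_log_fn)(int level, const char * text, void * user_data);

static ggml_metal_log_fn g_log_callback  = NULL;
static void *            g_log_user_data = NULL;

void ggml_metal_log_set_callback(ggml_metal_log_fn cb, void * user_data) {
    g_log_callback  = cb;
    g_log_user_data = user_data;
}

// internal helper used by the backend instead of calling llama.cpp directly
static void ggml_metal_log(int level, const char * fmt, ...) {
    char buf[1024];
    va_list args;
    va_start(args, fmt);
    vsnprintf(buf, sizeof(buf), fmt, args);
    va_end(args);

    if (g_log_callback) {
        g_log_callback(level, buf, g_log_user_data);
    } else {
        fprintf(stderr, "%s", buf);
    }
}

This removes the hard dependency on llama.cpp's _llama_log symbol, which is what broke the standalone metal example in the linker error above.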

@Ricardicus (Contributor, Author)

Ok! I see. I can try to do that, and decouple the two again.

@Ricardicus (Contributor, Author) commented on Sep 13, 2023

I decoupled it. I introduced a log-function setter in ggml-metal that llama.cpp calls to point the backend at its internal logger. I still need to include llama.h to get the enum definition. Is this OK?

@Ricardicus (Contributor, Author) commented on Sep 13, 2023

Fixed the trailing-whitespace editorconfig error.

@Ricardicus (Contributor, Author)

I resolved a conflict that had appeared.

@Ricardicus (Contributor, Author)

I decoupled it more now. Since the log level is not really used yet, I took the liberty of moving that definition into ggml.h instead. My reasoning was that llama.cpp already depends on ggml.h, so to synchronize the two with a callback I needed a function signature that could be used in both the Metal and llama.cpp translation units. It might be easier to work with ints and macros, but the idea of an enum also makes sense.
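
For reference, a sketch of what that shared definition in ggml.h could look like; the numeric values here are an assumption, chosen to mirror llama.cpp's existing log levels:

// ggml.h -- sketch of a shared log-level enum; the specific numeric
// values are an assumption, mirroring llama.cpp's levels at the time
enum ggml_log_level {
    GGML_LOG_LEVEL_ERROR = 2,
    GGML_LOG_LEVEL_WARN  = 3,
    GGML_LOG_LEVEL_INFO  = 4,
};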

@ggerganov (Owner) left a comment


> I decoupled it more now.

Yes - this is the way :) I think it's OK now - maybe move the llama_log_callback typedef into ggml.h so we can reuse it:

typedef void (*ggml_log_callback)(enum ggml_log_level level, const char * text, void * user_data);
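
With that typedef in ggml.h, the llama.cpp side can forward the Metal backend's messages to its own logger. A rough sketch of the wiring; the setter name ggml_metal_log_set_callback follows the API this PR introduces, but the helper below is illustrative:

#ifdef GGML_USE_METAL
#include <stdio.h>
#include "ggml-metal.h"

// callback matching the ggml_log_callback typedef above; the name and
// body here are illustrative, not the exact code from the PR
static void llama_metal_log(enum ggml_log_level level, const char * text, void * user_data) {
    (void) level;
    (void) user_data;
    fputs(text, stderr); // llama.cpp would dispatch to its internal logger here
}

// called once during backend initialization:
//   ggml_metal_log_set_callback(llama_metal_log, NULL);
#endif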

@Ricardicus (Contributor, Author)

I added the typedef for log callbacks in ggml.h now :)

@ggerganov (Owner)

Will merge this some time next week - don't worry, I won't forget :)

@ggerganov merged commit dc68974 into ggerganov:master on Sep 27, 2023
29 of 33 checks passed
joelkuiper added a commit to vortext/llama.cpp that referenced this pull request on Sep 27, 2023:
…example

* 'master' of github.com:ggerganov/llama.cpp:
  convert : remove bug in convert.py permute function (ggerganov#3364)
  make-ggml.py : compatibility with more models and GGUF (ggerganov#3290)
  gguf : fix a few general keys (ggerganov#3341)
  metal : reusing llama.cpp logging (ggerganov#3152)
  build : add ACCELERATE_NEW_LAPACK to fix warning on macOS Sonoma (ggerganov#3342)
  readme : add some recent perplexity and bpw measurements to READMES, link for k-quants (ggerganov#3340)
  cmake : fix build-info.h on MSVC (ggerganov#3309)
  docs: Fix typo CLBlast_DIR var. (ggerganov#3330)
  nix : add cuda, use a symlinked toolkit for cmake (ggerganov#3202)