Building llama.cpp or building libllama.so on a virtualized Linux on Apple silicon does not work. #2344
I'm also encountering this issue when trying to dockerize a project that has llama-cpp-python as a dependency. Downloading and using it locally worked like a charm, but building the Docker container failed with this same error.
I found a solution for using llama.cpp in Apple Silicon Linux VMs (and probably also Docker on Apple Silicon) without changing anything! Just build with the following command for Apple Silicon Linux VMs:
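Based on the variables discussed further down in this thread, the command was presumably:

```sh
# Force the arm64 code path and skip Metal, which isn't
# available inside VMs or Docker on Apple Silicon.
UNAME_M=arm64 UNAME_P=arm LLAMA_NO_METAL=1 make
```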
@AndreasKunar but doesn't your fix disable SIMD on ARM, reducing performance?
@redthing1 - my understanding is that it just force-disables the Metal-framework/MPS support, which is not available in VMs or Docker on Apple Silicon anyway. As I understand it, the Apple Virtualization Framework, which VMs/Docker on Apple Silicon have to use, only exposes a limited subset of the hardware's functionality. I think the CPU code generated by these switches still uses the available CPU SIMD instructions; at least in my tests, it had similar token/s performance in VMs to running without GPU enablement on a macOS host. I'm disappointed that VMs/Docker only seem to support GPU acceleration via CUDA, and probably Nvidia/AMD via CLBlast+OpenCL, on x64 CPUs. I'm still hopeful that Apple might someday extend their obligatory Virtualization Framework to cover its two key shortcomings: a) missing GPU/NPU support (even if only via intermediaries like OpenCL, etc.) and b) missing nested-hypervisor support (which somewhat cripples Linux/Windows in VMs). But they did neither when going from M1 to M2, nor in macOS Sonoma.
Thank you for the explanation. I hope so as well.
I encountered the same issue when switching my Docker image from Debian Bullseye to Bookworm. The problem arises from `UNAME_M` being detected as `aarch64`, which causes build failures on Apple Silicon; setting `ENV UNAME_M=arm64` in the Dockerfile fixes it. @AndreasKunar, please confirm that `UNAME_M=arm64 make` is enough to make it compile correctly.
Sorry, I'm no expert in the llama.cpp Makefile design. To be safe, I would also set `UNAME_P=arm LLAMA_NO_METAL=1`. At least this worked for me.
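For reference, a minimal Dockerfile sketch combining both suggestions; the base image and build steps are assumptions, only the three environment variables come from this thread:

```dockerfile
# Assumed base image; the thread only mentions Bullseye/Bookworm.
FROM debian:bookworm

RUN apt-get update && apt-get install -y build-essential git

# Override the auto-detected values so the Makefile takes the
# arm64 code path, and disable Metal, which is not available
# inside VMs/Docker on Apple Silicon.
ENV UNAME_M=arm64
ENV UNAME_P=arm
ENV LLAMA_NO_METAL=1

RUN git clone https://github.com/ggerganov/llama.cpp /llama.cpp && \
    make -C /llama.cpp
```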
Good find, thanks for sharing.
This issue was closed because it has been inactive for 14 days since being marked as stale.
I'm using a MacBook Air M2 24GB/1TB with Ubuntu 23.04 in a Parallels VM.

This is also an issue for downstream llama-cpp-python, which uses/builds libllama.so.

The compiler flag `-mcpu=native` seems to be the culprit, generating inlining errors. Without it, the build succeeds without any source-code changes, though it produces warnings: I'm not quite sure about "missing braces around initializer"; "unused variable" can probably be ignored. The code executes and works for me, but I could not fully test it.

A good fix is a bit difficult, since an M1 etc. in a VM is not easily detectable. The difference I found is in `uname -p`, which returns "unknown" on Raspberry Pi OS (64-bit) and "aarch64" on virtual Ubuntu on an M2.
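A quick shell sketch of that detection, using the output strings reported above:

```sh
# uname -p reports "aarch64" on virtualized Ubuntu on Apple silicon,
# but "unknown" on Raspberry Pi OS (64-bit), so it can tell the two
# apart where uname -m (aarch64 on both) cannot.
if [ "$(uname -p)" = "aarch64" ]; then
    echo "likely a VM on Apple silicon: skip -mcpu=native"
else
    echo "bare metal (e.g. Raspberry Pi): keep -mcpu=native"
fi
```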
An insert/edit of the Makefile starting at line 262 (right after the comments `# Apple M1, M2, etc.` and `# Raspberry Pi 3, 4, Zero 2 (64-bit)`) works for me. The `filter` result is empty when `UNAME_P` does not start with `aarch64`, so `-mcpu=native` is only set on bare metal like the Pi and skipped in Apple Silicon VMs:

```makefile
ifeq ($(filter aarch64%,$(UNAME_P)),)
    # do not set for Apple running in VMs
    CFLAGS   += -mcpu=native
    CXXFLAGS += -mcpu=native
endif
```
This is a similar problem to closed issue #1655.
I'm new to GitHub and don't know how to write pull requests; I also don't know whether the fix produces undesired side effects on non-Raspbian Linux on Pis.