CUDA: faster dequantize kernels for Q4_0 and Q4_1 (#4938) · YannFollet/llama.cpp@4a3156d

Triggered via push January 15, 2024 06:48

YannFollet pushed 4a3156d to master

Status Failure

Total duration 29m 13s

Artifacts 1

build.yml

on: push

Matrix: windows-latest-cmake-cublas

Matrix: windows-latest-cmake

macOS-latest-cmake-ios

1m 28s

macOS-latest-cmake-tvos

1m 27s

ios-xcode-build

1m 7s

Matrix: macOS-latest-swift

Matrix: ubuntu-latest-cmake-mpi

Matrix: ubuntu-latest-cmake-sanitizer

release

19s

Annotations

2 errors

windows-latest-cmake (avx512, -DLLAMA_NATIVE=OFF -DLLAMA_BUILD_SERVER=ON -DLLAMA_AVX512=ON -DBUIL...

Process completed with exit code 1.

release

Resource not accessible by integration

Artifacts

Produced during runtime

Name	Status	Size
artifact	Expired	751 MB
