llama.cpp b9196 adds Vulkan unaligned tensor support for ROPE
The release lets the Vulkan backend run ROPE on unaligned tensors, improving compatibility for local LLM inference.
ggml-org released llama.cpp b9196. This release adds Vulkan support for unaligned tensors during ROPE (Rotary Position Embedding) operations. Builds are available for macOS (Apple Silicon arm64, Intel x64), Linux (x64 CPU, arm64 CPU, s390x CPU, x64 Vulkan, arm64 Vulkan, x64 ROCm 7.2, x64 OpenVINO, x64 SYCL FP32, x64 SYCL FP16), Android (arm64 CPU), Windows (x64 CPU, arm64 CPU, x64 CUDA 12, x64 CUDA 13, x64 Vulkan, x64 SYCL, x64 HIP), and openEuler (x86 310p, x86 910b ACL Graph, aarch64 310p, aarch64 910b ACL Graph).
- Vulkan backend now handles unaligned tensors in ROPE, fixing crashes with quantized or custom models.
- Precompiled binaries cover 23 platform variants across macOS, Linux, Windows, Android, and openEuler.
- The release is signed with a verified GPG key and is based on commit 7ba22c6, with a focus on stability improvements.
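To make the first point concrete, here is a minimal Python sketch of the general idea, not the llama.cpp shader code. It assumes that "unaligned" means a tensor whose data starts at a byte offset that is not a multiple of the alignment the GPU requires for buffer bindings. One common way to handle this is to bind the buffer at the nearest aligned offset below the tensor and let the kernel skip the leftover elements. The function names, the 64-byte alignment, the 4-byte element size, and the consecutive-pair RoPE layout are all illustrative assumptions.

```python
import math


def split_offset(byte_offset, alignment):
    """Split a byte offset into (aligned base, leftover misalignment).

    `alignment` must be a power of two; 64 is only an example value.
    """
    if alignment <= 0 or alignment & (alignment - 1):
        raise ValueError("alignment must be a positive power of two")
    misalign = byte_offset & (alignment - 1)
    return byte_offset - misalign, misalign


def rope_pairs(values, position, base=10000.0):
    """Rotate consecutive (even, odd) pairs by position-dependent angles."""
    dim = len(values)
    if dim % 2:
        raise ValueError("RoPE needs an even number of elements")
    out = list(values)
    for i in range(0, dim, 2):
        # Pair i/2 uses the angle position * base^(-i/dim).
        theta = position * base ** (-i / dim)
        c, s = math.cos(theta), math.sin(theta)
        x0, x1 = values[i], values[i + 1]
        out[i] = x0 * c - x1 * s
        out[i + 1] = x0 * s + x1 * c
    return out


def rope_at_offset(buffer, byte_offset, dim, position,
                   alignment=64, elem_size=4):
    """Apply RoPE to `dim` elements starting at an arbitrary byte offset.

    `buffer` is a list standing in for GPU memory with `elem_size`-byte
    elements. The "binding" starts at the aligned base, and the kernel
    skips the leftover elements itself (hypothetical scheme).
    """
    if byte_offset % elem_size:
        raise ValueError("offset must be a multiple of the element size")
    aligned_base, misalign = split_offset(byte_offset, alignment)
    bound = buffer[aligned_base // elem_size:]  # what the kernel can see
    skip = misalign // elem_size                # elements to step over
    return rope_pairs(bound[skip:skip + dim], position)
```

For example, a tensor starting at byte 100 with a 64-byte alignment would be bound at byte 64, and the kernel would skip 36 bytes (nine 4-byte elements) before rotating. The result matches applying the rotation directly to the tensor's own elements.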
Why It Matters
Local LLM inference becomes more reliable on Vulkan GPUs, expanding hardware options for AI enthusiasts and professionals.