Supports macOS Apple Silicon, Intel, and iOS XCFramework out of the box?

Supports macOS Apple Silicon, Intel, and iOS XCFramework out of the box.

Linux builds include CPU, Vulkan, ROCm 7.2, OpenVINO, and SYCL variants?

Linux builds include CPU, Vulkan, ROCm 7.2, OpenVINO, and SYCL variants.

Windows binaries cover CUDA 12/13, Vulkan, OpenCL Adreno, and HIP?

Windows binaries cover CUDA 12/13, Vulkan, OpenCL Adreno, and HIP.

Developer Tools

llama.cpp b9822 release brings builds for all major platforms

llama.cpp Releases June 28, 2026

⚡New llama.cpp release b9822 includes pre-built binaries for macOS, Windows, Linux, Android, and iOS.

Deep Dive

ggml-org's llama.cpp project has released b9822, a significant update that emphasizes cross-platform accessibility for local large language model inference. The release ships a wide array of pre-built binaries, eliminating the need for users to compile from source. This is a major convenience for developers and enthusiasts who want to run models like Llama, Mistral, or Gemma on their own hardware without wrestling with build environments.

Platform support is comprehensive: macOS gets builds for Apple Silicon (with optional KleidiAI acceleration), Intel x64, and an iOS XCFramework. Linux users can choose from Ubuntu x64/arm64 CPU, Vulkan, ROCm 7.2, OpenVINO, and SYCL FP32/FP16 variants. Windows binaries cover x64/arm64 CPU, CUDA 12 and 13 DLLs, Vulkan, OpenCL Adreno, and HIP. Android arm64 CPU is also included. The release also fixes the test-chat-template --no-common option, ensuring better template handling. This release lowers the barrier to entry for running state-of-the-art LLMs locally across diverse hardware configurations.

Key Points

Supports macOS Apple Silicon, Intel, and iOS XCFramework out of the box.
Linux builds include CPU, Vulkan, ROCm 7.2, OpenVINO, and SYCL variants.
Windows binaries cover CUDA 12/13, Vulkan, OpenCL Adreno, and HIP.

Why It Matters

Pre-built binaries simplify deploying local LLMs across macOS, Windows, Linux, Android, and iOS without compilation.

Read Original Article

llama.cpp b9822 release brings builds for all major platforms

Why It Matters

Related Articles

🚀 Stay Ahead in AI