Running Google Gemma 4 Locally with LM Studio's New Headless CLI and Claude Code (ai.georgeliu.com)

The article explains how to run Google’s Gemma 4 26B, a mixture-of-experts (MoE) model, locally on macOS using the new headless command-line tools in LM Studio 0.4.0 (llmster and the lms CLI), and how to connect that setup to Claude Code. It walks through downloading and loading the model, checking performance, memory use, and parallelism, and choosing a context length and quantization that fit on a Mac with 48 GB of unified memory. It also notes that although Gemma 4’s MoE design makes the model feasible on modest hardware, running it through Claude Code can introduce a noticeable slowdown.
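
To make the quantization trade-off concrete, here is a rough back-of-the-envelope sketch in Python. It is not taken from the article. It estimates only the weights-only footprint from the 26B parameter count and a nominal bits-per-weight figure, and it assumes (as is typical for MoE models) that all expert weights stay resident in memory. The `headroom_gb` value for the OS, KV cache, and other applications is a hypothetical placeholder, and real quantized files carry extra overhead this ignores.

```python
# Rough weights-only memory estimate; all figures are approximations.
TOTAL_PARAMS = 26e9       # taken from the "26B" in the model name
MEMORY_BUDGET_GB = 48.0   # unified memory of the Mac described in the summary


def weight_footprint_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate weights-only size in GB (1 GB = 10^9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


def fits(bits_per_weight: float, headroom_gb: float = 16.0) -> bool:
    """Check whether weights plus a hypothetical headroom fit in the budget.

    headroom_gb is an assumed allowance for the OS, KV cache, and other
    apps; it is not a measured value.
    """
    needed = weight_footprint_gb(TOTAL_PARAMS, bits_per_weight) + headroom_gb
    return needed <= MEMORY_BUDGET_GB


if __name__ == "__main__":
    for bits in (16, 8, 4):
        gb = weight_footprint_gb(TOTAL_PARAMS, bits)
        print(f"{bits:>2}-bit: ~{gb:.0f} GB of weights, fits={fits(bits)}")
```

Under these assumptions, 16-bit weights alone (about 52 GB) exceed the 48 GB budget, while 8-bit (about 26 GB) and 4-bit (about 13 GB) leave room for a longer context window.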

April 05, 2026 18:40 Source: Hacker News