Lemonade by AMD: a fast and open source local LLM server using GPU and NPU (lemonade-server.ai) AI

Lemonade is an open-source local LLM server that runs on PCs using available GPUs and NPUs, aiming for quick setup and private, local-first AI for text, images, and speech. It supports an OpenAI-compatible API and integrates with a range of apps, with a lightweight native backend and cross-platform availability (Windows, Linux, and macOS beta).

April 04, 2026 17:29 Source: Hacker News