Lemonade

Local LLM server for discovering and running AI apps on personal GPUs and NPUs

Lemonade enables users to discover and run optimized LLMs directly from their own GPUs and NPUs. It provides OpenAI-compatible APIs and serves as an MCP-compatible local server supporting multiple hardware platforms including AMD Radeon, NVIDIA, and Intel processors.

Author: lemonade-sdk

Stars: 3,491

GitHub

Install: See https://lemonade-server.ai/install_options.html for platform-specific installation (Windows, Ubuntu, macOS, Arch Linux, Snap)