Show HN: Apfel – The free AI already on your Mac

Every Apple Silicon Mac has a built-in LLM locked behind Siri. Apfel is an open-source tool that sets it free as a CLI tool and an OpenAI-compatible server.
The free AI already on your Mac.
Every Mac with Apple Silicon has a built-in LLM. Apple locked it behind Siri. apfel sets it free - as a CLI tool, an OpenAI-compatible server, and a chat.
$ brew install Arthur-Ficial/tap/apfel
Apple Silicon · macOS Tahoe · Apple Intelligence enabled
Real commands. Real outputs. All running on Apple Silicon.
The AI is already installed on your Mac. Apple ships it with macOS. apfel just gives you a way to talk to it - from your terminal, from your code, from anywhere.
CLI tool, HTTP server, or interactive chat. Pick the one that fits.
Pipe-friendly and composable. Works with jq, xargs, and your shell scripts. stdin, stdout, JSON output, file attachments, proper exit codes.
Drop-in replacement at localhost:11434. Point any OpenAI SDK at it and go. Streaming, tool calling, CORS, response formats.
Multi-turn conversations with automatic context management. Five trimming strategies. System prompt support. All on your Mac.
Apple built an LLM into your Mac. apfel gives it a front door.
Starting with macOS 26 (Tahoe), every Apple Silicon Mac includes a language model as part of Apple Intelligence. Apple exposes it through the FoundationModels framework - a Swift API that gives apps access to SystemLanguageModel. All inference runs on the Neural Engine and GPU. No network calls, no cloud, no API keys. The model is just there.
Out of the box, the on-device model powers Siri, Writing Tools, and system features. There is no terminal command, no HTTP endpoint, no way to pipe text through it. The FoundationModels framework exists, but you need to write a Swift app to use it. That is what apfel does.
apfel is a Swift 6.3 binary that wraps LanguageModelSession and exposes it three ways: as a UNIX command-line tool with stdin/stdout, as an OpenAI-compatible HTTP server (built on Hummingbird), and as an interactive chat with context management.
It handles the things Apple's raw API does not: proper exit codes, JSON output, file attachments, five context trimming strategies for the small 4096-token window, real token counting via the SDK, and conversion of OpenAI tool schemas to Apple's native Transcript.ToolDefinition format.
Source: Hacker News













