Inference Engine for Apple Silicon

Onde is a high-performance inference engine optimized for Apple silicon, enabling private, low-latency, on-device LLM capabilities for App Store applications.
On-device LLM inference — optimized for Apple silicon.
Onde powers live App Store apps with fully on-device chat: no server, no network round trips, and no data leaving the device.
© 2026 Onde Inference
Source: Hacker News













