How to Use Ollama with Swift and iOS
A complete guide to local AI in Swift: a URLSession-based OllamaClient with non-streaming and streaming chat using AsyncStream, SwiftUI integration with async token display, configuring Ollama for LAN access from iOS devices on the same network, and Apple Foundation Models for true on-device inference on iOS 18+ and macOS 15+ using the Neural Engine with zero network access required.