Show HN: Aotol AI – Offline LLM app for iOS with voice and multilingual support
What it does:
- Fully offline LLM chat (works even with no signal)
- Multilingual text + voice (switch languages on the fly)
- Conversations stay on your phone (no data sent out)
How it works:
- Model: Llama 3.2 3B (quantized q4f16)
- Size: ~2 GB app bundle
- Inference: MLC-LLM + TVM runtime, optimized for iOS
- Average response: ~1–2 s/token on iPhone 15 Pro Max
- Voice chat: AVSpeechSynthesizer (TTS) + SFSpeechRecognizer (STT); rough sketch below
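For anyone curious, the voice layer is just those two system frameworks wired around the model. A minimal sketch of the loop, assuming a plain callback API — the class and method names are illustrative, not the app's actual code, and a real app also needs the mic and speech-recognition permission prompts:

    import AVFoundation
    import Speech

    // Minimal voice loop: SFSpeechRecognizer for speech-to-text,
    // AVSpeechSynthesizer for text-to-speech. Names are illustrative.
    final class VoiceChat {
        private let recognizer = SFSpeechRecognizer(locale: Locale(identifier: "en-US"))
        private let audioEngine = AVAudioEngine()
        private let synthesizer = AVSpeechSynthesizer()

        // Stream mic audio into the recognizer; deliver the final transcript.
        func listen(onFinal: @escaping (String) -> Void) throws {
            let request = SFSpeechAudioBufferRecognitionRequest()
            request.shouldReportPartialResults = true

            let input = audioEngine.inputNode
            let format = input.outputFormat(forBus: 0)
            input.installTap(onBus: 0, bufferSize: 1024, format: format) { buffer, _ in
                request.append(buffer)
            }
            audioEngine.prepare()
            try audioEngine.start()

            recognizer?.recognitionTask(with: request) { result, _ in
                if let result, result.isFinal {
                    self.audioEngine.stop()
                    input.removeTap(onBus: 0)
                    onFinal(result.bestTranscription.formattedString)
                }
            }
        }

        // Speak the model's reply; swapping the language code switches voices.
        func speak(_ text: String, language: String = "en-US") {
            let utterance = AVSpeechUtterance(string: text)
            utterance.voice = AVSpeechSynthesisVoice(language: language)
            synthesizer.speak(utterance)
        }
    }

The transcript can then feed the same prompt path as typed input, so voice and text share one pipeline.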
Why: I wanted to test if “desktop-grade” LLM experiences could run locally on a phone, both for privacy and offline availability.
Limitations:
- Accuracy is around 70% on general QA (small model, quantized)
- Long prompts slow generation down noticeably
- Memory footprint is tight on older devices
Download (iOS): https://apps.apple.com/app/aotol-ai-private-on-device-ai/id6...
I’d love feedback from anyone experimenting with:
- Smaller models on-device (sub-4B)
- Optimizing quantization for speed vs. accuracy
- UX patterns for chat when inference can stall (one strawman sketched below)
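To make that last point concrete, here's one strawman: stream tokens into the chat bubble, but arm a watchdog so the UI flips to an explicit "still thinking" state (spinner plus cancel) whenever no token lands for a few seconds. The AsyncStream stands in for whatever token stream the runtime exposes; every name here is hypothetical:

    import Foundation

    enum ChatState {
        case streaming(String)          // partial reply, tokens flowing
        case stalled(partial: String)   // nothing recently; show spinner + cancel
        case done(String)               // final reply
    }

    // Drive one chat turn; `tokens` stands in for the runtime's token stream.
    func runTurn(
        tokens: AsyncStream<String>,
        stallTimeout: Duration = .seconds(3),
        onState: @escaping @Sendable (ChatState) -> Void
    ) -> Task<Void, Never> {
        Task {
            var reply = ""
            var watchdog: Task<Void, Never>?

            // (Re)arm the stall timer; each new token cancels the old one.
            func armWatchdog(partial: String) {
                watchdog?.cancel()
                watchdog = Task {
                    try? await Task.sleep(for: stallTimeout)
                    if !Task.isCancelled { onState(.stalled(partial: partial)) }
                }
            }

            armWatchdog(partial: reply)
            for await token in tokens {
                if Task.isCancelled { break }
                reply += token
                onState(.streaming(reply))
                armWatchdog(partial: reply)
            }
            watchdog?.cancel()
            onState(.done(reply))
        }
    }

Returning the Task means a cancel button can just call .cancel() on it, which is the affordance I'm most unsure about — curious what others do here.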