FAQ

Questions, answered.

It mostly comes down to one idea: the model runs on your device, so nothing leaves it. Here's the rest.

Still have a question?

Ask on GitHub
Is it really free?

Yes. Quenderin is MIT-licensed open source, and inference runs on your own hardware—so there are no token costs and no subscription.

Is my data sent anywhere?

No. After the one-time model download, Quenderin makes no network calls. There are no analytics, no accounts, and no telemetry. Because it's open source, you can verify this yourself.

What devices does it run on?

The desktop app runs today on macOS and Linux. Native iOS and Android apps are in active development—the same hardware-detection and model-selection engine, rebuilt natively for performance.

Do I need a powerful phone?

No. Quenderin scales from a Raspberry Pi to an M-series Mac. It detects what your device can handle and recommends a model that actually runs—down to a 0.4 GB ultra-light build.

Which models can it run?

Any GGUF model on llama.cpp—Llama, Qwen, DeepSeek, Mistral, Gemma, Phi, and thousands more from Hugging Face. See the model catalog for the curated shortlist it recommends from.

Can it actually do things, or just chat?

It includes a tool-using agent for small tasks, with a hard safety blocklist it cannot override on its own: it will never autonomously pay, delete, transfer money, or touch credentials. Anything sensitive is surfaced for you to confirm.

How big is a model download?

Your choice—from about 0.4 GB for the ultra-light build to 9 GB for the flagship. It downloads once over Wi-Fi and resumes in the background if interrupted.

When can I install it?

It's open source and runnable from GitHub today. The polished mobile apps are on the way—star or watch the repo on GitHub to hear the moment they land.

Bring your AI offline.

It's open source and runs from GitHub today. Star the repo to follow along.