Getting Started
Sign up, pair a machine, and start serving private AI to your team in five minutes.
OwnLLM has two halves:
- A SaaS control plane at
https://ownllm.appthat handles auth, billing, audit, and the public OpenAI-compatible API. - A local agent that runs on your GPU machine and serves inference. It
ships either as the Atlas desktop app (Mac, Windows, Linux GUI) or as
the
ownllmCLI (headless).
Pairing the local agent to your tenant takes one command. After that,
everything — model selection, audit logs, API keys — happens from
https://ownllm.app.
The five-minute path
- Sign up at ownllm.app and pick a plan.
- Copy your pairing key from the onboarding wizard
(
ownllm_pair_<32-hex>, valid for seven days, single-use). - Install the agent on your GPU host:
- Desktop: download the
.dmg/.msi/.debfrom/download. - CLI:
curl -fsSL https://ownllm.app/install.sh | sh.
- Desktop: download the
- Pair the machine:
- Desktop: paste the key into the first-run wizard.
- CLI:
ownllm pair ownllm_pair_<key>.
- Open
https://<your-slug>.ownllm.appin any browser, sign in, chat.
Once paired, the agent opens an outbound Cloudflare Tunnel and registers itself in your tenant. No port forwarding. No DNS. No reverse proxy.
What runs where
| Surface | Lives at | Talks to |
|---|---|---|
| Marketing site, admin dashboard, chat UI | ownllm.app | The agent, via tunnel |
OpenAI-compatible proxy (/v1/*) | ownllm.app | The agent, via tunnel |
| Local agent (CLI or desktop) | Your GPU machine | Ollama, cloudflared |
| Inference (Ollama) | Your GPU machine | The local agent |
Your prompts and completions transit ownllm.app (so we can authorize, log,
rate-limit) but inference runs on your hardware. We never send model
weights or completions to a third-party provider.
Next steps
- The CLI reference (lands in the next release) covers
pair,service,models,runtime,recipes, anddoctor. - The Desktop app guide (next release) walks through the pairing UI, dashboard, and tray icon.
- The API docs (next release) explain
OPENAI_BASE_URL, key scopes, budgets, and supported endpoints.