Private AI that runs on your machine, for your whole team.
OwnLLM turns a Mac Studio, RTX workstation, or GPU server into a multi-user AI platform: chat, SSO, audit logs, OpenAI-compatible API, and zero-config networking.
Priority beta access for CTOs, agencies, and 1-200 person SMBs.
Per-seat AI subscription prices keep climbing. Local infrastructure puts you in control of the cost curve.
OwnLLM does not replace every premium tool overnight. It captures the internal, sensitive, and repetitive workloads that get expensive on general-purpose AI providers.
Flat subscription
The more your team uses AI, the more repetitive work your local infrastructure absorbs.
On your side
Model execution stays on your machine, with plan-based retention and encryption controls for any hosted history.
No open ports
The app opens an outbound Cloudflare Tunnel, so there is no brittle network setup.
API compatible
Claude Code, Cursor, and OpenCode route local workloads through the API, which matches each request to a model with the right capabilities.
Deployment path
Start with a small machine shared by the team.
Measure usage, quotas, and savings per organization.
Move to a larger GPU machine when the volume justifies it.
From GPU machine to AI service in 3 steps
The CTO keeps control, employees get a simple URL, and developers keep their tools.
Request beta access
When cloud AI becomes a budget line, your GPU becomes an asset.
Your models run on your hardware
Inference is routed to your GPU machine through an outbound tunnel. You keep control over retention and access.
SSO, SCIM, and governance
Magic link to start, SAML/OIDC on Startup, then SCIM and audit exports on Enterprise.
Make dev tools pay for themselves faster
Keep Claude Code, Cursor, or OpenCode in the workflow with a local API that checks model capabilities before routing.
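Because the API speaks the OpenAI wire format, pointing a tool at it is just a base-URL swap. A minimal sketch of the request shape (the URL, key, and model name below are placeholders, not real OwnLLM values):

```python
import json

def build_chat_request(base_url: str, api_key: str, model: str, messages: list) -> tuple:
    """Build an OpenAI-style chat completion request against a local endpoint."""
    url = base_url.rstrip("/") + "/v1/chat/completions"
    headers = {
        "Authorization": f"Bearer {api_key}",  # same bearer scheme OpenAI clients expect
        "Content-Type": "application/json",
    }
    body = json.dumps({"model": model, "messages": messages})
    return url, headers, body

# Dev tools are configured the same way: an OpenAI-compatible base URL plus a key.
url, headers, body = build_chat_request(
    "https://ai.example-team.com",  # hypothetical team URL
    "sk-local-123",                 # hypothetical API key
    "llama3.1:8b",
    [{"role": "user", "content": "Summarize this ticket"}],
)
```

Any client that accepts a custom base URL (the `openai` SDK, Cursor, OpenCode) can emit exactly this request without code changes.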
Web chat for non-technical teams
A team URL, company login, and models selected for your actual hardware.
Audit and predictable costs
Track who uses what, avoid stacked per-seat AI subscriptions, and keep pricing flat.
Start small, expose the right model for the job
OwnLLM sells the operational layer: you choose the machine, we deliver access, updates, security, model recommendations, and clear capability labels.
Tool calling is only enabled for models whose Ollama capabilities include tools. Smaller chat models stay available for simple prompts without breaking agentic clients.
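That gating logic can be sketched in a few lines, assuming the shape of Ollama's `/api/show` response, which lists a model's capabilities (the model names in the catalog are illustrative, not a recommended lineup):

```python
def supports_tools(show_response: dict) -> bool:
    """True when Ollama reports the 'tools' capability for a model."""
    return "tools" in show_response.get("capabilities", [])

def pick_model(catalog: dict, needs_tools: bool) -> str:
    """Route to the first model that satisfies the request's needs.

    `catalog` maps model names to their /api/show responses.
    """
    for name, info in catalog.items():
        if not needs_tools or supports_tools(info):
            return name
    raise LookupError("no model with the required capabilities")

catalog = {
    "gemma2:2b": {"capabilities": ["completion"]},           # small chat-only model
    "llama3.1:8b": {"capabilities": ["completion", "tools"]}, # tool-calling capable
}

pick_model(catalog, needs_tools=False)  # plain chat can use the small model
pick_model(catalog, needs_tools=True)   # agentic clients get a tools-capable model
```

A request that declares tools never lands on a chat-only model, so agentic clients do not break, while simple prompts still get the cheapest model available.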
Sell local AI without forcing DIY on your teams.
OwnLLM keeps the control plane simple and auditable, while inference and models stay within your machine boundary.
Clear positioning for the DPO
Metadata needed for audit and billing is centralized. Conversation storage policies are explicit and configurable per tenant.
- Outbound tunnel only: no inbound ports opened on the customer network.
- SSO, admin/member roles, SCIM, and centralized revocation depending on plan.
- Hashed API keys, per-model scopes, configurable budgets, and expiration.
- Audit logs separated from content: who, when, model, tokens, and channel.
- Control plane hosted in Europe with DPA and configurable retention.
- Local inference on the customer's machine through a short-lived shared secret.
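The hashed-API-key point above can be sketched as follows. The exact scheme is not specified here; SHA-256 with a constant-time compare is one reasonable shape for high-entropy random keys:

```python
import hashlib
import hmac

def hash_key(api_key: str) -> str:
    """Only this digest is stored server-side; the plaintext key is never kept."""
    return hashlib.sha256(api_key.encode()).hexdigest()

def verify_key(presented: str, stored_digest: str) -> bool:
    """Constant-time comparison avoids leaking digest prefixes via timing."""
    return hmac.compare_digest(hash_key(presented), stored_digest)

stored = hash_key("sk-local-123")   # written once, at key-creation time
verify_key("sk-local-123", stored)  # True
verify_key("sk-wrong", stored)      # False
```

A plain hash is enough because API keys are long random strings; a low-entropy secret such as a password would need a slow KDF instead.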
A software subscription that makes your AI infrastructure pay for itself
You own the hardware; hardware recommendations are included. Flat pricing avoids stacking AI subscriptions seat by seat.
Team
Private AI for small teams starting with one machine.
- 10 users (Live)
- 1 paired machine (Live)
- 3 active models (Live)
- Magic link auth (Live)
- OpenAI-compatible API (Live)
- Web chat (Beta)
Startup
The target plan for SMBs replacing stacked AI seats.
- 50 users (Live)
- 8 active models (Live)
- SSO SAML / OIDC (Beta)
- 90-day audit logs (Soon)
- API budgets and scopes (Live)
- Capability-aware model routing (Live)
Enterprise
For organizations that need compliance and priority support.
- Users on quote (Ask us)
- 20+ active models (Live)
- SCIM 2.0 (Soon)
- 12-month audit export (Soon)
- Custom domain (Ask us)
- 4h support and compliance services (Ask us)