Question 1

Which tier should I pick?

Accepted Answer

Hosted ($1,999/mo) is the fastest path — sign up, OAuth your tools, you're running in an hour. Self-Host (one-time) is the right call if your security or compliance team requires data to stay on your infrastructure. Enterprise is for deployments that need BAA, SOC 2 evidence, or BYO KMS.

Question 2

What does "LLM token usage at cost" mean on Hosted?

Accepted Answer

You pay only what the upstream model provider charges (OpenAI, Anthropic, OpenRouter, etc.). We pass through wholesale pricing with no markup — your monthly Hosted bill is $1,999 plus whatever your agents actually spent on inference. Most ops teams see $200-800/mo of tokens depending on volume.

Question 3

What hardware ships in the Self-Host tiers?

Accepted Answer

Five options. Spark ($11,900): NVIDIA DGX Spark, 128GB unified memory, runs up to 200B-class models. Workstation ($24,900): RTX 6000 Ada, 48GB VRAM, 8-12 agents. Dual-GPU ($44,900): 2x RTX 6000, 96GB VRAM, 70B FP16. Server ($72,900): single H100 80GB, 405B-class. Enterprise ($499,900+): DGX H100/H200 multi-GPU clusters for frontier workloads.

Question 4

What's the difference between Hosted and Self-Host with BYO LLM?

Accepted Answer

Hosted: we run the orchestration platform on AWS, you OAuth in. Easiest. Self-Host: you run the orchestration platform on your AWS/GCP/your own server. More control, lower long-term cost, harder to set up — but we install it for you.

Question 5

Can I switch tiers later?

Accepted Answer

Yes — your config and integrations export cleanly. Hosted → Self-Host: we transfer the config, you take ownership. Self-Host → Hardware: we ship the box, restore your config, you continue. Hosted ↔ Self-Host transitions take ~2 weeks; Hardware delivery: 4-6 weeks.

Question 6

Why no free tier?

Accepted Answer

Two reasons. First, every install requires real human work to configure, even when the buyer is technical. Second, we want customers, not freeloaders — paid tiers signal you're serious about hiring AI agents that handle real work, and we can give you the install attention that deserves. If you need a true free starting point, the Hosted tier is monthly billing — cancel after 30 days for $1,999.

Question 7

What happens if I stop paying the annual Self-Host renewal?

Accepted Answer

Your installation keeps running. The $1,200/yr covers ongoing software updates and email support; without it you stop receiving updates and support tickets but your existing deployment continues to function indefinitely on the version you have.

Question 8

Can I use Claude / Bedrock / my own model?

Accepted Answer

Yes. Three native integration paths: (1) Any OpenAI-compatible endpoint (OpenRouter, Together, Fireworks) — set two env vars. (2) Local Ollama on your hardware — default install. (3) AWS Bedrock with HIPAA BAA — native provider, set PROVIDER=bedrock plus your AWS region. Your data never leaves your AWS account.

Question 9

Can my agents talk to our internal tools?

Accepted Answer

Yes. Paste your API docs (OpenAPI spec, Postman collection, or just markdown describing endpoints) and Inhouse generates the tool wrapper automatically — usually under a minute from spec to working agent integration. When the API drifts (renamed fields, new required headers, deprecated endpoints) the wrapper inspects the response and self-corrects. We've used this for HubSpot, Salesforce, Stripe, Zendesk, Linear, Asana, and several customers' homegrown billing/ticketing platforms. Auth methods supported: OAuth 2.0 + PKCE, bearer tokens, API keys, basic auth, mTLS.

Question 10

Do you offer a pilot?

Accepted Answer

Yes — book a fit call and we can scope a 30-day pilot before any commitment. Most customers prefer to evaluate against one specific workflow before sizing the full install.

Hosted, self-hosted, or enterprise.

Hosted

Self-Host

Enterprise

What you pay for, plainly

✓Included

×Not included

Common questions

Not sure which tier fits?