Integrations
Where your model runs is your call.
Three native providers. Same agent platform. Pick the one that matches your compliance posture — switch later without changing a line of agent config.
Local Self-Hosted
DefaultAir-gappable, fully sovereign. Your models, your hardware, zero external dependencies.
Configuration
# Default — no config needed OLLAMA_URL=http://localhost:11434 # Pull a model and go ollama pull qwen3:235b-a22b
Available models: Qwen3 235B, Llama 3.3 70B, Mistral Large, Kimi K2.6, DeepSeek V3, any GGUF model
Compliance posture
- ›Data never leaves your network
- ›No vendor cloud dependency
- ›Full air-gap capable
- ›No training on your data — models run locally
Best for
Teams that need complete data sovereignty, defense/government, or want the lowest long-term cost.
OpenRouter
Fastest setupAccess 200+ models from one API key. Cheapest path to production — pay per token, no hardware.
Configuration
PROVIDER=openrouter OPENROUTER_API_KEY=sk-or-...
Available models: Kimi K2.6, GPT-4o, Claude Sonnet, Qwen3, Llama 3.3, Gemini 2.5 Pro, 200+ more
Compliance posture
- ›Data routes through OpenRouter (US)
- ›No BAA available
- ›OpenRouter retains logs per their policy
- ›Best for non-regulated workloads
Best for
Startups, non-regulated teams, or anyone who wants to test multiple models before committing to hardware.
AWS Bedrock
HIPAA / FedRAMPNative Bedrock integration. Claude, Llama, and Mistral on AWS with HIPAA BAA and FedRAMP path.
Configuration
PROVIDER=bedrock AWS_REGION=us-east-1 # Uses your IAM role / instance profile
Available models: Claude Opus, Claude Sonnet, Llama 3.3, Mistral Large, Cohere Command R+
Compliance posture
- ›Data stays in your AWS account
- ›AWS BAA covers Bedrock inference
- ›FedRAMP Moderate path available
- ›No training on your data — AWS Bedrock policy
Best for
HealthTech, FinTech, LegalTech, Government — any team that needs a BAA or FedRAMP.
Custom integrations
Connect any tool, not just the ones we ship.
18 integration packs are pre-built (HubSpot, Salesforce, Stripe, Zendesk, etc.). For everything else — including your homegrown internal tools — Inhouse generates the API wrapper itself.
Paste API docs, get a working integration
Drop in an OpenAPI spec, Postman collection, or a markdown description of the endpoints. Inhouse parses it, generates tool bindings, and your agents start calling it within a minute.
Self-heals when APIs change
Field renamed? New required header? Endpoint deprecated? Agents inspect the response body, infer the correction, and retry — usually without a redeploy.
Every auth method, native
OAuth (with your own client app), bearer tokens, basic auth, API keys, mTLS. Credentials encrypted at rest with AES-256-GCM. Per-agent scoping if you need it.
Production-aware
Respects rate limits and retry-after headers. Exponential backoff on 5xx. Circuit breaker if an upstream is degraded. All actions logged in your audit trail.
- ✓Rate-limit aware (429 retry-after honored)
- ✓Exponential backoff · circuit breaker
- ✓Every call logged with timing + outcome
Used in production for HubSpot, Salesforce, Stripe, Zendesk, Linear, Asana, and several customers' homegrown internal systems.
One platform, any provider
Your agents, roles, tools, and workflows stay identical regardless of which provider you choose. Switching is a config change — not a migration.
Pick your provider
Local Ollama for sovereignty, OpenRouter for speed, Bedrock for compliance.
Set the env var
One line: PROVIDER=bedrock. That's it. No code changes, no agent reconfiguration.
Agents come online
Same roles, same tools, same personality. Different inference backend. Switch anytime.
Not sure which provider fits?
15-min call. We'll match your compliance requirements, budget, and workload to the right inference path.
Book a 15-min demo →