Solution

The platform under the agents.Reference architectures for organisations shipping more than one.

A single agent is a project. A portfolio of agents is a platform. We design and stand up the foundational architecture — shared MCP servers, shared observability, shared eval infrastructure, shared cost attribution — so your second, third, and fifteenth agent are cheaper, faster, and more consistent to ship than the first.

Shared
MCP servers · observability · eval infra across all agents
~50%
faster to ship the second agent vs. the first
Per-agent
cost attribution from day one
Multi-tenant
ready by default for IT services / SaaS deployments
Use cases

Where platform architecture pays back

Multi-product orgs

You are about to ship the second, third, and fifth agent. Each needs the same Odoo MCP, the same Langfuse setup, the same eval framework.

IT services teams

You sell agentic delivery to multiple customers. Multi-tenant from day one, with per-customer cost attribution.

SaaS platforms

You expose AI features to your end customers. The platform layer is the difference between scaling and re-architecting at 100 customers.

Industries served
IT ServicesEnterprise SoftwareSaaS PlatformsMulti-Product Organisations
System architecture

How the system is wired

Platform layering
Agentsproduct · customer · workflowOrchestrationshared patternsShared MCPcanonical toolsObservabilityeval · cost · auditRuntimecontainers · queues
Technology

Platform reference stack

Agent layerPer-product / per-customer agents · supervisor + handoff patternsShared MCP serversOrg-wide canonical tools — one per system, reused everywhereObservability + evalLangfuse multi-tenant · shared eval registry · cost attributionAuth + auditShared identity provider · audit log sink · policy registryRuntimeContainerised · CI/CD · staging / prod parity · queue-based fan-out
Methodology

Platform delivery process

01

Audit existing agents

What is shared, what is duplicated, what is drifting between teams.

02

Define the canonical tools

One MCP server per system. Migration plan for the existing per-team duplicates.

03

Shared observability + eval

Per-tenant traces, shared eval registry, cost attribution surface across all agents.

04

Multi-tenant rollout

Tenant scoping in MCP servers, in observability, in audit. Migrate the first agent. Measure. Migrate the rest.

05

Governance + maintenance

Who owns the shared tools, who reviews changes, how new agents onboard. Documented.

Security & scalability

Platform-grade security & scale

Multi-tenancy by design

Scoped tokens, scoped data, scoped audit. One platform serves many tenants without leaking between them.

Per-agent cost attribution

Cost per agent · per customer · per workflow surfaced from day one. No more "the AI bill went up, we do not know why."

Shared audit log

One immutable trace per tenant, queryable by your security and compliance teams.

Horizontal scale

Queue-based fan-out, prompt caching, parallel sub-agents. Designed for 10× growth without re-architecture.

Integrations

Platform integration points

  • Identity: Okta · Azure AD · Auth0
  • Observability: Langfuse · Datadog · custom OTel sinks
  • Runtime: Docker · Kubernetes · Cloud Run · ECS
  • Queues: SQS · Redis Streams · Cloud Tasks
  • Storage: PostgreSQL · S3-compatible · Pinecone / pgvector
Business impact

Why a platform layer pays back

Most organisations build their second agent from scratch because the first was a one-off. The platform layer is the difference between linear and sub-linear cost-to-ship.

~50%
faster on the second agent vs. the first
~30%
cheaper steady-state operating cost from shared infra
0
duplicate MCP servers in maintenance after migration
Case studies

How recent engagements actually shipped

IT Services · 6 weeks discovery → handoff

PR review pipeline cuts senior-engineer time 4×

Mid-market IT services firm · Ahmedabad · 180 engineers

Problem

Senior engineers were spending 8–12 hours per week each on first-pass PR review across a 6-team monorepo. Junior PRs waited 2+ days for sign-off; velocity stalled; the highest-judgement people were doing the lowest-judgement work.

Solution

A multi-agent CI workflow triggered on every PR open. Three specialist agents run in parallel — a reviewer (Claude Sonnet 4.6) for code-correctness and convention, a security agent for risk patterns, and a test-generator agent for coverage gaps. Outputs are consolidated into a single PR comment within 90 seconds. Humans review the agent's synthesis, not the raw diff.

Claude Sonnet 4.6 (reviewCustom MCP server: GitHub APIGitHub ActionsLangfuse traces
~36 hrs/wk
senior engineer time reclaimed across the team
< 3 days
payback period at loaded-cost rate
review throughput per senior engineer
0
production regressions traced to AI-passed reviews in 90 days
Read the full case study
ERP / Enterprise Software · 8 weeks discovery → handoff

ERP support triage agent eliminates the Level-1 backlog

Odoo-based ERP partner · Gujarat · ~60 implementation consultants

Problem

Customer support backlog had grown to ~340 open tickets. Level-1 triage took 12–20 minutes per ticket on average, and 35% of tickets were misrouted on first pass — every misroute became a customer-facing escalation churn.

Solution

A supervisor-pattern agent that ingests email and form submissions, classifies the issue, queries the customer's Odoo instance for context (open invoices, recent modules, last login, current contracts), drafts a Level-1 response with the right module screenshots inline, and routes complex tickets to the right consultant with a pre-filled handoff brief.

Claude Sonnet 4.6 (drafting)Custom MCP server: Odoo (read-only customer / order / invoice scope)Supervisor patternPydantic schemas
340 → 18
open L1 backlog within 6 weeks of go-live
~60%
L1 staffing reduction on agent-eligible categories
$2.30
average cost per agent-resolved ticket
8 wks
engagement, discovery to handoff
Read the full case study
Financial Services / Compliance · 10 weeks discovery → audit sign-off

Audit-grade compliance review ships under multi-layer guardrails

Regulated financial-services intermediary · India · 95 employees

Problem

Manual compliance review of vendor and onboarding documents was the bottleneck for new-customer activation. Every traffic spike threatened SLA breach. Reviewer fatigue led to inconsistent flagging — some weeks too strict, some weeks too loose, with no defensible pattern.

Solution

A single-agent system wrapped in four guardrail layers: an input filter that detects and redacts PII / strips prompt-injection patterns; a versioned policy registry the agent must cite by clause ID for every conclusion; output validators (schema + LLM-as-judge cross-check); and a human-in-the-loop gate on anything scored above a defined risk threshold. Every decision is appended to an immutable audit log.

Custom detectorsClaude Opus 4.7 (final ruling)Versioned in repoPydantic v2
0
audit findings across 4 quarterly reviews
3.2×
throughput per reviewer
< 6 hrs
customer activation time
10 wks
engagement, discovery to audit sign-off
Read the full case study
Deep dives

Read what we publish on this

Production

Claude Code Artifacts turn terminal output into live review pages: what Team and Enterprise buyers should pilot first

Artifacts in Claude Code beta publish self-contained HTML to claude.ai that republishes to the same URL as the session progresses, with version history and org-only sharing. Strict CSP, no external fetch, no backend. Requires Team or Enterprise and claude.ai login. Here is the workflow I use for PR walkthroughs and incident timelines without screenshot threads in Slack.

Read the post
MCP

MCP Enterprise-Managed Authorization is stable: how IdP-provisioned connector access replaces per-server OAuth hell

EMA makes the organization IdP the decision-maker for which MCP servers a user can reach. Admins enable connectors once; clients exchange an Identity Assertion JWT for scoped tokens without redirecting every employee through OAuth per server. Anthropic ships it across Claude, Claude Code, and Cowork; VS Code supports it; Okta is the first IdP. Here is the pilot I run before July 28 stateless transport work lands.

Read the post
Architecture

Cursor cloud subagents in 2026: /in-cloud, /babysit, and /automate without losing your local guardrails

Cursor 3.7 lets you spin subagents in cloud VMs with /in-cloud, iterate on a PR until merge-ready with /babysit, and hand off between local and cloud sessions. Cursor 3.8 adds /automate and five GitHub review triggers. Here is the workflow I use so parallel cloud work does not bypass Auto-review, environment snapshots, or pre-push /review.

Read the post
Production

Agentjacking is real: poisoned Sentry errors can hijack Cursor, Claude Code, and Codex without touching your repo

Tenet Threat Labs injected a fake stack trace through a public Sentry DSN and watched 100+ coding agents execute attacker commands during normal triage. No git write access required. The agent treats the error as ground truth. Here is how I harden observability MCP feeds, scope triage prompts, and block auto-exec on untrusted telemetry.

Read the post
Frequently asked

Enterprise AI Architecture — questions buyers ask

Audit your current architecture

A platform review starts with where your existing agents are — what is shared, what is duplicated, and what is going to bite at the next scale step.