Blog
Engineering deep-dives, research notes, product updates and customer stories — from the team building governed, autonomous agents.
Prompts are not a security boundary. Here's the architecture we landed on after our first six months in production.
What we learned shipping the most-deployed agent on the platform — and why "drafts only" beat "full automation".
A framework for thinking about approval gates as enforceable contracts between operators and the agents they deploy.
A B2B SaaS team rolled out the CRM Specialist in two weeks. Here's the rollout plan, the metrics, and the surprises.
A look at the policy engine that sits between the model and the world — written in Rust, powered by CEL.
Single-tenant feels safer. We argue, with the threat model in hand, that the opposite is true for agent workloads.
Most "autonomous" agent demos are autonomy theatre. We unpack what the data says about user trust and adoption.
17 new integrations, the agent simulator, granular blast-radius controls — and what we're shipping next.
How we run 12,000 eval cases on every PR — and what we do with the cases that flake.
Newsletter
One email a month. No fluff, no growth-hacking — just the deepest piece we wrote that month.