Resources for production agents.

Plain-language explanations and technical notes from our work building data foundations, agent runtimes, MCP tools, evals, and closed-loop intelligence. Start anywhere — Learn for concepts, Use cases for patterns, Research for open questions, AI Index for the A-Z.

Featured guide
Evals before launch.
Featured resource

Workflow Evals

The test suite your AI workflows have to pass before any change reaches users — measuring quality, latency, cost, and safety on real production data instead of vibes.
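
As a rough illustration of what such a pre-release gate might look like, the sketch below checks one candidate change against limits on the four dimensions named above. The `EvalResult` and `Thresholds` shapes, the field names, and every default value are assumptions made for this example, not a description of any particular product.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EvalResult:
    """Aggregate scores for one candidate change (hypothetical shape)."""
    quality: float            # fraction of cases judged acceptable, 0.0 to 1.0
    p95_latency_ms: float     # 95th-percentile latency in milliseconds
    cost_per_run_usd: float   # average cost of one workflow run
    safety_violations: int    # number of cases that tripped a safety check


@dataclass(frozen=True)
class Thresholds:
    """Release limits; the default values are illustrative only."""
    min_quality: float = 0.90
    max_p95_latency_ms: float = 2000.0
    max_cost_per_run_usd: float = 0.05
    max_safety_violations: int = 0


def gate(result: EvalResult, limits: Thresholds) -> list[str]:
    """Return the names of failed checks; an empty list means the change may ship."""
    failures = []
    if result.quality < limits.min_quality:
        failures.append("quality")
    if result.p95_latency_ms > limits.max_p95_latency_ms:
        failures.append("latency")
    if result.cost_per_run_usd > limits.max_cost_per_run_usd:
        failures.append("cost")
    if result.safety_violations > limits.max_safety_violations:
        failures.append("safety")
    return failures
```

Returning the list of failed dimensions, rather than a single pass/fail flag, is one way to make a blocked release explain itself.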

Read guide
Featured research
Synthetic personality that holds.
Research

Synthetic Personality

Before an AI agent can be useful to anyone, it has to be something — a coherent identity that holds up across users, sessions, and adversarial pressure. This is the research track that defines what that means and how to keep it stable.

Read research

Latest articles

What we are publishing around agent infrastructure, evals, source graphs, and closed-loop operations.

Highlights

Protocols, control planes, and operating practices behind the systems we launch.

Browse the library

A structured index for deeper concepts from the capability map.

Data substrate

Agent runtime

Evaluation

Operations

Research notes

Conversation Intelligence

Turning every approved conversation — support, email, team chat, customer messaging, voice, sales — into structured signal you can act on, instead of anecdotes that evaporate when a ticket closes.

Chat Orchestration Runtime

The end-to-end architecture of modern conversational AI systems: model-agnostic, client-agnostic, plugin-driven runtimes that coordinate intent, context, retrieval, tools, reasoning, reflection, memory, and rendering — with the LLM as one interchangeable component, not the system.

AI-Native Dashboards

A study of conversational, adaptive, living dashboard interfaces: workspaces that begin as a blank canvas with a single conversational input and build themselves in real time as the user asks, persisting widgets, layouts, and memory across sessions.

Prompt-Native Widgets

Generative, context-aware dashboard components whose logic and rendering are defined by natural language prompts rather than hardcoded configurations — runtime-generated analytical surfaces that retrieve, reason, link, and adapt instead of merely displaying.

Production Agent Interfaces

The chat surface as an operating console — knowledge bases plugged in, tools connected, agents on a roster, with real-time visibility into context budget, token spend, model choice, and concrete savings opportunities. The interface that lets a team actually run an agent in production, not just demo one.

Conversation Listeners

Opt-in listeners that capture conversations from every channel an organization uses — support, email, team chat, customer messaging, webchat, sales tools, voice — and route them into the signal-extraction pipeline with consent and retention rules attached.

Signal Extraction

Turning raw conversation transcripts into structured fields — intent, subject, sentiment, CSAT, tool performance, product mentions — that downstream systems can query, dashboard, and act on.
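
To make "structured fields" concrete, here is a minimal sketch of one possible record shape and a query over it. The `ConversationSignal` class, the value scales, and the `negative_by_intent` helper are hypothetical names chosen for this example; the extraction step that would populate the record is not shown.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class ConversationSignal:
    """One extracted record per conversation (hypothetical shape)."""
    conversation_id: str
    intent: str
    subject: str
    sentiment: float                 # assumed scale: -1.0 (negative) to 1.0 (positive)
    csat: Optional[int] = None       # assumed 1 to 5; None when no survey was answered
    tool_performance: dict[str, bool] = field(default_factory=dict)  # tool name -> succeeded
    product_mentions: list[str] = field(default_factory=list)


def negative_by_intent(signals: list[ConversationSignal]) -> dict[str, int]:
    """Count conversations with negative sentiment, grouped by intent."""
    counts: dict[str, int] = {}
    for signal in signals:
        if signal.sentiment < 0:
            counts[signal.intent] = counts.get(signal.intent, 0) + 1
    return counts


# Example records with made-up values, for illustration only.
signals = [
    ConversationSignal("c1", "refund", "billing", -0.6, csat=2, product_mentions=["Pro plan"]),
    ConversationSignal("c2", "refund", "billing", 0.4),
    ConversationSignal("c3", "login", "auth", -0.2),
]
summary = negative_by_intent(signals)
```

Once transcripts are reduced to records like these, a dashboard or alert becomes an ordinary query over typed fields rather than a read-through of raw text.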

Conversation Forensics

Incident detection and root-cause analysis on human↔agent conversations: replaying threads, reading the context around negative sentiment, determining whether the user actually resolved their problem, and turning the answer into a learning artifact the system can use next time.

Capability map