The Matrix Blog

Building production AI agents, in the open.

Field notes from building Matrix — a multi-tenant agent platform that runs over chat, real-time voice, and autonomous tasks. Voice wire protocols, agent memory, RAG, MCP, cognitive architectures, access control, and the war stories behind each.

35 articles and counting.

Vision & Category

5 posts

Vision & Category·9 min read

Stop Writing Prompt Loops: Why AI Agents Need a Platform, Not a Framework

Frameworks give you a loop around model.generate(). Production agents need ten more layers. Why agents are a platform problem, not a framework one.

Sep 6, 2025Read

Vision & Category·8 min read

Multi-Tenancy Is Not a Feature You Bolt On

Tenancy is the floor every query stands on, not a late addition. How Matrix isolates tenants from line one — JWT, TenantContext, orgId filtering, and BYOK encryption.

Sep 16, 2025Read

Vision & Category·7 min read

Personas as Data, Not Code: Ship an Agent by Filling Out a Form

In Matrix there are no hardcoded personas. An agent is a configured record you create through a form or one POST — no code, no redeploy, persisted in Neo4j.

Sep 22, 2025Read

Vision & Category·10 min read

The 10-Layer Agent Stack You'll Build Anyway (So We Built It)

The ten layers of agent infrastructure every serious agent app ends up building — and how Matrix composes them into one runtime.

Feb 24, 2026Read

Vision & Category·9 min read

Build vs. Buy: The Real Cost of Rolling Your Own Agent Infra

An honest, line-item accounting of what 'just build it' actually costs in engineering-months — and the maintenance tail nobody budgets for.

Mar 18, 2026Read

Tutorials & Use Cases

5 posts

Tutorials & Use Cases·8 min read

Everything Is a Node: The Generic Entity Model

Matrix has no per-domain @Node classes. Organizations, agents, sessions, leads, memories — all of it is EntityType / EntityNode rows in one Neo4j graph.

Sep 11, 2025Read

Tutorials & Use Cases·9 min read

Build a Voice Tutor in an Afternoon

A hands-on walkthrough: stand up a real-time voice tutor that knows a syllabus, remembers each student, and renders math on a blackboard — no agent code.

Dec 3, 2025Read

Tutorials & Use Cases·9 min read

From Zero to Outbound Campaign: A Recruiter Agent Walkthrough

Build an AI recruiter agent that voice-calls a candidate list with a per-call objective — persona, campaign, CSV audience, paced dispatch, and dispositions.

Feb 1, 2026Read

Tutorials & Use Cases·9 min read

Turn Matrix Into Your CRM: Leads, Custom Fields, and Zoho Sync

Because everything is an entity, Matrix doubles as a lightweight AI CRM your agents write to directly — per-org custom fields and optional Zoho sync included.

Feb 15, 2026Read

Tutorials & Use Cases·9 min read

Deploying an Agent Platform on Cloud Run + Neo4j

A real-world topology for running an agent platform on Google Cloud — three Cloud Run services, a self-hosted Neo4j VM, and the gotchas that bite in production.

Mar 21, 2026Read

Skills, Tools & MCP

4 posts

Skills, Tools & MCP·7 min read

Composing Agents From Four Primitives: Tool, Skill, Knowledge, Toolbox

An agent in Matrix is assembled, not coded — mix four primitives and the runtime composes one coherent tool surface and prompt, with no per-feature wiring.

Sep 26, 2025Read

Skills, Tools & MCP·8 min read

The Built-In Toolbox: web_search, bash, and a Sandbox per Agent

Every Matrix agent can opt into seven INTERNAL-transport built-in tools — web_search, fetch_url, bash, file_*, grep — each scoped to a per-(org, agent) sandbox. No keys, no quota.

Sep 28, 2025Read

Skills, Tools & MCP·7 min read

Import Any Anthropic Agent Skill From a GitHub URL

Matrix speaks the Anthropic Agent Skills format, so any GitHub repo that publishes a SKILL.md is one POST away from a reusable skill on your agents.

Jan 20, 2026Read

Skills, Tools & MCP·7 min read

MCP Both Directions: Your Platform as Client and Server

Matrix sits on both sides of the Model Context Protocol — it exposes its own MCP server and its agents consume external ones, under one JWT auth model.

Jan 24, 2026Read

Voice

5 posts

Voice·9 min read

We Put Gemini Live on a Phone Line. Here's Everything That Broke.

A first-person war story of bridging Exotel telephony to the consumer Gemini Live API — the wire-protocol traps, token bugs, and Cloud Run quirks, in the order they bit.

Oct 11, 2025Read

Voice·8 min read

Barge-In and the Wire-Protocol Gotchas of Real-Time Voice Agents

The exact wire knobs that make real-time voice AI work: barge-in, snake_case audio frames, the Constrained endpoint, and the message shapes that bite.

Oct 28, 2025Read

Voice·8 min read

Inside the Audio Pipeline: 48kHz Mic to Sub-Second Replies

A round-trip tour of the audio plumbing behind an instant-feeling voice agent: 48kHz mic capture, 16kHz PCM chunks, 24kHz playback, and barge-in.

Nov 11, 2025Read

Voice·8 min read

One Agent, Two Voice Paths: Telephony Bridge vs. Browser-Direct

The same AI voice agent, reachable two ways: a server-held telephony bridge or a browser-direct WebSocket. Here's the architecture trade-off.

Nov 29, 2025Read

Voice·9 min read

Outbound AI Calling Campaigns That Don't Sound Like Robocalls

Run outbound voice campaigns with a real agent — memory, per-call objective, tools — instead of a dumb dialer. Here's the full pipeline and the one hazard to know.

Jan 25, 2026Read

Memory

5 posts

Memory·9 min read

Agents That Actually Remember You — Across Chat and Phone

The feature users feel: an agent that knows who you are whether you call or type. One memory pool per contact, joined across voice and chat.

Dec 14, 2025Read

Memory·8 min read

Embed-on-Write, Recall-on-Read: Vector Memory With Zero New Infra

How Matrix gives agents persistent vector memory by storing 768-d embeddings on the graph node and querying Neo4j's native HNSW index — no separate vector DB.

Dec 15, 2025Read

Memory·9 min read

Working, Episodic, Semantic, Procedural: The Four Agent Memories

The CoALA memory taxonomy, mapped to concrete Matrix mechanisms — what's working memory, what survives a session, and what only a human can edit.

Dec 19, 2025Read

Memory·7 min read

Reconcile-on-Write: How to Stop Agent Memory From Rotting

Naive agent memory rots into near-duplicates and stale facts. Matrix reconciles every semantic write — cosine-gated UPDATE/ADD with an optional LLM arbiter.

May 19, 2026Read

Memory·8 min read

Temporal Validity: Let the Latest Fact Win Without Losing History

Facts about a contact change over time. Matrix gives every memory temporal validity so the latest fact wins recall while the old one is kept for history.

May 29, 2026Read

Knowledge & RAG

3 posts

Knowledge & RAG·8 min read

RAG You Set Up by Dragging a PDF Into a Browser

Standing up a RAG pipeline shouldn't be a project. In Matrix you create a corpus, drag in files, and your agent can search them — no plumbing.

Dec 26, 2025Read

Knowledge & RAG·8 min read

Auto-Wired Retrieval: Your Agent Shouldn't Need RAG Plumbing

Most RAG stacks make you hand-wire a retriever into the agent loop. In Matrix, attaching a Knowledge corpus is the only step — the search tool appears on its own.

Jan 3, 2026Read

Knowledge & RAG·8 min read

GraphRAG in One Flag: From Chunks to an Entity Graph

Plain vector RAG retrieves isolated chunks and misses facts in the relationships between them. Matrix builds an entity graph from one flag.

Jan 4, 2026Read

Security & Governance

3 posts

Security & Governance·9 min read

Row, Field, and Type Security for AI Agents

Per-principal row, field, and type security on the entity model — enforced centrally so it covers dashboards, the API, find_records, and every agent tool.

Mar 23, 2026Read

Security & Governance·8 min read

Agents as Principals: An Agent Should Never See What Its Caller Can't

An agent is a security principal, not a god-mode service account. Matrix scopes what an agent can read to who's driving it — via Agent.mode.

Apr 24, 2026Read

Security & Governance·8 min read

Hierarchical Grants a Child Can Never Exceed Its Parent

How Matrix AND-composes a principal's grant with every ancestor's up the reportsTo chain, making delegated admin safe by construction.

May 4, 2026Read

Cognitive Core & Autonomy

5 posts

Cognitive Core & Autonomy·11 min read

From Chatbot to Cognitive Architecture: CoALA in Production

Most agents are a while-loop around a model. Matrix runs a real cognitive architecture — one CoALA decision cycle powering interactive and autonomous agents alike.

May 11, 2026Read

Cognitive Core & Autonomy·7 min read

One Decision Cycle for Interactive and Autonomous Agents

Interactive and autonomous agents look like two systems. In Matrix they're one decision cycle that swaps only the DECIDE seam by Agent.mode.

May 14, 2026Read

Cognitive Core & Autonomy·8 min read

Multi-Candidate Decisions: Don't Let Agents Take the First Idea

A greedy agent that runs its first thought is brittle. Matrix's autonomous planner proposes several candidate actions, scores them, then commits.

May 18, 2026Read

Cognitive Core & Autonomy·10 min read

Self-Improving Agents That Can't Go Rogue: Propose, Approve, Apply

Letting an agent edit its own behaviour is powerful and dangerous. Matrix lets it propose changes to its procedural memory — a human approves before anything changes.

Jun 15, 2026Read

Cognitive Core & Autonomy·7 min read

Adaptive Compute: Agents That Think Harder Only When It Helps

Today we shipped adaptive compute and closed the last four gaps between our agent runtime and the CoALA paper — dynamic search breadth and depth, reasoning-scored memory, and learning as a choice.

Jun 24, 2026Read