From the CoreCast
engineering team.

We write about what we're learning building AI agent memory infrastructure — persistent context, semantic recall, compliance, and the messy reality of production agent systems.

AI agent conversation flow showing context degradation across multiple turns
Engineering  •  May 5, 2026

After 10 turns, most agents start forgetting — and users notice long before they can articulate why. Here's the architectural reason this happens, and what persistent memory actually requires to fix it.

Read article
Diagram comparing vector semantic search and temporal recall for agent memory retrieval
Research  •  April 28, 2026

Vector search answers "what's relevant?" Temporal recall answers "what happened recently, in order?" Most production agents need both — here's how to decide what to weight and when.

Read article
Architecture diagram showing the boundary between context window and persistent external memory
Engineering  •  April 21, 2026

Treating your context window as memory works in demos and falls apart in production. Here's the precise architectural distinction that separates teams who get this right from teams who rebuild it twice.

Read article
Infrastructure diagram showing semantic memory indexing pipeline for high-volume agent conversations
Engineering  •  April 14, 2026

The three architectural mistakes in our first implementation — synchronous indexing, flat memory model, no compaction — and how we fixed them without breaking existing customers.

Read article
Cost comparison between context stuffing and selective memory retrieval at scale
Playbook  •  April 7, 2026

Context stuffing feels cheap until your token bill lands. A clear-eyed look at the unit economics — and the three hidden costs beyond tokens — that make memory infrastructure the smarter long-term bet.

Read article
Developer using an AI coding agent with persistent codebase memory and architectural knowledge
Use Cases  •  March 31, 2026

A coding agent with no memory is expensive autocomplete. Here's how to build one that remembers your architecture, your conventions, and your decisions — across sessions and across the whole team.

Read article
Architecture diagram showing multi-tenant memory isolation with per-tenant partitioning
Engineering  •  March 24, 2026

Three architectural mistakes — query-time isolation, flat scope model, missing retention controls — and the expensive lessons they taught us about building memory for SaaS products.

Read article
Customer support agent interface with AI-assisted recall of prior interactions and resolution history
Use Cases  •  March 17, 2026

Support agents without memory make "I already told you this" complaints structurally worse. Here are the specific recall patterns — issue history, customer profiles, temporal queries — that move the needle.

Read article
Compliance framework showing GDPR and HIPAA requirements for AI agent memory retention
Compliance  •  March 10, 2026

Agent memory is personal data — and most teams discover what that means in their first enterprise sales call. Here are the engineering requirements GDPR and HIPAA actually create for your memory layer.

Read article
Integration diagram showing LangGraph state graph connected to CoreCast persistent memory layer
Playbook  •  March 3, 2026

LangGraph handles execution state. CoreCast handles persistent memory. Here's exactly how they fit together — and why the combination covers what neither solves alone.

Read article