About

I work on AI agent memory research and full-stack engineering. Books, podcasts, and ideas show up here too. This site is where I write about whatever earns the post.

Research: agent memory

A tiered memory system for AI agents. Per-session SQLite working memory on the device side, a Postgres-backed shared corpus on the team side, HDBSCAN clustering and a cross-encoder reranker as sidecars. Five-tier RRF recall, workspace scoping, and a dormant cognitive-primitives extension (ACT-R, Hebbian, Bayesian) that the next eval cycle will resurrect.

Currently sitting at R@5 = 97.4% on LongMemEval-S, ahead of every published reference. The interesting work is what comes after that ceiling: measuring memory the way human expertise actually behaves (decay, co-activation, surprise weighting) rather than as flat similarity retrieval.

Day-job: infrastructure and full-stack

The rest of the stack, written about with the same honesty. Topics:

Cluster ops and GitOps. What broke this week, what survived me.
Observability and instrumentation. The alerts that mean something, and the ones I muted three months ago.
Backend and frontend engineering. Small interfaces, errors as values, opinions I'll soften when something burns.
Databases at scale. The query that works in dev and tanks in prod.
Full-stack work. Boring on the resume, load-bearing in the codebase.
AI agent tooling in daily use. What saves time vs what feels productive but isn't.

Elsewhere

Posts about anything I want to write about.