#architecture
-
2026-03-12
Signal Fusion: How Semantic, Relational, and Direct Signals Combine to Make Recommendations That Don't SuckEvery recommendation system that works well is fusing multiple signal types. The ones that don't understand this ship vibes-based retrieval and wonder why users leave. A taxonomy of signals, how they combine, and what the SOTA ecosystem gets right and wrong.
-
2026-03-10
From Single LLM Call to Deep Agent: An Honest Migration PathStart with one function call. Add skills when the prompt gets too long. A no-framework guide to building agents that actually ship.
-
2026-03-10
Signal Stability Classification: Inference Cost-Benefit in Hybrid Recommendation SystemsNot all behavioral signals deserve the same compute budget. Genre affinity changes over weeks; session mood changes in seconds. Classify by stability, infer by tier, and stop pretending daily batch is the answer to everything.
-
2026-03-10
Query-Theme-Keyed Search ExpansionTwo users search 'sleep' and get different results — with no LLM at query time. How pre-computed, theme-keyed expansion terms turn a flat search into something that actually knows you.
-
2026-03-10
Pre-computed Personalization: The Offline Agent PatternWhy your personalization agent should never run at request time. The LLM does its heavy lifting on a schedule; your product serves the artifacts. Zero latency, infinite scale.
-
2026-03-10
The Multi-Artifact Output PatternOne LLM call, multiple output shapes for multiple consumers. Design your schema like a protocol, not an afterthought.