A study vault

Eighteen open-source agent codebases, read sideways: how memory compression actually works across eight of them, why Strix uses XML tool calls where Claude Code uses native ones, how guardrails are layered in production agents.

This is a personal notebook. The orientation below is the path I took; you are welcome to wander.

18 projects · 12 concepts · 10 insights · 26 terms

Three ways in

By project

The codebase as the unit. Memory, tools, sandbox, all in one place.

Browse projects →

By concept

The idea across codebases. Memory compression, agent loops, caching.

Browse concepts →

By insight

The clever bit that doesn't fit a textbook chapter.

Insight gallery →

Featured tonight

Pre-baked reading orders. Each tour is a 30–90 min study session with explicit takeaways.

All tours →

Memory compression strategies — a deep dive

60 min · Engineers building long-running agents who need state to outlive a context window. 7 stops

Architecting an AI-Act-aware compliance agent

75 min · Engineers building agents that plug into CI/CD for an EU AI Act compliance / GRC platform. Audit, scope, and oversight are all first-class concerns. 9 stops

Most pedagogically dense projects

Ranked by how much you learn per page. Drill-down docs preserved as styled HTML.

All projects →

agent-cli

Claude Code

Anthropic's official agentic CLI. Streaming tool calls, prompt caching, thinking signatures, multi-agent subagents, slash commands.

agent-loop tool-calling-formats memory-compression prompt-caching +5

agent-platform verified has v2

OpenHands (v0)

All Hands AI, v0: an autonomous software-engineer agent. Event-sourced state, microagents, controller-level guardrails.

agent-loop tool-calling-formats memory-compression prompt-caching +4

security-agent verified

Strix

Open-source 'AI hacker' for autonomous pentesting. XML tool format, markdown-as-skills, LLM-based dedupe, module-level agent graph.

agent-loop tool-calling-formats memory-compression multi-agent-coordination +4

Latest insights

One non-obvious trick per card. The clever bits that don't fit a textbook.

All insights →

OpenHands (v0) ●●●

Memory compression preserves credentials, payloads, task IDs explicitly

Generic "summarize this conversation" loses the bits the agent needs to keep working. Mature systems enumerate preservation rules.

memory-compression
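To make the idea of enumerated preservation rules concrete, here is a minimal sketch. It is not taken from OpenHands: the patterns, the `task-N` ID format, and the `summarize` callable are all hypothetical, and a real system may state its rules in the summarizer prompt rather than in code. The point is only that anything matching a rule is carried over verbatim instead of being left to the summary.

```python
import re
from typing import Callable

# Hypothetical preservation rules: anything matching these is kept verbatim.
PRESERVE_PATTERNS = {
    "credential": re.compile(
        r"\b(?:api[_-]?key|token|password)\s*[=:]\s*\S+", re.IGNORECASE
    ),
    "task_id": re.compile(r"\btask-\d+\b"),  # assumed ID format, for illustration
}


def extract_preserved(messages: list[str]) -> list[str]:
    """Collect every rule match across the messages, in order, without duplicates."""
    kept: list[str] = []
    for message in messages:
        for label, pattern in PRESERVE_PATTERNS.items():
            for match in pattern.findall(message):
                item = f"{label}: {match}"
                if item not in kept:
                    kept.append(item)
    return kept


def compress(messages: list[str], summarize: Callable[[list[str]], str]) -> str:
    """Summarize the messages, then append the preserved items unchanged."""
    preserved = extract_preserved(messages)
    summary = summarize(messages)  # in practice an LLM call; a stub works for testing
    if not preserved:
        return summary
    kept_block = "\n".join(f"- {item}" for item in preserved)
    return f"{summary}\n\nPreserved verbatim:\n{kept_block}"
```

Passing a stub such as `lambda msgs: "User logged in."` as `summarize` shows the behaviour: the summary is lossy, but the password and task ID still appear in the compressed state.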

Strix ●●●

Markdown-as-prompt-library architecture

Decouples the agent's loop from its expertise. Domain experts contribute via PR; the loop almost never changes; the library evolves weekly.

skills-as-md agent-loop
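A rough sketch of what that decoupling can look like, under my own assumptions: skills live as `.md` files in one directory, and the loop only knows how to load them by name. The function names, the directory layout, and the `## Skill:` heading are hypothetical, not Strix's actual code.

```python
from pathlib import Path


def load_skills(skills_dir: Path, names: list[str]) -> str:
    """Read the requested skill files and join them into one prompt fragment."""
    parts = []
    for name in names:
        path = skills_dir / f"{name}.md"
        if path.is_file():  # silently skip skills that do not exist
            body = path.read_text(encoding="utf-8").strip()
            parts.append(f"## Skill: {name}\n{body}")
    return "\n\n".join(parts)


def build_system_prompt(base: str, skills_dir: Path, names: list[str]) -> str:
    """The loop's only contact with expertise: append the loaded skills to the base prompt."""
    skills = load_skills(skills_dir, names)
    return base if not skills else f"{base}\n\n{skills}"
```

Adding expertise then means adding a markdown file; neither function changes.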

Strix ●●●

LLM-based deduplication that reasons about root cause

Two pentest reports describing the same SQL injection with different payloads aren't textually similar — but they should dedupe. Hashing fails; LLM reasoning works.

memory-compression multi-agent-coordination
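A small sketch of why hashing fails here and where the judgment step slots in. Everything in it is illustrative: `same_root_cause` stands for an LLM call in a real system, and `stub_judge` is a crude keyword stand-in so the example runs offline. None of this is Strix's code.

```python
import hashlib
from typing import Callable


def text_hash(report: str) -> str:
    """Exact-text fingerprint: any change in payload changes the hash."""
    return hashlib.sha256(report.encode("utf-8")).hexdigest()


def dedupe(reports: list[str], same_root_cause: Callable[[str, str], bool]) -> list[str]:
    """Keep the first report for each root cause, as decided by the judge."""
    unique: list[str] = []
    for report in reports:
        if not any(same_root_cause(report, kept) for kept in unique):
            unique.append(report)
    return unique


def stub_judge(a: str, b: str) -> bool:
    """Stand-in for an LLM judgment: same vulnerability class on the same endpoint."""
    def root(report: str) -> tuple[bool, bool]:
        text = report.lower()
        return ("sql injection" in text, "/login" in text)
    return root(a) == root(b) and all(root(a))


report_a = "SQL injection in /login via user=' OR 1=1--"
report_b = "SQL injection in /login via user=admin' UNION SELECT NULL--"
```

The two reports have different hashes, so a hash-based dedupe keeps both; `dedupe([report_a, report_b], stub_judge)` keeps only the first.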

Claude Code ●●●

Cache boundary as a literal sentinel string

Survives every refactor, with no marker objects to remember to add. Lets dozens of contributors compose one prompt without breaking cross-org cache reuse.

prompt-caching
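A minimal sketch of the sentinel idea, assuming a made-up sentinel value and a hash as the cache key. The names and the string itself are hypothetical and not Claude Code's; the sketch only shows that everything before the literal string can be keyed and reused regardless of what follows it.

```python
import hashlib

# Hypothetical sentinel; any literal string unlikely to occur in prompt text would do.
CACHE_BOUNDARY = "<<CACHE_BOUNDARY>>"


def compose_prompt(static_sections: list[str], dynamic_sections: list[str]) -> str:
    """Join sections into one prompt, with the sentinel between stable and per-session parts."""
    return "\n".join([*static_sections, CACHE_BOUNDARY, *dynamic_sections])


def split_at_boundary(prompt: str) -> tuple[str, str]:
    """Return (cacheable_prefix, dynamic_suffix); with no sentinel, nothing is cacheable."""
    prefix, found, suffix = prompt.partition(CACHE_BOUNDARY)
    if not found:
        return "", prompt
    return prefix, suffix


def cache_key(prompt: str) -> str:
    """Key the cache on the prefix only, so dynamic content never invalidates it."""
    prefix, _ = split_at_boundary(prompt)
    return hashlib.sha256(prefix.encode("utf-8")).hexdigest()
```

Two prompts built from the same static sections but different dynamic ones produce the same `cache_key`, which is the property that makes reuse across sessions possible.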