A comprehensive showcase of technical, product, and business expertise in designing and deploying production-ready AI agent systems.
Architecture, reliability, observability, evals
Strategy, UX, distribution, pricing
Governance, trust, rollout, moats
Production patterns for reliability, safety, evals, observability, and governance. Learn how to ship the smallest agent that solves the job while keeping it safe, observable, and governable.
How to pick winning agent wedges, design adoption + distribution loops, price safely, and govern rollout. Build agentic products that ship, stick, and scale.
Building blocks for agentic AI workflows. Understand the 4 layers of architecture and when to use skills, subagents, MCP servers, or simple context.
Four domain-specific proposals demonstrating full-spectrum competence: product wedge, engineering architecture, reliability, and governance.
Script writing with multi-role critique
Financial advisory with compliance guardrails
Inventory and trading optimization
Research and execution support
The art and science of curating, compressing, and delivering the right context to AI agents. Master the 4 knobs: Write, Select, Compress, Isolate.
Comprehensive guide to building production-ready agentic AI systems using the Claude Agent SDK. Master tools, governance, subagents, and deployment.
Domain-specific reinforcement learning for AI agents. Optimize workflow policies, improve long-horizon credit assignment, and achieve measurable performance lift.
Comprehensive framework for evaluating agent behavior over time. Master the 6-dimension scorecard, grader types, and 3-tier pipeline for production-ready eval systems.