I’m an LLM researcher with a passion for explaining scientific concepts to others.

Agent Evaluation: A Detailed Guide by Cameron R. Wolfe, Ph.D.

Best practices and common patterns for effectively evaluating AI agents...
