I’m an LLM researcher with a passion for explaining scientific concepts to others.

The Anatomy of an LLM Benchmark by Cameron R. Wolfe, Ph.D.

Common patterns used to create the most effective LLM evaluation datasets...
