Complete guide to building evaluation systems for LLM applications: gold sets, LLM-as-judge, regression testing, offline/online evaluation, and production monitoring
Prerequisites: