6.3 LLM-as-Judge
Self-evaluation, cross-model verification, and scoring rubrics for LLM output quality.
Self-evaluation, cross-model verification, and scoring rubrics for LLM output quality.
Rubric design, pairwise comparison, and calibration for LLM-as-judge evaluation.