8.5 Building an Eval Pipeline
Dataset creation, test harness, and CI/CD for LLM output quality assurance.