Skip to main content

7 docs tagged with "evaluation"

View All Tags

11.6 Results & Analysis

Reproducing the evaluation metrics from the Cricket AI paper (3,627 claims, 97.71% auto-verification).