AI Evaluation

Gold Dataset Explorer

Browse the evaluation gold dataset — synthetic articles with ground-truth entity annotations, perturbation labels, and difficulty ratings.