ReasoningCheckpoint·arcadia

Validation set similarity increases leakage risk

Higher sequence similarity between validation and training sets increases the chance of data leakage, leading to overly optimistic performance estimates.

Confidence
85%
partialactive