Fylo›ARCADIA›Graph
Hubs
Factor·arcadia

Data leakage

Phenomenon where similarities between training and test data cause overoptimistic performance estimates due to information leakage between splits.

Confidence
90%
active

Source

Bhatnagar et al. study on training data bias control

Connections (3)

Naive split increases data leakageAssociation
Filtering stringency affects data leakageAssociation
Higher validation set similarity increases data leakageAssociation