Association·arcadia

Naive split increases data leakage

Naive clustering-based split produces more data leakage, leading to increased performance by exploiting training/test similarity

Confidence
90%
active

Evidence Quote

Naive split approach produces higher performance due to increased data leakage between training and test sets

Relationship

Naive clustering-based data split increases Data leakage