Naive split increases data leakage — ARCADIA Knowledge Graph

Association·arcadia

A naive clustering-based split leaks more data between the training and test sets. Measured performance rises because the model can exploit training/test similarity, not because it generalizes better.
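
The effect can be illustrated with a small sketch. The entry does not define how the naive split is built, so this example assumes (purely for illustration) that "naive" means assigning individual samples at random without regard to cluster membership, and compares it with a split that holds out whole clusters. The toy dataset, the function names, and the leakage measure (the fraction of test samples whose cluster also appears in training) are all hypothetical and not taken from the source.

```python
import random

def make_dataset(n_clusters=12, per_cluster=10):
    """Hypothetical toy data: each sample is a (sample_id, cluster_id) pair."""
    return [(c * per_cluster + i, c)
            for c in range(n_clusters)
            for i in range(per_cluster)]

def naive_split(samples, test_frac=0.2, seed=0):
    """Assumed 'naive' split: shuffle samples and ignore cluster membership."""
    rng = random.Random(seed)
    shuffled = samples[:]
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_frac)
    return shuffled[n_test:], shuffled[:n_test]  # (train, test)

def cluster_holdout_split(samples, test_frac=0.2, seed=0):
    """Assign whole clusters to the test set, so no cluster spans both sides."""
    rng = random.Random(seed)
    clusters = sorted({c for _, c in samples})
    rng.shuffle(clusters)
    n_test = max(1, int(len(clusters) * test_frac))
    test_clusters = set(clusters[:n_test])
    train = [s for s in samples if s[1] not in test_clusters]
    test = [s for s in samples if s[1] in test_clusters]
    return train, test

def leakage_rate(train, test):
    """Fraction of test samples whose cluster also occurs in the training set."""
    train_clusters = {c for _, c in train}
    return sum(c in train_clusters for _, c in test) / len(test)

samples = make_dataset()
naive_train, naive_test = naive_split(samples)
held_train, held_test = cluster_holdout_split(samples)

naive_leak = leakage_rate(naive_train, naive_test)
held_leak = leakage_rate(held_train, held_test)

print(f"naive split leakage:     {naive_leak:.2f}")
print(f"cluster holdout leakage: {held_leak:.2f}")
```

Under these assumptions the cluster holdout has zero leakage by construction, while the naive split places near-duplicates of test samples in the training set. A model evaluated on the naive split can therefore score higher without generalizing any better.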

Confidence
90%
active

Evidence Quote

“Naive split approach produces higher performance due to increased data leakage between training and test sets”

Relationship

Naive clustering-based data split increases Data leakage

Arguments

Naive clustering-based data split (subject)
Data leakage (object)

Connections (4)

Data leakage and training data bias impact model performance (InferenceChain)
Automated metadata saving streamlines data collection (Association)
Data leakage and training data biases impact model performance (InferenceChain)
Data leakage and biases impact biological foundation model performance (InferenceChain)