Factor·arcadia

Data split strategy to control leakage

A method for keeping pretraining and test sets from overlapping, which reduces data leakage in machine learning models.
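One way such a split could look is sketched below. It is a minimal illustration and not the procedure described in the source. It assumes each record carries a grouping key (for example a hypothetical sequence-cluster ID) that marks which records are related. The names `split_by_group`, `shared_groups`, `group_of`, and `test_fraction` are made up for this example. Whole groups are assigned to one side only, so no group can appear in both the pretraining set and the test set.

```python
import hashlib


def split_by_group(records, group_of, test_fraction=0.2):
    """Split records so that every group lands entirely on one side.

    group_of is a caller-supplied function returning the grouping key of a
    record (an assumption of this sketch, e.g. a cluster or species ID).
    """
    pretraining, test = [], []
    for record in records:
        key = str(group_of(record)).encode("utf-8")
        # Hash the group key to a stable number in [0, 1). Records with the
        # same key always get the same number, hence the same side.
        bucket = int(hashlib.sha256(key).hexdigest()[:8], 16) / 2**32
        if bucket < test_fraction:
            test.append(record)
        else:
            pretraining.append(record)
    return pretraining, test


def shared_groups(pretraining, test, group_of):
    """Return the group keys present on both sides (empty set if none)."""
    return {group_of(r) for r in pretraining} & {group_of(r) for r in test}
```

Because assignment is made per group rather than per record, the realized test share only approximates `test_fraction`, especially when there are few groups or groups of very uneven size. The split guards against leakage only to the extent that the chosen grouping key captures the real relatedness between records.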

Confidence
70%
active

Source

Unclear effects of nonindependence on biological foundation models