Factor·arcadia
Sequence similarity filtering stringency
Four training-data balancing schemes defined by sequence similarity filtering. Filtering stringency serves as a proxy for the likelihood of data leakage and ranges from relaxed (high similarity allowed) to stringent (only low similarity allowed).
Confidence
90%
active
Source
Bhatnagar et al.: approach to controlling training data biases using sequence similarity filtering
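Illustration
The four filtering schemes described above could be sketched as follows. This is a minimal sketch under stated assumptions, not the method used by Bhatnagar et al.: the intermediate scheme names, the cutoff values, and the functions `similarity` and `filter_training` are all hypothetical. It also assumes that filtering compares training sequences against held-out sequences, and it uses `difflib.SequenceMatcher` as a simple stand-in for an alignment-based sequence identity.

```python
from difflib import SequenceMatcher

# Hypothetical cutoffs: the source names four schemes ranging from relaxed
# to stringent but does not state the threshold values or intermediate names.
SCHEMES = {
    "relaxed": 0.95,
    "moderate": 0.80,
    "strict": 0.60,
    "stringent": 0.40,
}


def similarity(a: str, b: str) -> float:
    """Stand-in similarity score in [0, 1]; not an alignment-based identity."""
    return SequenceMatcher(None, a, b, autojunk=False).ratio()


def filter_training(train: list[str], held_out: list[str], max_similarity: float) -> list[str]:
    """Keep training sequences whose similarity to every held-out sequence
    is at most max_similarity (assumed meaning of the filter)."""
    return [
        seq
        for seq in train
        if all(similarity(seq, ref) <= max_similarity for ref in held_out)
    ]


if __name__ == "__main__":
    train = ["MKTAYIAKQR", "MKTAYIAKQA", "GGGGGGGGGG"]
    held_out = ["MKTAYIAKQR"]
    for name, cutoff in SCHEMES.items():
        kept = filter_training(train, held_out, cutoff)
        print(f"{name} (max similarity {cutoff}): {len(kept)} of {len(train)} kept")
```

Under these assumptions, a lower cutoff can only remove more training sequences, so the stringent scheme leaves the smallest training set and the least opportunity for near-duplicates of held-out sequences to leak into training.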