Factor·arcadia

Sequence similarity filtering stringency

Four schemes of training data balance based on sequence similarity filtering; stringency acts as proxy for likelihood of data leakage, ranging from relaxed (high similarity) to stringent (low similarity).

Confidence
90%
active

Source

Bhatnagar et al. approach to control training data biases with sequence similarity filtering