Phylogenetic biases and non-independence cap model generalizability — ARCADIA Knowledge Graph

Association·arcadia

Claim: unmitigated phylogenetic bias and non-independence in protein datasets restrict the generalizability of machine learning models for protein prediction and design.
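
One common way to keep non-independence from inflating a model's apparent performance is to split data by lineage rather than by individual sequence, so that close relatives of the training sequences never appear in the test set. The sketch below is illustrative only and is not taken from the source: the function name `clade_aware_split`, the clade labels, and the toy records are all hypothetical, and it assumes each sequence already carries a clade (or cluster) label.

```python
import random
from collections import defaultdict


def clade_aware_split(records, test_fraction=0.2, seed=0):
    """Split (sequence_id, clade) pairs so that no clade spans both sets.

    Hypothetical helper for illustration; whole clades are assigned to the
    test set until it holds roughly `test_fraction` of the sequences.
    """
    by_clade = defaultdict(list)
    for seq_id, clade in records:
        by_clade[clade].append(seq_id)

    # Shuffle clades (not sequences) so related sequences stay together.
    clades = sorted(by_clade)
    random.Random(seed).shuffle(clades)

    target = test_fraction * len(records)
    train, test = [], []
    for clade in clades:
        # Whole clades go to the test set until it reaches the target size.
        bucket = test if len(test) < target else train
        bucket.extend(by_clade[clade])
    return train, test


# Hypothetical toy data: (sequence id, clade label).
records = [
    ("s1", "bacteria"), ("s2", "bacteria"), ("s3", "bacteria"),
    ("s4", "archaea"), ("s5", "archaea"),
    ("s6", "fungi"), ("s7", "fungi"), ("s8", "plants"),
]
train_ids, test_ids = clade_aware_split(records, test_fraction=0.25)

clade_of = dict(records)
train_clades = {clade_of[s] for s in train_ids}
test_clades = {clade_of[s] for s in test_ids}
```

Because entire clades are held out, test accuracy under this split reflects performance on lineages the model has not seen, which is closer to the generalizability the claim is about than a random per-sequence split would be.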

Confidence
90%
active

Evidence Quote

“Unmitigated non-independence and phylogenetic biases ... cap the generalizability of protein prediction and design”

Relationship

Phylogenetic bias decreases the generalizability of machine learning models for protein design

Arguments

Phylogenetic bias (subject)
Machine learning models for protein design (object)

Connections (3)

How phylogenetic data engineering expands protein model generalizability (InferenceChain)
Petabase-scale sequence alignment increases viral discovery (Association)
Tree-of-life sampling and algorithmic biases shape the performance of protein language/design models (InferenceChain)

Evidence

“Supports claims that integrating phylogenies is essential for comparative biological analyses and controlling bias.”

Felsenstein, J. (1985). Phylogenies and the Comparative Method. doi:10.1086/284325