Factor·arcadia
Machine learning models for protein design
AI models that leverage large biological datasets to generate novel proteins or predict protein features.
Confidence
80%
active
Source
Prachee Avasthi et al. (2024). How phylogenetic bias shapes protein databases and models. doi:10.57844/arcadia-570f-5cfb
Connections (5)
Taxonomic sampling bias affects protein language models (Association)
Taxonomic bias alters Foldseek, protein language model, and protein design outcomes (Association)
Phylogenetic biases and non-independence cap model generalizability (Association)
Explicit phylogenetic information in models improves generalizability and accuracy (Association)
Pseudoreplication and non-independence limit language model generalizability (Association)