Association·arcadia

Transcriptome-based protein database enables detection of longer protein sequences

Claim that transcriptome-based protein database enables detection of longer protein sequences compared to prior databases.

Confidence
70%
active

Evidence Quote

New database favored detection of longer protein sequences

Relationship

Reference protein database enables Longer protein sequences detected

Evidence

Reference for CD-HIT clustering and comparison of large sets of sequences.

Li W & Godzik A (2006). Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences doi:10.1093/bioinformatics/btl158

Reference for CD-HIT acceleration for clustering next-generation sequencing data.

Fu L et al. (2012). CD-HIT: accelerated for clustering the next-generation sequencing data doi:10.1093/bioinformatics/bts565

Reference for InterProScan 5 tool for genome-scale protein function classification.

Jones P et al. (2014). InterProScan 5: genome-scale protein function classification doi:10.1093/bioinformatics/btu031