Probabilistic orthology analysis
- PMID: 20525594
- DOI: 10.1093/sysbio/syp046
Abstract
Orthology analysis aims at identifying orthologous genes and gene products from different organisms and, therefore, is a powerful tool in modern computational and experimental biology. Although reconciliation-based orthology methods are generally considered more accurate than distance-based ones, the traditional parsimony-based implementation of reconciliation-based orthology analysis (most parsimonious reconciliation [MPR]) suffers from a number of shortcomings. For example, 1) it is limited to orthology predictions from the reconciliation that minimizes the number of gene duplication and loss events, 2) it cannot evaluate the support of this reconciliation in relation to the other reconciliations, and 3) it cannot make use of prior knowledge (e.g., about species divergence times) that provides auxiliary information for orthology predictions. We present a probabilistic approach to reconciliation-based orthology analysis that addresses all these issues by estimating orthology probabilities. The method is based on the gene evolution model, an explicit evolutionary model for gene duplication and gene loss inside a species tree that generalizes the standard birth-death process. We illustrate the probabilistic approach to orthology analysis using 2 experimental data sets and show that the use of orthology probabilities allows a more informative analysis than MPR and, in particular, that it is less sensitive to taxon sampling problems. We generalize these anecdotal observations and show, using data generated under biologically realistic conditions, that MPR gives false orthology predictions at a substantial frequency. Last, we provide a new orthology prediction method that allows an orthology and paralogy classification with any chosen sensitivity/specificity combination from the spectrum of achievable combinations.
We conclude that probabilistic orthology analysis is a strong and more advanced alternative to traditional orthology analysis and that it provides a framework for sophisticated comparative studies of processes in genome evolution.
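The contrast between a single best reconciliation and orthology probabilities can be illustrated with a minimal Python sketch. It is not the method of the paper: the abstract does not give the gene evolution model's formulas, so the reconciliation weights below are hypothetical placeholders standing in for model-derived probabilities, and the gene names, function names, and threshold are assumptions made for illustration only. The sketch sums normalized weights over all reconciliations in which a gene pair is orthologous, then classifies pairs by a tunable cutoff, which is one simple way a sensitivity/specificity trade-off could be exposed.

```python
from typing import Dict, List, Set, Tuple

Pair = Tuple[str, str]
# Each reconciliation is (weight, set of gene pairs it labels as orthologous).
# Weights are hypothetical stand-ins for probabilities under an evolutionary model.
Reconciliation = Tuple[float, Set[Pair]]


def orthology_probabilities(recs: List[Reconciliation]) -> Dict[Pair, float]:
    """Sum normalized weights of all reconciliations that call a pair orthologous."""
    total = sum(weight for weight, _ in recs)
    probs: Dict[Pair, float] = {}
    for weight, pairs in recs:
        for pair in pairs:
            probs[pair] = probs.get(pair, 0.0) + weight / total
    return probs


def single_best_pairs(recs: List[Reconciliation]) -> Set[Pair]:
    """Keep only the top-weighted reconciliation, loosely analogous to using one
    optimal reconciliation and ignoring the support for all others."""
    _, pairs = max(recs, key=lambda rec: rec[0])
    return set(pairs)


def classify(probs: Dict[Pair, float], all_pairs: List[Pair],
             threshold: float) -> Dict[Pair, bool]:
    """Call a pair orthologous when its probability reaches the threshold.
    Lower thresholds favor sensitivity, higher ones favor specificity."""
    return {pair: probs.get(pair, 0.0) >= threshold for pair in all_pairs}


# Hypothetical example: three candidate reconciliations of one gene family.
example: List[Reconciliation] = [
    (5.0, {("g1", "g2")}),
    (3.0, {("g1", "g2"), ("g1", "g3")}),
    (2.0, {("g1", "g3")}),
]
all_pairs: List[Pair] = [("g1", "g2"), ("g1", "g3"), ("g2", "g3")]

probs = orthology_probabilities(example)
print("probabilities:", probs)
print("single best reconciliation:", single_best_pairs(example))
print("threshold 0.4:", classify(probs, all_pairs, 0.4))
print("threshold 0.6:", classify(probs, all_pairs, 0.6))
```

In this toy input the pair ("g1", "g3") is absent from the top-weighted reconciliation yet is supported by half of the total weight, so it is called orthologous at the lower threshold and paralogous at the higher one.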