PAML (for Phylogenetic Analysis by Maximum Likelihood) is a package of programs for phylogenetic analyses of DNA or protein sequences using maximum likelihood. It is maintained by Ziheng Yang and distributed under the GNU GPL v3.
Before running any of the programs included in the PAML
software, please go through the different sections of this Wiki, where a detailed explanation of how to install and run the programs is given. One of the main problems users face is related to data formatting. In that way, please make sure that you go through the section Data formatting
in the PAML Wiki before running any PAML
program and format your input data files and control files accordingly.
Important
- Problems with input data, control files, error/warning messages output by the
PAML
programs (which inform users about formatting issues in their input files or wrong settings in their control files), and general questions should be posted in thePAML
discussion group. Before posting a message, please use the search tool in thePAML
discussion group to check whether your question/s have already been asked by otherPAML
users. You should also refer to the FAQs document. If you still cannot find an answer to your question/s, please post them in thePAML
discussion group. - Please, do not paste the screen output and the error/warning that you get without any reference when you are requesting help. Instead, make sure that you attach the input data and the control files you are using, explain how you ran
PAML
, whichPAML
version you are using, etc. In that way, your issue will be much easier and faster to troubleshoot. You can find more information about how to report your warning/error message on this website. - Please, only raise an issue on this GitHub repository when you experience technical problems such as compiling issues, programs aborting or not running at all, etc.
ANSI C source codes and executable files are distributed for UNIX, Linux, and Mac OS X (see the latest stable release available on this GitHub repository). PAML
is not good for tree making, although it may be used to estimate parameters and test hypotheses to study the evolutionary process once you have reconstructed trees using other programs such as RAxML-NG
, IQ-TREE
, PAUP*
, PHYLIP
, PhyML
, etc.
To download and install PAML
software, please follow the next links in the wiki:
- Downloading and installing
PAML
If you are a macOS user and you are experiencing issues such as "trace trap" or "segmentation fault" when running
PAML
programs, please download thedev
branch and follow the installation instructions given in the sectionPAML
for Mac OS X in the PAML Wiki.
The PAML Wiki is still under construction 🔧 -- we are slowly migrating the PAML documentation in PDF format to a more interactive and engaging tutorial!
In the meantime, you can access the following sections, which are also detailed in the PAML Wiki home section:
- Overview
- Data formatting
- Substitution models
PAML
programs
If you use PAML
, please cite the following:
- Yang, Z (1997). PAML: a program package for phylogenetic analysis by maximum likelihood. Comput. Appl. Biosci. 13, 555-556.
- Yang, Z (2007). PAML 4: Phylogenetic Analysis by Maximum Likelihood. Mol. Biol. Evol. 24, 1586-1591.
If you use the PAML
program MCMCtree
, please cite the following papers if you have used/run...
- ... the approximate likelihood calculation to speed up analyses with phylogenomic datasets (calculating branch lengths, Hessian, and gradient):
- ... Bayesian model selection analyses:
- ... the models for continuous morphological characters implemented in
MCMCtree
: - ... the protocol Bayesian Molecular Clock Dating Using Genome-Scale Datasets:
- dos Reis M and Yang, Z (2019). Bayesian Molecular Clock Dating Using Genome-Scale Datasets. In: Anisimova, M. (eds) Evolutionary Genomics. Methods in Molecular Biology, vol 1910. Humana, New York, NY. You can also access this chapter and the code used throughout the protocol on the
divtime
GitHub repository maintained by Mario dos Reis.
- dos Reis M and Yang, Z (2019). Bayesian Molecular Clock Dating Using Genome-Scale Datasets. In: Anisimova, M. (eds) Evolutionary Genomics. Methods in Molecular Biology, vol 1910. Humana, New York, NY. You can also access this chapter and the code used throughout the protocol on the
- ... the protocol Environmental Microbial Evolution: Methods and Protocols:
- ... the Bayesian sequential-subtree (BSS) approach and/or the scripts to fit skew-t distributions to fossil calibrations:
If you use the PAML
program CODEML
, please cite the following papers if you have used/run...
- ... the protocol A Beginners guide to estimating the non-synonymous to synonymous rate ratio of all protein-coding genes in a genome:
- ... the protocol Beginner's guide on the use of PAML to detect positive selection and/or the corresponding GitHub tutorial on the
positive-selection
repository:- Álvarez-Carretero S, Kapli P, Yang Z (2023). Beginner's guide on the use of PAML to detect positive selection, Mol Biol Evol, 40(4):msad041.. Remember to read the supplementary material where we discuss (i) analyses and checks you should carry out before running tests of positive selection with CODEML, (ii) gene tree VS species tree, and (iii) the usage of rooted and unrooted trees.
Changes and bug fixes until v4.10.1 were documented in the file doc/pamlHistory.txt. Changes in later versions have been documented for each release in the releases
section of this GitHub repository.