HTML publishing mechanism tacat, PAML user manual

PAML MANUAL

Amino Acid Sequences (seqtype = 2)

model specifies the model of amino acid substitution: 0 for the Poisson model assuming equal rates for any amino acid substitutions (Bishop and Friday, 1987); 1 for the

proportional model in which the rate of change to an amino acid is proportional to the frequency of that amino acid. Model = 2 specifies a class of empirical

models, and the empirical amino acid substitution rate matrix is given in the file

specified by aaRatefile. Files included in the package are for the empirical

models of Dayhoff et al. (1978) (dayhoff.dat), Jones et al. (1992) (see

Kishino, Miyata, and Hasegawa 1990 for the construction), and Whelan and

Goldman (2001) (wag.dat). The file mtmam.dat has a matrix for

mitochondrial proteins estimated by maximum likelihood from a data set of 20

mammals. The mtREV24 model of the MOLPHY package (Adachi and

Hasegawa 1996b) is also provided (the file mtREV24.dat). These two are

similar, and the difference is that the former is derived from proteins from

mammals only while the latter came from more-diverse species including

chicken, fish, frog, and lamprey. Due to differences in the implementation, you

may see small differences in log-likelihood values and branch lengths between

aaml and protml in the MOLPHY package. Such differences are normal and you should use the same program to compare different trees. Under the

mtREV24 model, the two programs should give almost identical results.

If you want to specify your own substitution rate matrix, have a look at one of

those files, which has notes about the file structure. Other options for amino

acid substitution models should be ignored. To summarize, the variables

model, aaDist, CodonFreq, NSsites, and icode are used for codon

sequences (seqtype = 1), while model, alpha, and aaRatefile are

used for amino acid sequences.
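To make the difference between model = 0 and model = 1 concrete, here is a minimal Python sketch of the two rate matrices. It is an illustration only, not the program's code: the function name rate_matrix, the amino acid ordering, and the scaling to a mean rate of 1 are assumptions made for this example.

```python
# Illustrative sketch only; names and scaling are assumptions, not PAML internals.
AMINO_ACIDS = "ARNDCQEGHILKMFPSTWYV"  # arbitrary ordering chosen for this example


def rate_matrix(freqs, proportional):
    """Build a square substitution rate matrix Q as a list of lists.

    proportional=False: Poisson-style model, every off-diagonal rate is equal.
    proportional=True:  the rate of change to amino acid j is proportional
                        to freqs[j].
    """
    n = len(freqs)
    q = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                q[i][j] = freqs[j] if proportional else 1.0
        # Diagonal makes each row sum to zero (q[i][i] is still 0.0 here).
        q[i][i] = -sum(q[i])
    # Stationary frequencies: uniform for equal rates, freqs otherwise.
    pi = freqs if proportional else [1.0 / n] * n
    # Rescale so that the expected substitution rate is 1 (assumed convention).
    mean_rate = -sum(pi[i] * q[i][i] for i in range(n))
    return [[x / mean_rate for x in row] for row in q]


if __name__ == "__main__":
    uniform = [1.0 / len(AMINO_ACIDS)] * len(AMINO_ACIDS)
    poisson = rate_matrix(uniform, proportional=False)
    print(len(poisson), len(poisson[0]))
```

With equal frequencies the two matrices coincide; they differ only when some amino acids are more frequent than others.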

runmode also works in the same way as in baseml.ctl. Specifying runmode = -2 forces the program to calculate the ML distances in pairwise comparisons.

You can change the following variables in the control file codeml.ctl:

aaRatefile, model, and alpha.
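As a rough sketch of how these variables might sit together for a pairwise ML run on amino acid data, the Python snippet below renders a few "name = value" lines in the style of a control file. The file names, the alpha value, and the helper render_ctl are placeholders invented for this example, and the fragment is not complete; the codeml.ctl shipped with the package lists the full set of variables.

```python
def render_ctl(settings):
    """Render 'name = value' lines in the style of a PAML control file."""
    width = max(len(name) for name in settings)
    lines = [f"{name:>{width}} = {value}" for name, value in settings.items()]
    return "\n".join(lines) + "\n"


settings = {
    "seqfile": "aa_alignment.phy",   # hypothetical input file name
    "outfile": "mlc_pairwise.txt",   # hypothetical output file name
    "seqtype": 2,                    # amino acid sequences
    "runmode": -2,                   # pairwise ML distances
    "model": 2,                      # empirical model read from aaRatefile
    "aaRatefile": "wag.dat",         # Whelan and Goldman (2001) matrix
    "alpha": 0.5,                    # placeholder value, chosen arbitrarily
    "cleandata": 1,                  # remove sites with gaps or ambiguities
}

ctl_text = render_ctl(settings)
print(ctl_text)
```

Keeping the settings in a dictionary makes it easy to write several control files that differ only in aaRatefile, for instance to compare empirical matrices on the same data.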

If you do pairwise ML comparison (runmode = -2) and the data contain

ambiguity characters or alignment gaps, the program will remove all sites which

have such characters from all sequences before the pairwise comparison if

cleandata = 1. This is known as "complete deletion". It will remove

alignment gaps and ambiguity characters in each pairwise comparison

("pairwise deletion") if cleandata = 0. {{This does not seem to be true.

The program currently removes all sites with any ambiguities if runmode = -2.

Need checking. Note by Ziheng 31/08/04.}} Note that in a likelihood analysis

of multiple sequences on a phylogeny, alignment gaps are treated as ambiguity

characters if cleandata = 0, and both alignment gaps and ambiguity

characters are deleted if cleandata = 1. Note that removing alignment gaps

and treating them as ambiguity characters both underestimate sequence

divergences. Ambiguity characters in the data (cleandata = 0) make the

likelihood calculation slower.
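The two deletion strategies can be illustrated on a toy alignment. The Python sketch below only demonstrates the definitions of complete and pairwise deletion; it does not reproduce what the program does, and the set of characters treated as gaps or ambiguities is an assumption made for this example.

```python
# Assumption for this toy example: these characters count as gaps or ambiguities.
AMBIGUOUS = set("-?X")


def complete_deletion(seqs):
    """Drop every column in which any sequence has a gap or ambiguity."""
    keep = [i for i in range(len(seqs[0]))
            if all(s[i] not in AMBIGUOUS for s in seqs)]
    return ["".join(s[i] for i in keep) for s in seqs]


def pairwise_deletion(a, b):
    """Drop only the columns that are unclean in this particular pair."""
    pairs = [(x, y) for x, y in zip(a, b)
             if x not in AMBIGUOUS and y not in AMBIGUOUS]
    return "".join(x for x, _ in pairs), "".join(y for _, y in pairs)


seqs = ["MKT-LLV", "MRTALLV", "MKTAL?I"]

cleaned = complete_deletion(seqs)
print(cleaned)                              # columns 4 and 6 are removed everywhere
print(pairwise_deletion(seqs[0], seqs[1]))  # only the gap column is removed
print(pairwise_deletion(seqs[1], seqs[2]))  # only the '?' column is removed
```

Complete deletion leaves the same five sites for every pair, while pairwise deletion keeps six sites for each of the two pairs shown, so distances from the two strategies are computed over different sets of sites.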

Output for amino acid sequences (seqtype = 2): The output file is self-explanatory and very similar to the result files for the nucleotide- and codon-based analyses. The
