<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1745-6150-4-20</ui>
   <ji>1745-6150</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p><it>&#947;</it>-MYN: a new algorithm for estimating Ka and Ks with consideration of variable substitution rates</p>
         </title>
         <aug>
            <au id="A1" ce="yes">
               <snm>Wang</snm>
               <fnm>Da-Peng</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>wangdp@big.ac.cn</email>
            </au>
            <au id="A2" ce="yes">
               <snm>Wan</snm>
               <fnm>Hao-Lei</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>wanhaolei@big.ac.cn</email>
            </au>
            <au id="A3" ce="yes">
               <snm>Zhang</snm>
               <fnm>Song</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <email>zhangsong@big.ac.cn</email>
            </au>
            <au id="A4" ca="yes">
               <snm>Yu</snm>
               <fnm>Jun</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <email>junyu@big.ac.cn</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100029, PR China</p>
            </ins>
            <ins id="I2">
               <p>Graduate University of Chinese Academy of Sciences, Beijing 100039, PR China</p>
            </ins>
            <ins id="I3">
               <p>Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080, PR China</p>
            </ins>
         </insg>
         <source>Biology Direct</source>
         <issn>1745-6150</issn>
         <pubdate>2009</pubdate>
         <volume>4</volume>
         <issue>1</issue>
         <fpage>20</fpage>
         <url>http://www.biology-direct.com/content/4/1/20</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">19531225</pubid>
               <pubid idtype="doi">10.1186/1745-6150-4-20</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>13</day>
               <month>6</month>
               <year>2009</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>16</day>
               <month>6</month>
               <year>2009</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>16</day>
               <month>6</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Wang et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Over the past two decades, there have been several approximate methods that adopt different mutation models and used for estimating nonsynonymous and synonymous substitution rates (Ka and Ks) based on protein-coding sequences across species or even different evolutionary lineages. Among them, MYN method (a <ul>M</ul>odified version of <ul>Y</ul>ang-<ul>N</ul>ielsen method) considers three major dynamic features of evolving DNA sequences&#8211;bias in transition/transversion rate, nucleotide frequency, and unequal transitional substitution but leaves out another important feature: unequal substitution rates among different sites or nucleotide positions.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We incorporated a new feature for analyzing evolving DNA sequences&#8211;unequal substitution rates among different sites&#8211;into MYN method, and proposed a modified version, namely <it>&#947; </it>(gamma)-MYN, based on an assumption that the evolutionary rate at each site follows a mode of <it>&#947;</it>-distribution. We applied <it>&#947;</it>-MYN to analyze the key estimator of selective pressure &#969; (Ka/Ks) and other relevant parameters in comparison to two other related methods, YN and MYN, and found that neglecting the variation of substitution rates among different sites may lead to biased estimations of &#969;. Our new method appears to have minimal deviations when relevant parameters vary within normal ranges defined by empirical data.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Our results indicate that unequal substitution rates among different sites have variable influences on &#969; under different evolutionary rates while both transition/transversion rate ratio and unequal nucleotide frequencies affect Ka and Ks thus selective pressure &#969;.</p>
            </sec>
            <sec>
               <st>
                  <p>Reviewers</p>
               </st>
               <p>This paper was reviewed by Kateryna Makova, David A. Liberles (nominated by David H Ardell), Zhaolei Zhang (nominated by Mark Gerstein), and Shamil Sunyaev.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Comparative sequence analysis is a powerful tool for biologists to study evolutionary relationship among animals and plants across diverse taxonomic lineages <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. Pair-wise sequence comparison is perhaps the simplest comparative analysis for phylogeny for two reasons <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. First, calculating pair-wise distances is the initial step for distance-matrix methods of phylogeny reconstruction. Second, Markov-process models of nucleotide substitution used in distance calculations lay a foundation for likelihood and Bayesian analyses. One of the sophisticated methods is to estimate nonsynonymous and synonymous substitution rates for interrogating sequence dynamics and constructing phylogenetic trees. Since Ka and Ks represent the number of substitutions per nonsynonymous and synonymous site, respectively, these parameters (or often their ratio Ka/Ks or &#969;) are important for the estimation of evolutionary rates. The indications of Ka &lt; Ks (&#969; &lt; 1), Ka > Ks (&#969; > 1), and Ka = Ks (&#969; = 1) on evolutionary trends are negative (purifying), positive (adaptive), and neutral mutations, respectively. Ka and Ks can be estimated based on approximate methods, which typically involve three essential steps <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>: (1) counting the number of synonymous (S) and nonsynonymous (N) sites among targeted sequences, (2) counting the number of synonymous (S<sub>d</sub>) and nonsynonymous (N<sub>d</sub>) substitutions between two orthologous sequences, and (3) calculating the number of synonymous (d<sub>s</sub>) and nonsynonymous (d<sub>n</sub>) substitutions per site after correcting for multiple substitutions. Most of the methods assume simplified nucleotide substitution paths and involve <it>ad hoc </it>data treatments that are not well-justified <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. For instance, NG (Nei-Gojobori) method, a commonly-used approximate method in the early days, considers all possible evolutionary courses among compared DNA sequences and assumes that each nucleotide can be substituted with any of three other nucleotides at equal rate when it counts both sites and substitutions <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. It adopts Jukes-Cantor's one parameter formula only to correct for multiple substitutions. Another example, LWL (Li-Wu-Luo) method, classifies sites and substitutions as <it>i</it>-fold degenerate sites (<it>i </it>= 0, 2, 4) and considers unequal rates between transitional and transversional changes only when it counts substitutions, but considers equal rates when counting sites <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. A modified LWL, LPB (Li-Pamilo-Bianchi) method corrects for bias in counting sites by using different formulas for Ka and Ks estimation, which differentiate LPB from LWL method <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. Versions of LWL and LPB methods were also proposed by distinguishing two-fold degenerate sites and substitutions, taking the account of the transition/transversion rate bias when counting sites and correcting for arginine codons <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>.</p>
         <p>Among approximate methods, YN (Yang-Neilsen) method made significant progress through consideration of transition/transversion rate and nucleotide frequency biases <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Based on YN method, we recently proposed a <ul>m</ul>odified <ul>YN</ul> method (MYN) to distinguish substitutions between purines (A/G) and between pyrimidines (T/C) <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>. MYN incorporates most of the major features of sequence evolution but assumes that different sites in sequences evolve the same way and at the same rate. This assumption is somewhat less thorough, and accumulating evidence of rate variation over sites is rather overwhelming <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. Since mutation rates certainly vary among sites, and mutations at different sites may be fixed or drifting at different rates due to their versatile roles in the structure and function of gene products (mostly proteins albeit RNAs also fold into different conformations), unequal nucleotide frequencies, different codon usage among species, and variation of substitution rates among different sites should all be taken into account, allowing for significant yet maybe incremental improvements on various parameter estimations. Some sixteen years ago, one of the pioneers of this field, Ziheng Yang suggested <it>&#947;</it>-distribution (gamma-distribution) as an adequate approximation based on his intensive comparative analysis on several continuous distributions leveraging on sequence data from the globin genes <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. As <it>&#947;</it>-distribution has been frequently used in estimating sequence divergence <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>, we adopt it to formulate an improved approximate method, denoted as <it>&#947;</it>-MYN, based on MYN method <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. In this method, we assume that nucleotide substitutions follow <it>&#947;</it>-distribution because negative binomial distribution is known to be generated when Poisson parameter <it>&#947; </it>varies according to a particular <it>&#947; </it>distribution among sites <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. We would like to emphasize that the <it>&#947; </it>distribution here refers to raw mutation rate rather than <it>&#947; </it>distribution of &#969; itself. It has been proposed that nucleotide substitution in coding region is context-dependent <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, and therefore, substitution rates depend on not only the neighboring sequences but also their functional constraints and models that allow for the correlation of substitution rates at adjacent sites were also developed <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>. However, as these models tend to produce results similar to the simple gamma model and variations of &#945; can make the distribution suitable for accommodating different levels of rate variations in various datasets <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, we chose the simple gamma distribution as the depiction of raw various mutation rates. Since YN and MYN methods perform better as compared to numerous other methods <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> and MYN improves the performance of YN for most parameter combinations <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, we focus on evaluating the performance of <it>&#947;</it>-MYN by comparing it to YN and MYN under variable conditions. The definitions of symbols used in Ka and Ks estimations are listed in Table <tblr tid="T1">1</tblr>.</p>
         <tbl id="T1">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Symbols used in Ka and Ks calculation</p>
            </caption>
            <tblbdy cols="2">
               <r>
                  <c ca="center">
                     <p>
                        <b>Symbol</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Definition</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="2">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>S</p>
                  </c>
                  <c ca="left">
                     <p>Number of synonymous sites</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>N</p>
                  </c>
                  <c ca="left">
                     <p>Number of nonsynonymous sites</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>Ks</p>
                  </c>
                  <c ca="left">
                     <p>Synonymous substitution rate</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>Ka</p>
                  </c>
                  <c ca="left">
                     <p>Nonsynonymous substitution rate</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>&#969;</p>
                  </c>
                  <c ca="left">
                     <p>Estimator of selective strength, &#969; = Ka/Ks</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>S<sub>d</sub></p>
                  </c>
                  <c ca="left">
                     <p>Number of synonymous substitutions</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>N<sub>d</sub></p>
                  </c>
                  <c ca="left">
                     <p>Number of nonsynonymous substitutions</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>
                        <it>t</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>Divergence time between two sequences, the expected number of nucleotide substitutions per codon, <it>t </it>= (Ks &#215; 3S + Ka &#215; 3N)/(S + N)</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>&#945;</p>
                  </c>
                  <c ca="left">
                     <p>The parameter of gamma distribution</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>&#954;</p>
                  </c>
                  <c ca="left">
                     <p>Ratio of transitional rate/transversional rate</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>&#954;<sub>R</sub></p>
                  </c>
                  <c ca="left">
                     <p>Ratio of transitional rate between purines to transversional rate, &#954;<sub>R </sub>= &#945;<sub>R</sub>/&#946;</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>&#954;<sub>Y</sub></p>
                  </c>
                  <c ca="left">
                     <p>Ratio of transitional rate between pyrimidines to transversional rate, &#954;<sub>Y </sub>= &#945;<sub>Y</sub>/&#946;</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>g<sub>N</sub></p>
                  </c>
                  <c ca="left">
                     <p>Frequency of nucleotide N, N &#8712; [T, C, A, G]</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>&#945;<sub>R</sub></p>
                  </c>
                  <c ca="left">
                     <p>Transitional rate between purines</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>&#945;<sub>Y</sub></p>
                  </c>
                  <c ca="left">
                     <p>Transitional rate between pyrimidines</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>&#946;</p>
                  </c>
                  <c ca="left">
                     <p>Transversional rate</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Computer simulation</p>
            </st>
            <p>Computer simulation is a routine approach for evaluating computational procedures of different algorithms. In molecular phylogeny, one major approach for simulating DNA sequence evolution is to generate an ancestral sequence for the root of a tree and "evolve" it along the tree building process according to substitution models, branch lengths, and substitution parameters <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr></abbrgrp>. This approach can be implemented in the evolver program in the PAML (Phylogenetic Analysis by Maximum Likelihood <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>) package, which usually uses nucleotide or amino acid sequence data to simulate evolving protein-coding sequences. To assess the advantages of &#947;-MYN in comparison with YN and MYN, we generated three groups of simulated sequences with the PAML package: (1) equal codon frequencies, (2) human frequencies (based on human protein-coding genes from the ENSEMBL database) <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> and (3) rice frequencies (based on rice protein-coding genes) <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. We also generated 2,000 sequence pairs with 1,200 bp in length for examining the effect of different parameters.</p>
         </sec>
         <sec>
            <st>
               <p>Consistency analysis and effect of codon frequencies</p>
            </st>
            <p>In general, a better method should have relatively minimal deviations from real values with near infinite amount of data and within a reasonable range of all relevant parameters. In reality, we have to define both data in a limited way and parameter ranges within reasonable boundaries. In this exercise, we use &#969; = 0.3, 1, and 3 to represent negative, neutral, and positive mutations, respectively <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, and fix parameter <it>t </it>to 0.6 for initial assessment. Since genuine values for &#954; often range from 1.5 to 5, we always fix &#954; = 3.75 as typical. Considering that <it>&#947;</it>-MYN differentiates &#954;<sub>Y </sub>from &#954;<sub>R</sub>, we always fix one of them to 3.75 and allow the other varying from 1 to 10. We then analyze &#969; among data generated with YN, MYN, and <it>&#947;</it>-MYN against &#954;<sub>R </sub>(fixing &#954;<sub>Y </sub>= 3.75), using the three codon frequencies under different selective pressures (Figure <figr fid="F1">1A&#8211;I</figr>). We observed that <it>&#947;</it>-MYN produces less deviated &#969; from the standard data under negative selection as we perform analyses for different species. Although <it>&#947;</it>-MYN performs in a very similar way as MYN does, it is obviously better than YN under either positive or neutral selections. Since biased codon frequencies often have opposite effects as compared to the bias of transition/transversion rate ratio, ignoring codon frequency bias can lead to an overestimation of &#969; <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Using empirical data from human and rice, which represent distinct codon usages, we also did not detect any effect among different codon frequencies (Figure <figr fid="F1">1A&#8211;I</figr>). Since most of the evolutionary studies tend to calculate evolutionary rates between closely-related species, future research should focus more on the effect of different parameters and the improvement of calculations under negative selection.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Estimated &#969; based on YN, MYN, and <it>&#947;</it>-MYN</p>
               </caption>
               <text>
                  <p><b>Estimated &#969; based on YN, MYN, and <it>&#947;</it>-MYN</b>. We plotted &#969; values estimated by YN, MYN, and <it>&#947;</it>-MYN when &#954;<sub>Y </sub>= 3.75, considering &#954;<sub>R </sub>varying from 1 to10. We used the canonical genetic code for simulated sequences with 1.6 million codons and three sets of codon frequencies: equal (A to C), human (D to F) calculated from human protein-coding genes, and rice (G to I) calculated from rice protein-coding genes. &#969; = 0.3 (A, D, G), &#969; = 1 (B, E, H), and &#969; = 3 (C, F, I) were considered as representative values for purifying selection, neutral mutation, and positive selection, respectively.</p>
               </text>
               <graphic file="1745-6150-4-20-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Effect of &#947;-distribution</p>
            </st>
            <p>MYN method assumes that different sites in a sequence evolve in the same way and at the same rate. It is obvious that such an assumption does not happen in the real world for most proteins and their genes. For instance, mutation rates are not the same in nuclear and organellar genomes among different species <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. In addition, sequence variations among portions or domains of proteins mutate differently from a fixed mutation rate due to their specific structural and functional constraints for different genes under different selective pressures. Therefore, we introduced a parameter &#945; in MYN method so that each substitution rate across sites is assumed to follow <it>&#947;</it>-distribution.</p>
            <p>Since &#945; is an unknown random variable and its variations may lead to changes of probability density of <it>&#947;</it>-distribution as well as deviations of <it>&#947;</it>-MYN method, we chose different parameters to force it to deviate from real values under different selective pressures (Figure <figr fid="F2">2</figr>). For a qualitative survey, the order of estimated values of &#969;, in the cases of &#954;<sub>R </sub>= 1, 2, and 3, is: YN &lt;<it>&#947;</it>-MYN &lt; MYN; the order of estimated values of &#969; for the rest cases, &#954;<sub>R </sub>= 4, 5, 6, 7, 8, 9, and 10, is: <it>&#947;</it>-MYN&lt;MYN &lt;YN. Furthermore, we observed that estimated &#969; do not change much as &#945; varies when expected &#969; = 1 or 3, and <it>&#947;</it>-MYN again performs better when &#969; = 0.3 than it does when &#969; = 1 or 3. Because most calculated &#969; values indicate negative selection, and variation of &#945; has stronger influence under negative selection, we analyzed the variation of &#969; in a range of 0.1 to 0.9 to evaluate the effect of &#945; on &#969;. We obtained different optimal &#945; values when &#969; varies from 0.1 to 0.9, and plotted different <it>&#947; </it>distribution densities (Figure <figr fid="F3">3</figr>). Each curve appears reaching its maximum and goes down with an increasing substitution rate. The peaks of the curves shift to the left and become lower in density when optimal &#945; values decrease from 4.8 to 1.5; the decrease is attributable to the increase of &#969; (selective pressure) from 0.1 to 0.9. Furthermore, selective pressure shows significant effects on &#945;, with an increase in the probability at the lowest and highest substitution rates across all sites. Because <it>&#947;</it>-MYN produces less biases than both YN and MYN do when &#969; varies from 0.1 to 0.9 under &#945; = 4 (data not shown), we chose &#945; = 4 as a typical value for our analyses.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Estimated &#969; when &#954;<sub>Y </sub>= 3.75 and &#954;<sub>R </sub>varies from 1 to 10 under negative selection</p>
               </caption>
               <text>
                  <p><b>Estimated &#969; when &#954;<sub>Y </sub>= 3.75 and &#954;<sub>R </sub>varies from 1 to 10 under negative selection</b>. We obtained better &#969; estimates by introducing parameter &#945; when orthologous genes are under negative selection with &#969; varying from 0.1 to 0.9. The canonical genetic code was used for simulated sequences with 1.6 million human codons.</p>
               </text>
               <graphic file="1745-6150-4-20-2"/>
            </fig>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p><it>&#947; </it>distribution density as a function of substitution rates at optimal &#945; values</p>
               </caption>
               <text>
                  <p><b><it>&#947; </it>distribution density as a function of substitution rates at optimal &#945; values</b>. We plotted different <it>&#947; </it>distribution densities as a function of substitution rates at optimal &#945; values: (1) &#969; = 0.1, &#945; = 4.8; (2) &#969; = 0.2, &#945; = 4; (3) &#969; = 0.3, &#945; = 3.3; (4) &#969; = 0.4, &#945; = 3; (5) &#969; = 0.5, &#945; = 2.5; (6) &#969; = 0.6, &#945; = 2; (7) &#969; = 0.7, 0.8, and 0.9, &#945; = 1.5. Note that each curve reaches its maximum and goes down with increasing substitution rates.</p>
               </text>
               <graphic file="1745-6150-4-20-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Effect of t</p>
            </st>
            <p>The parameter <it>t </it>represents divergence time between two sequences. To test the effect of <it>t </it>on our method, we use human codon frequency (2,000 pairs of sequences with 400 codons for each case), and vary <it>t </it>from 0.1 to 1. Since <it>&#947;</it>-MYN does not change much in comparison with MYN under positive selection and neutral selection, we only consider the three obvious conditions of negative selection when &#954;<sub>R </sub>= 10 and &#954;<sub>Y </sub>= 1 are fixed: &#969; = 0.2, 0.3 and 0.4 (Figure <figr fid="F4">4</figr>). Although YN, MYN, and <it>&#947;</it>-MYN all have a nearly identical overall trend when <it>t </it>varies from 0.1 to 1, and they all tend to overestimate &#969; for negative selection, <it>&#947;</it>-MYN deviates less from the expected values. Despite the fact that <it>&#947;</it>-MYN also overestimates &#969; and the overestimation becomes less obvious as <it>t </it>increases, while the overestimation of both YN and MYN becomes severer.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>The effect of <it>t </it>based on YN, MYN, and <it>&#947;</it>-MYN</p>
               </caption>
               <text>
                  <p><b>The effect of <it>t </it>based on YN, MYN, and <it>&#947;</it>-MYN</b>. <it>&#947;</it>-MYN deviates less under the effect of <it>t </it>as compared to other methods. Both YN and MYN tend to overestimate &#969;. Since MYN and <it>&#947;</it>-MYN are two modified forms of YN, all datasets exhibit a similar trend. However, when <it>t </it>increases, <it>&#947;</it>-MYN performs better than the other two methods. Parameter values are (A) &#945; = 4, &#954;<sub>R </sub>= 10, &#954;<sub>Y </sub>= 1, the expected value of &#969; = 0.2; (B) &#945; = 4, &#954;<sub>R </sub>= 10, &#954;<sub>Y </sub>= 1, the expected value of &#969; = 0.3; and (C) &#945; = 4, &#954;<sub>R </sub>= 10, &#954;<sub>Y </sub>= 1, the expected value of &#969; = 0.4.</p>
               </text>
               <graphic file="1745-6150-4-20-4"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Effects of &#954;<sub>R </sub>and &#954;<sub>Y</sub></p>
            </st>
            <p>We used the same data (2,000 pairs of human codon sequences with 400 codons for each case) and methods to test the effects of &#954;<sub>R </sub>and &#954;<sub>Y</sub>. We plotted the average estimates of &#969; from YN, MYN, and <it>&#947;</it>-MYN methods against &#954;<sub>Y </sub>= &#954;<sub>R </sub>for the parameter combinations: the expected &#969; values vary as 0.2 and 0.3 when &#945; = 4 (Figure <figr fid="F5">5</figr>). While the curves produce from YN and MYN methods superimpose each other when &#954;<sub>R </sub>(=&#954;<sub>Y</sub>) varies from 1 to 10, <it>&#947;</it>-MYN deviates clearly less from the expected &#969;. We found that <it>&#947;</it>-MYN still performs better than the other two methods, whereas MYN is degraded to YN when &#954;<sub>Y </sub>is equal to &#954;<sub>R</sub>. The result suggests that the assumption of variable substitution rates among different sites is necessary to Ka and Ks calculations.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>The effects of &#954;<sub>R </sub>and &#954;<sub>Y </sub>based on YN, MYN, and <it>&#947;</it>-MYN</p>
               </caption>
               <text>
                  <p><b>The effects of &#954;<sub>R </sub>and &#954;<sub>Y </sub>based on YN, MYN, and <it>&#947;</it>-MYN</b>. We showed the effects of &#954;<sub>R </sub>and &#954;<sub>Y </sub>when &#969; = 0.2 (A) and 0.3 (B) were said to represent negative selection. The human codon frequency was used for the simulated sequences and &#945; = 4 for both plots.</p>
               </text>
               <graphic file="1745-6150-4-20-5"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Effect of S%</p>
            </st>
            <p>Usually the effect of S% (the fraction of synonymous sites in a sequence) is considered as a factor of method evaluation. Changes in &#969; in relation to S% are often evaluated based on the effect of S% on the deviation of Ka and Ks. Therefore, an overestimated S% may give rise to underestimation of Ks and overestimation of Ka, resulting in overestimation of &#969;. Likewise, underestimation of S% may also lead to overestimation of Ks and underestimation of &#969;. It has been reported that S% has enormous influence on Ka and Ks under negative selection but has neglectable effect under positive selection <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. We used human sequences to examine the effect of S% on <it>&#947;</it>-MYN (Table <tblr tid="T2">2</tblr>), fixing &#954;<sub>Y </sub>to 3.75. As &#954;<sub>R </sub>increases, the value of S% generated from our method exhibits minor fluctuations under different negative selections, when compared to that from YN. But the difference between <it>&#947;</it>-MYN and MYN is minute under this condition. In more details, the order of estimated values of S%, in the cases of &#954;<sub>R </sub>= 1, 2, and 3, is: YN &lt; MYN &lt;<it>&#947;</it>-MYN; in the rest cases, the order of estimated values of S%, when &#954;<sub>R </sub>= 5, 6, 7, 8, 9 and 10, is: MYN &lt;<it>&#947;</it>-MYN &lt; YN. We did not observe any obvious trend under the condition of &#954;<sub>R </sub>= 4. Therefore, <it>&#947;</it>-MYN is deemed insensitive to S% changes.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>S% Estimates under different negative selections based on YN, MYN, and <it>&#947;</it>-MYN</p>
               </caption>
               <tblbdy cols="13">
                  <r>
                     <c cspan="13" ca="center">
                        <p>
                           <b>Human Codon Frequencies</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="13">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>&#969;</p>
                     </c>
                     <c ca="center">
                        <p>&#945;</p>
                     </c>
                     <c ca="center">
                        <p>Method</p>
                     </c>
                     <c cspan="10" ca="center">
                        <p>S (%)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="10">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>&#954;<sub>R </sub>= 1</p>
                     </c>
                     <c ca="center">
                        <p>&#954;<sub>R </sub>= 2</p>
                     </c>
                     <c ca="center">
                        <p>&#954;<sub>R </sub>= 3</p>
                     </c>
                     <c ca="center">
                        <p>&#954;<sub>R </sub>= 4</p>
                     </c>
                     <c ca="center">
                        <p>&#954;<sub>R </sub>= 5</p>
                     </c>
                     <c ca="center">
                        <p>&#954;<sub>R </sub>= 6</p>
                     </c>
                     <c ca="center">
                        <p>&#954;<sub>R </sub>= 7</p>
                     </c>
                     <c ca="center">
                        <p>&#954;<sub>R </sub>= 8</p>
                     </c>
                     <c ca="center">
                        <p>&#954;<sub>R </sub>= 9</p>
                     </c>
                     <c ca="center">
                        <p>&#954;<sub>R </sub>= 10</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="13">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0.1</p>
                     </c>
                     <c ca="center">
                        <p>4.8</p>
                     </c>
                     <c ca="center">
                        <p>YN</p>
                     </c>
                     <c ca="center">
                        <p>25.92</p>
                     </c>
                     <c ca="center">
                        <p>27.09</p>
                     </c>
                     <c ca="center">
                        <p>27.95</p>
                     </c>
                     <c ca="center">
                        <p>28.60</p>
                     </c>
                     <c ca="center">
                        <p>29.18</p>
                     </c>
                     <c ca="center">
                        <p>29.63</p>
                     </c>
                     <c ca="center">
                        <p>29.99</p>
                     </c>
                     <c ca="center">
                        <p>30.28</p>
                     </c>
                     <c ca="center">
                        <p>30.57</p>
                     </c>
                     <c ca="center">
                        <p>30.77</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>MYN</p>
                     </c>
                     <c ca="center">
                        <p>29.03</p>
                     </c>
                     <c ca="center">
                        <p>28.88</p>
                     </c>
                     <c ca="center">
                        <p>28.78</p>
                     </c>
                     <c ca="center">
                        <p>28.66</p>
                     </c>
                     <c ca="center">
                        <p>28.56</p>
                     </c>
                     <c ca="center">
                        <p>28.50</p>
                     </c>
                     <c ca="center">
                        <p>28.42</p>
                     </c>
                     <c ca="center">
                        <p>28.34</p>
                     </c>
                     <c ca="center">
                        <p>28.29</p>
                     </c>
                     <c ca="center">
                        <p>28.16</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p><it>&#947;</it>-MYN</p>
                     </c>
                     <c ca="center">
                        <p>29.12</p>
                     </c>
                     <c ca="center">
                        <p>28.94</p>
                     </c>
                     <c ca="center">
                        <p>28.83</p>
                     </c>
                     <c ca="center">
                        <p>28.70</p>
                     </c>
                     <c ca="center">
                        <p>28.59</p>
                     </c>
                     <c ca="center">
                        <p>28.53</p>
                     </c>
                     <c ca="center">
                        <p>28.45</p>
                     </c>
                     <c ca="center">
                        <p>28.36</p>
                     </c>
                     <c ca="center">
                        <p>28.31</p>
                     </c>
                     <c ca="center">
                        <p>28.18</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="13">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0.2</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>YN</p>
                     </c>
                     <c ca="center">
                        <p>25.91</p>
                     </c>
                     <c ca="center">
                        <p>27.07</p>
                     </c>
                     <c ca="center">
                        <p>27.92</p>
                     </c>
                     <c ca="center">
                        <p>28.62</p>
                     </c>
                     <c ca="center">
                        <p>29.22</p>
                     </c>
                     <c ca="center">
                        <p>29.66</p>
                     </c>
                     <c ca="center">
                        <p>30.04</p>
                     </c>
                     <c ca="center">
                        <p>30.36</p>
                     </c>
                     <c ca="center">
                        <p>30.65</p>
                     </c>
                     <c ca="center">
                        <p>30.87</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>MYN</p>
                     </c>
                     <c ca="center">
                        <p>29.00</p>
                     </c>
                     <c ca="center">
                        <p>28.87</p>
                     </c>
                     <c ca="center">
                        <p>28.69</p>
                     </c>
                     <c ca="center">
                        <p>28.57</p>
                     </c>
                     <c ca="center">
                        <p>28.53</p>
                     </c>
                     <c ca="center">
                        <p>28.43</p>
                     </c>
                     <c ca="center">
                        <p>28.30</p>
                     </c>
                     <c ca="center">
                        <p>28.26</p>
                     </c>
                     <c ca="center">
                        <p>28.21</p>
                     </c>
                     <c ca="center">
                        <p>28.15</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p><it>&#947;</it>-MYN</p>
                     </c>
                     <c ca="center">
                        <p>29.10</p>
                     </c>
                     <c ca="center">
                        <p>28.95</p>
                     </c>
                     <c ca="center">
                        <p>28.75</p>
                     </c>
                     <c ca="center">
                        <p>28.62</p>
                     </c>
                     <c ca="center">
                        <p>28.57</p>
                     </c>
                     <c ca="center">
                        <p>28.47</p>
                     </c>
                     <c ca="center">
                        <p>28.33</p>
                     </c>
                     <c ca="center">
                        <p>28.28</p>
                     </c>
                     <c ca="center">
                        <p>28.23</p>
                     </c>
                     <c ca="center">
                        <p>28.16</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="13">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0.3</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>YN</p>
                     </c>
                     <c ca="center">
                        <p>25.90</p>
                     </c>
                     <c ca="center">
                        <p>27.06</p>
                     </c>
                     <c ca="center">
                        <p>27.94</p>
                     </c>
                     <c ca="center">
                        <p>28.65</p>
                     </c>
                     <c ca="center">
                        <p>29.19</p>
                     </c>
                     <c ca="center">
                        <p>29.63</p>
                     </c>
                     <c ca="center">
                        <p>30.04</p>
                     </c>
                     <c ca="center">
                        <p>30.38</p>
                     </c>
                     <c ca="center">
                        <p>30.65</p>
                     </c>
                     <c ca="center">
                        <p>30.90</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>MYN</p>
                     </c>
                     <c ca="center">
                        <p>29.05</p>
                     </c>
                     <c ca="center">
                        <p>28.87</p>
                     </c>
                     <c ca="center">
                        <p>28.70</p>
                     </c>
                     <c ca="center">
                        <p>28.57</p>
                     </c>
                     <c ca="center">
                        <p>28.43</p>
                     </c>
                     <c ca="center">
                        <p>28.33</p>
                     </c>
                     <c ca="center">
                        <p>28.26</p>
                     </c>
                     <c ca="center">
                        <p>28.19</p>
                     </c>
                     <c ca="center">
                        <p>28.14</p>
                     </c>
                     <c ca="center">
                        <p>28.06</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p><it>&#947;</it>-MYN</p>
                     </c>
                     <c ca="center">
                        <p>29.14</p>
                     </c>
                     <c ca="center">
                        <p>28.95</p>
                     </c>
                     <c ca="center">
                        <p>28.76</p>
                     </c>
                     <c ca="center">
                        <p>28.61</p>
                     </c>
                     <c ca="center">
                        <p>28.47</p>
                     </c>
                     <c ca="center">
                        <p>28.36</p>
                     </c>
                     <c ca="center">
                        <p>28.28</p>
                     </c>
                     <c ca="center">
                        <p>28.21</p>
                     </c>
                     <c ca="center">
                        <p>28.15</p>
                     </c>
                     <c ca="center">
                        <p>28.08</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="13">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0.4</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>YN</p>
                     </c>
                     <c ca="center">
                        <p>25.85</p>
                     </c>
                     <c ca="center">
                        <p>27.06</p>
                     </c>
                     <c ca="center">
                        <p>27.96</p>
                     </c>
                     <c ca="center">
                        <p>28.65</p>
                     </c>
                     <c ca="center">
                        <p>29.20</p>
                     </c>
                     <c ca="center">
                        <p>29.66</p>
                     </c>
                     <c ca="center">
                        <p>30.05</p>
                     </c>
                     <c ca="center">
                        <p>30.37</p>
                     </c>
                     <c ca="center">
                        <p>30.64</p>
                     </c>
                     <c ca="center">
                        <p>30.90</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>MYN</p>
                     </c>
                     <c ca="center">
                        <p>28.96</p>
                     </c>
                     <c ca="center">
                        <p>28.84</p>
                     </c>
                     <c ca="center">
                        <p>28.70</p>
                     </c>
                     <c ca="center">
                        <p>28.55</p>
                     </c>
                     <c ca="center">
                        <p>28.44</p>
                     </c>
                     <c ca="center">
                        <p>28.35</p>
                     </c>
                     <c ca="center">
                        <p>28.27</p>
                     </c>
                     <c ca="center">
                        <p>28.22</p>
                     </c>
                     <c ca="center">
                        <p>28.16</p>
                     </c>
                     <c ca="center">
                        <p>28.10</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p><it>&#947;</it>-MYN</p>
                     </c>
                     <c ca="center">
                        <p>29.08</p>
                     </c>
                     <c ca="center">
                        <p>28.94</p>
                     </c>
                     <c ca="center">
                        <p>28.78</p>
                     </c>
                     <c ca="center">
                        <p>28.62</p>
                     </c>
                     <c ca="center">
                        <p>28.50</p>
                     </c>
                     <c ca="center">
                        <p>28.40</p>
                     </c>
                     <c ca="center">
                        <p>28.31</p>
                     </c>
                     <c ca="center">
                        <p>28.25</p>
                     </c>
                     <c ca="center">
                        <p>28.18</p>
                     </c>
                     <c ca="center">
                        <p>28.11</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="13">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0.5</p>
                     </c>
                     <c ca="center">
                        <p>2.5</p>
                     </c>
                     <c ca="center">
                        <p>YN</p>
                     </c>
                     <c ca="center">
                        <p>25.88</p>
                     </c>
                     <c ca="center">
                        <p>27.05</p>
                     </c>
                     <c ca="center">
                        <p>27.95</p>
                     </c>
                     <c ca="center">
                        <p>28.66</p>
                     </c>
                     <c ca="center">
                        <p>29.23</p>
                     </c>
                     <c ca="center">
                        <p>29.66</p>
                     </c>
                     <c ca="center">
                        <p>30.04</p>
                     </c>
                     <c ca="center">
                        <p>30.35</p>
                     </c>
                     <c ca="center">
                        <p>30.64</p>
                     </c>
                     <c ca="center">
                        <p>30.86</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>MYN</p>
                     </c>
                     <c ca="center">
                        <p>28.97</p>
                     </c>
                     <c ca="center">
                        <p>28.81</p>
                     </c>
                     <c ca="center">
                        <p>28.68</p>
                     </c>
                     <c ca="center">
                        <p>28.56</p>
                     </c>
                     <c ca="center">
                        <p>28.47</p>
                     </c>
                     <c ca="center">
                        <p>28.36</p>
                     </c>
                     <c ca="center">
                        <p>28.29</p>
                     </c>
                     <c ca="center">
                        <p>28.18</p>
                     </c>
                     <c ca="center">
                        <p>28.15</p>
                     </c>
                     <c ca="center">
                        <p>28.05</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p><it>&#947;</it>-MYN</p>
                     </c>
                     <c ca="center">
                        <p>29.12</p>
                     </c>
                     <c ca="center">
                        <p>28.93</p>
                     </c>
                     <c ca="center">
                        <p>28.77</p>
                     </c>
                     <c ca="center">
                        <p>28.63</p>
                     </c>
                     <c ca="center">
                        <p>28.53</p>
                     </c>
                     <c ca="center">
                        <p>28.41</p>
                     </c>
                     <c ca="center">
                        <p>28.33</p>
                     </c>
                     <c ca="center">
                        <p>28.21</p>
                     </c>
                     <c ca="center">
                        <p>28.17</p>
                     </c>
                     <c ca="center">
                        <p>28.07</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="13">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0.6</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>YN</p>
                     </c>
                     <c ca="center">
                        <p>25.89</p>
                     </c>
                     <c ca="center">
                        <p>27.05</p>
                     </c>
                     <c ca="center">
                        <p>27.92</p>
                     </c>
                     <c ca="center">
                        <p>28.66</p>
                     </c>
                     <c ca="center">
                        <p>29.20</p>
                     </c>
                     <c ca="center">
                        <p>29.66</p>
                     </c>
                     <c ca="center">
                        <p>30.02</p>
                     </c>
                     <c ca="center">
                        <p>30.34</p>
                     </c>
                     <c ca="center">
                        <p>30.61</p>
                     </c>
                     <c ca="center">
                        <p>30.88</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>MYN</p>
                     </c>
                     <c ca="center">
                        <p>28.99</p>
                     </c>
                     <c ca="center">
                        <p>28.81</p>
                     </c>
                     <c ca="center">
                        <p>28.64</p>
                     </c>
                     <c ca="center">
                        <p>28.58</p>
                     </c>
                     <c ca="center">
                        <p>28.45</p>
                     </c>
                     <c ca="center">
                        <p>28.37</p>
                     </c>
                     <c ca="center">
                        <p>28.27</p>
                     </c>
                     <c ca="center">
                        <p>28.18</p>
                     </c>
                     <c ca="center">
                        <p>28.10</p>
                     </c>
                     <c ca="center">
                        <p>28.09</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p><it>&#947;</it>-MYN</p>
                     </c>
                     <c ca="center">
                        <p>29.19</p>
                     </c>
                     <c ca="center">
                        <p>28.96</p>
                     </c>
                     <c ca="center">
                        <p>28.77</p>
                     </c>
                     <c ca="center">
                        <p>28.67</p>
                     </c>
                     <c ca="center">
                        <p>28.52</p>
                     </c>
                     <c ca="center">
                        <p>28.43</p>
                     </c>
                     <c ca="center">
                        <p>28.31</p>
                     </c>
                     <c ca="center">
                        <p>28.22</p>
                     </c>
                     <c ca="center">
                        <p>28.13</p>
                     </c>
                     <c ca="center">
                        <p>28.12</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="13">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0.7</p>
                     </c>
                     <c ca="center">
                        <p>1.5</p>
                     </c>
                     <c ca="center">
                        <p>YN</p>
                     </c>
                     <c ca="center">
                        <p>25.87</p>
                     </c>
                     <c ca="center">
                        <p>27.05</p>
                     </c>
                     <c ca="center">
                        <p>27.93</p>
                     </c>
                     <c ca="center">
                        <p>28.63</p>
                     </c>
                     <c ca="center">
                        <p>29.21</p>
                     </c>
                     <c ca="center">
                        <p>29.65</p>
                     </c>
                     <c ca="center">
                        <p>30.02</p>
                     </c>
                     <c ca="center">
                        <p>30.33</p>
                     </c>
                     <c ca="center">
                        <p>30.63</p>
                     </c>
                     <c ca="center">
                        <p>30.86</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>MYN</p>
                     </c>
                     <c ca="center">
                        <p>28.94</p>
                     </c>
                     <c ca="center">
                        <p>28.79</p>
                     </c>
                     <c ca="center">
                        <p>28.65</p>
                     </c>
                     <c ca="center">
                        <p>28.54</p>
                     </c>
                     <c ca="center">
                        <p>28.47</p>
                     </c>
                     <c ca="center">
                        <p>28.36</p>
                     </c>
                     <c ca="center">
                        <p>28.27</p>
                     </c>
                     <c ca="center">
                        <p>28.16</p>
                     </c>
                     <c ca="center">
                        <p>28.12</p>
                     </c>
                     <c ca="center">
                        <p>28.05</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p><it>&#947;</it>-MYN</p>
                     </c>
                     <c ca="center">
                        <p>29.21</p>
                     </c>
                     <c ca="center">
                        <p>28.99</p>
                     </c>
                     <c ca="center">
                        <p>28.82</p>
                     </c>
                     <c ca="center">
                        <p>28.67</p>
                     </c>
                     <c ca="center">
                        <p>28.57</p>
                     </c>
                     <c ca="center">
                        <p>28.44</p>
                     </c>
                     <c ca="center">
                        <p>28.33</p>
                     </c>
                     <c ca="center">
                        <p>28.21</p>
                     </c>
                     <c ca="center">
                        <p>28.16</p>
                     </c>
                     <c ca="center">
                        <p>28.08</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="13">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0.8</p>
                     </c>
                     <c ca="center">
                        <p>1.5</p>
                     </c>
                     <c ca="center">
                        <p>YN</p>
                     </c>
                     <c ca="center">
                        <p>25.86</p>
                     </c>
                     <c ca="center">
                        <p>27.05</p>
                     </c>
                     <c ca="center">
                        <p>27.96</p>
                     </c>
                     <c ca="center">
                        <p>28.62</p>
                     </c>
                     <c ca="center">
                        <p>29.20</p>
                     </c>
                     <c ca="center">
                        <p>29.65</p>
                     </c>
                     <c ca="center">
                        <p>30.02</p>
                     </c>
                     <c ca="center">
                        <p>30.33</p>
                     </c>
                     <c ca="center">
                        <p>30.61</p>
                     </c>
                     <c ca="center">
                        <p>30.86</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>MYN</p>
                     </c>
                     <c ca="center">
                        <p>28.93</p>
                     </c>
                     <c ca="center">
                        <p>28.79</p>
                     </c>
                     <c ca="center">
                        <p>28.69</p>
                     </c>
                     <c ca="center">
                        <p>28.52</p>
                     </c>
                     <c ca="center">
                        <p>28.48</p>
                     </c>
                     <c ca="center">
                        <p>28.37</p>
                     </c>
                     <c ca="center">
                        <p>28.27</p>
                     </c>
                     <c ca="center">
                        <p>28.17</p>
                     </c>
                     <c ca="center">
                        <p>28.11</p>
                     </c>
                     <c ca="center">
                        <p>28.05</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p><it>&#947;</it>-MYN</p>
                     </c>
                     <c ca="center">
                        <p>29.19</p>
                     </c>
                     <c ca="center">
                        <p>29.00</p>
                     </c>
                     <c ca="center">
                        <p>28.86</p>
                     </c>
                     <c ca="center">
                        <p>28.65</p>
                     </c>
                     <c ca="center">
                        <p>28.58</p>
                     </c>
                     <c ca="center">
                        <p>28.45</p>
                     </c>
                     <c ca="center">
                        <p>28.34</p>
                     </c>
                     <c ca="center">
                        <p>28.22</p>
                     </c>
                     <c ca="center">
                        <p>28.14</p>
                     </c>
                     <c ca="center">
                        <p>28.08</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="13">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0.9</p>
                     </c>
                     <c ca="center">
                        <p>1.5</p>
                     </c>
                     <c ca="center">
                        <p>YN</p>
                     </c>
                     <c ca="center">
                        <p>25.87</p>
                     </c>
                     <c ca="center">
                        <p>27.06</p>
                     </c>
                     <c ca="center">
                        <p>27.96</p>
                     </c>
                     <c ca="center">
                        <p>28.65</p>
                     </c>
                     <c ca="center">
                        <p>29.20</p>
                     </c>
                     <c ca="center">
                        <p>29.66</p>
                     </c>
                     <c ca="center">
                        <p>30.01</p>
                     </c>
                     <c ca="center">
                        <p>30.33</p>
                     </c>
                     <c ca="center">
                        <p>30.61</p>
                     </c>
                     <c ca="center">
                        <p>30.85</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>MYN</p>
                     </c>
                     <c ca="center">
                        <p>28.91</p>
                     </c>
                     <c ca="center">
                        <p>28.80</p>
                     </c>
                     <c ca="center">
                        <p>28.67</p>
                     </c>
                     <c ca="center">
                        <p>28.57</p>
                     </c>
                     <c ca="center">
                        <p>28.47</p>
                     </c>
                     <c ca="center">
                        <p>28.38</p>
                     </c>
                     <c ca="center">
                        <p>28.27</p>
                     </c>
                     <c ca="center">
                        <p>28.17</p>
                     </c>
                     <c ca="center">
                        <p>28.08</p>
                     </c>
                     <c ca="center">
                        <p>28.04</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p><it>&#947;</it>-MYN</p>
                     </c>
                     <c ca="center">
                        <p>29.18</p>
                     </c>
                     <c ca="center">
                        <p>29.01</p>
                     </c>
                     <c ca="center">
                        <p>28.84</p>
                     </c>
                     <c ca="center">
                        <p>28.69</p>
                     </c>
                     <c ca="center">
                        <p>28.58</p>
                     </c>
                     <c ca="center">
                        <p>28.46</p>
                     </c>
                     <c ca="center">
                        <p>28.34</p>
                     </c>
                     <c ca="center">
                        <p>28.22</p>
                     </c>
                     <c ca="center">
                        <p>28.12</p>
                     </c>
                     <c ca="center">
                        <p>28.07</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Effect of sequence lengths</p>
            </st>
            <p>The length of homologous genes subjected to an analysis usually varies in actual calculation. In order to evaluate the effect of variable sequence lengths, we use two groups of simulated rice sequences under the conditions of (1) &#969; = 0.2, &#954;<sub>R </sub>= 10, &#954;<sub>Y </sub>= 1, <it>t </it>= 0.6, and &#945; = 4; and (2) &#969; = 0.3, &#954;<sub>R </sub>= 10, &#954;<sub>Y </sub>= 1, <it>t </it>= 0.6, and &#945; = 4. We then calculate the average estimated &#969; when the number of codons varied from 100 to 1,000 (Table <tblr tid="T3">3</tblr>). It appears that all three methods overestimate &#969; regardless the number of codons in the datasets. In particular, despite the fact that all three methods give rise greater biases for shorter sequences (&lt;300 codons), <it>&#947;</it>-MYN performs better than the other two methods. We also found that the performance of <it>&#947;</it>-MYN is getting better faster than the other two methods as the number of codon increases.</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Average &#969; estimates calculated based on YN, MYN and <it>&#947;</it>-MYN.</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c cspan="7" ca="center">
                        <p>
                           <b>Rice Codon Frequencies (&#945; = 4)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Number of codons</p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>&#969; = 0.2</p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>&#969; = 0.3</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>YN</p>
                     </c>
                     <c ca="center">
                        <p>MYN</p>
                     </c>
                     <c ca="center">
                        <p><it>&#947;</it>-MYN</p>
                     </c>
                     <c ca="center">
                        <p>YN</p>
                     </c>
                     <c ca="center">
                        <p>MYN</p>
                     </c>
                     <c ca="center">
                        <p><it>&#947;</it>-MYN</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>100</p>
                     </c>
                     <c ca="center">
                        <p>0.308</p>
                     </c>
                     <c ca="center">
                        <p>0.245</p>
                     </c>
                     <c ca="center">
                        <p>0.235</p>
                     </c>
                     <c ca="center">
                        <p>0.458</p>
                     </c>
                     <c ca="center">
                        <p>0.364</p>
                     </c>
                     <c ca="center">
                        <p>0.352</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>200</p>
                     </c>
                     <c ca="center">
                        <p>0.305</p>
                     </c>
                     <c ca="center">
                        <p>0.230</p>
                     </c>
                     <c ca="center">
                        <p>0.222</p>
                     </c>
                     <c ca="center">
                        <p>0.450</p>
                     </c>
                     <c ca="center">
                        <p>0.341</p>
                     </c>
                     <c ca="center">
                        <p>0.332</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>300</p>
                     </c>
                     <c ca="center">
                        <p>0.294</p>
                     </c>
                     <c ca="center">
                        <p>0.219</p>
                     </c>
                     <c ca="center">
                        <p>0.210</p>
                     </c>
                     <c ca="center">
                        <p>0.435</p>
                     </c>
                     <c ca="center">
                        <p>0.325</p>
                     </c>
                     <c ca="center">
                        <p>0.316</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>400</p>
                     </c>
                     <c ca="center">
                        <p>0.290</p>
                     </c>
                     <c ca="center">
                        <p>0.215</p>
                     </c>
                     <c ca="center">
                        <p>0.207</p>
                     </c>
                     <c ca="center">
                        <p>0.426</p>
                     </c>
                     <c ca="center">
                        <p>0.317</p>
                     </c>
                     <c ca="center">
                        <p>0.308</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>500</p>
                     </c>
                     <c ca="center">
                        <p>0.294</p>
                     </c>
                     <c ca="center">
                        <p>0.216</p>
                     </c>
                     <c ca="center">
                        <p>0.208</p>
                     </c>
                     <c ca="center">
                        <p>0.430</p>
                     </c>
                     <c ca="center">
                        <p>0.317</p>
                     </c>
                     <c ca="center">
                        <p>0.308</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>600</p>
                     </c>
                     <c ca="center">
                        <p>0.291</p>
                     </c>
                     <c ca="center">
                        <p>0.214</p>
                     </c>
                     <c ca="center">
                        <p>0.206</p>
                     </c>
                     <c ca="center">
                        <p>0.427</p>
                     </c>
                     <c ca="center">
                        <p>0.316</p>
                     </c>
                     <c ca="center">
                        <p>0.307</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>700</p>
                     </c>
                     <c ca="center">
                        <p>0.290</p>
                     </c>
                     <c ca="center">
                        <p>0.213</p>
                     </c>
                     <c ca="center">
                        <p>0.205</p>
                     </c>
                     <c ca="center">
                        <p>0.424</p>
                     </c>
                     <c ca="center">
                        <p>0.313</p>
                     </c>
                     <c ca="center">
                        <p>0.305</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>800</p>
                     </c>
                     <c ca="center">
                        <p>0.290</p>
                     </c>
                     <c ca="center">
                        <p>0.212</p>
                     </c>
                     <c ca="center">
                        <p>0.205</p>
                     </c>
                     <c ca="center">
                        <p>0.424</p>
                     </c>
                     <c ca="center">
                        <p>0.313</p>
                     </c>
                     <c ca="center">
                        <p>0.305</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>900</p>
                     </c>
                     <c ca="center">
                        <p>0.288</p>
                     </c>
                     <c ca="center">
                        <p>0.212</p>
                     </c>
                     <c ca="center">
                        <p>0.204</p>
                     </c>
                     <c ca="center">
                        <p>0.422</p>
                     </c>
                     <c ca="center">
                        <p>0.312</p>
                     </c>
                     <c ca="center">
                        <p>0.303</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>1000</p>
                     </c>
                     <c ca="center">
                        <p>0.287</p>
                     </c>
                     <c ca="center">
                        <p>0.210</p>
                     </c>
                     <c ca="center">
                        <p>0.203</p>
                     </c>
                     <c ca="center">
                        <p>0.421</p>
                     </c>
                     <c ca="center">
                        <p>0.310</p>
                     </c>
                     <c ca="center">
                        <p>0.302</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Note: The parameters used are &#954;<sub>R </sub>= 10, &#954;<sub>Y </sub>= 1, <it>t </it>= 0.6, and &#945; = 4. &#969; = 0.2 and &#969; = 0.3 are used separately to represent purifying selection. &#969; values are averaged over 2,000 pairs of simulated sequences.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Testing real data</p>
            </st>
            <p>We used three ortholog datasets for the test, 14,323 from human-dog, 16,066 from human-mouse, and 12,351 from human-chimp. For a more comprehensive display, we examined the cumulative percentage of &#954;<sub>R</sub>-&#954;<sub>Y </sub>(Figure <figr fid="F6">6</figr>), showing different transitional substitutions with unequal frequencies. For example, the cumulative percentages for &#954;<sub>R </sub>- &#954;<sub>Y </sub>> 0.4 for human-dog, human-mouse and human-chimp orthologs are 52.27%, 52.66%, and 24.47% and those for &#954;<sub>R </sub>- &#954;<sub>Y </sub>&lt; -0.4 are 25.36%, 24.31%, and 21.87%, respectively. In the rest cases, for |&#954;<sub>R </sub>- &#954;<sub>Y</sub>| &#8804; 0.4, they are 22.37%, 23.02%, and 53.66% for the three ortholog groups. We found that the value for human-chimp is more than twice as much as that of human-dog (or human-mouse), and the reasons are attributable to a close evolutionary relationship between human and chimpanzee <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>.</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Cumulative percentage of &#954;<sub>R </sub>- &#954;<sub>Y </sub>for human-dog, human-mouse and human-chimp orthologs at a bin size of 0.2</p>
               </caption>
               <text>
                  <p><b>Cumulative percentage of &#954;<sub>R </sub>- &#954;<sub>Y</sub> for human-dog, human-mouse and human-chimp orthologs at a bin size of 0.2</b>. We divided the x-axis into 100 bins and plotted the cumulative percentage of &#954;<sub>R </sub>- &#954;<sub>Y </sub>from the orthologous genes of human-dog, human-mouse and human-chimp.</p>
               </text>
               <graphic file="1745-6150-4-20-6"/>
            </fig>
            <p>To evaluate the performance of <it>&#947;</it>-MYN, we compared a set of values for several key parameters (S%, Ka, Ks, and &#969;) generated with <it>&#947;</it>-MYN and three other selected methods in a straightforward way, considering three cases of &#954;<sub>R </sub>- &#954;<sub>Y </sub>> 0.4, &#954;<sub>R </sub>- &#954;<sub>Y </sub>&lt; -0.4, and |&#954;<sub>R </sub>- &#954;<sub>Y</sub>| &#8804; 0.4 (Table <tblr tid="T4">4</tblr>). We chose the value of 0.4 as a threshold so that the three cases can stand for three groups of &#954;<sub>R </sub>under the condition of &#954;<sub>Y </sub>= 3.75: (1) &#954;<sub>R </sub>= 5, 6, 7, 8, 9 and 10; (2) &#954;<sub>R </sub>= 1, 2, and 3; (3) &#954;<sub>R </sub>= 4. Other than YN and MYN, we also used a maximum likelihood method proposed by Goldman and Yang (denoted as GY) <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>.</p>
            <tbl id="T4">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Proportions of synonymous sites (S%) and estimates of Ka, Ks and &#969;</p>
               </caption>
               <tblbdy cols="13">
                  <r>
                     <c ca="center">
                        <p>Method</p>
                     </c>
                     <c cspan="4" ca="center">
                        <p>&#954;<sub>R </sub>- &#954;<sub>Y </sub>> 0.4</p>
                     </c>
                     <c cspan="4" ca="center">
                        <p>&#954;<sub>R </sub>- &#954;<sub>Y </sub>&lt; -0.4</p>
                     </c>
                     <c cspan="4" ca="center">
                        <p>|&#954;<sub>R </sub>- &#954;<sub>Y</sub>| &#8804; 0.4</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="12">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>S%</p>
                     </c>
                     <c ca="center">
                        <p>Ka</p>
                     </c>
                     <c ca="center">
                        <p>Ks</p>
                     </c>
                     <c ca="center">
                        <p>&#969;</p>
                     </c>
                     <c ca="center">
                        <p>S%</p>
                     </c>
                     <c ca="center">
                        <p>Ka</p>
                     </c>
                     <c ca="center">
                        <p>Ks</p>
                     </c>
                     <c ca="center">
                        <p>&#969;</p>
                     </c>
                     <c ca="center">
                        <p>S%</p>
                     </c>
                     <c ca="center">
                        <p>Ka</p>
                     </c>
                     <c ca="center">
                        <p>Ks</p>
                     </c>
                     <c ca="center">
                        <p>&#969;</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="13">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="13" ca="center">
                        <p>human-dog orthologs</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>GY</p>
                     </c>
                     <c ca="center">
                        <p>24.80%</p>
                     </c>
                     <c ca="center">
                        <p>0.0691</p>
                     </c>
                     <c ca="center">
                        <p>0.4936</p>
                     </c>
                     <c ca="center">
                        <p>0.1483</p>
                     </c>
                     <c ca="center">
                        <p>24.72%</p>
                     </c>
                     <c ca="center">
                        <p>0.0723</p>
                     </c>
                     <c ca="center">
                        <p>0.4676</p>
                     </c>
                     <c ca="center">
                        <p>0.1639</p>
                     </c>
                     <c ca="center">
                        <p>24.34%</p>
                     </c>
                     <c ca="center">
                        <p>0.0874</p>
                     </c>
                     <c ca="center">
                        <p>0.5347</p>
                     </c>
                     <c ca="center">
                        <p>0.1704</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>YN</p>
                     </c>
                     <c ca="center">
                        <p>25.09%</p>
                     </c>
                     <c ca="center">
                        <p>0.0674</p>
                     </c>
                     <c ca="center">
                        <p>0.4900</p>
                     </c>
                     <c ca="center">
                        <p>0.1531</p>
                     </c>
                     <c ca="center">
                        <p>24.96%</p>
                     </c>
                     <c ca="center">
                        <p>0.0700</p>
                     </c>
                     <c ca="center">
                        <p>0.4750</p>
                     </c>
                     <c ca="center">
                        <p>0.1695</p>
                     </c>
                     <c ca="center">
                        <p>24.27%</p>
                     </c>
                     <c ca="center">
                        <p>0.0928</p>
                     </c>
                     <c ca="center">
                        <p>0.5412</p>
                     </c>
                     <c ca="center">
                        <p>0.1746</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>MYN</p>
                     </c>
                     <c ca="center">
                        <p>23.88%</p>
                     </c>
                     <c ca="center">
                        <p>0.0664</p>
                     </c>
                     <c ca="center">
                        <p>0.5748</p>
                     </c>
                     <c ca="center">
                        <p>0.1361</p>
                     </c>
                     <c ca="center">
                        <p>26.16%</p>
                     </c>
                     <c ca="center">
                        <p>0.0711</p>
                     </c>
                     <c ca="center">
                        <p>0.4495</p>
                     </c>
                     <c ca="center">
                        <p>0.1800</p>
                     </c>
                     <c ca="center">
                        <p>24.22%</p>
                     </c>
                     <c ca="center">
                        <p>0.0939</p>
                     </c>
                     <c ca="center">
                        <p>0.5596</p>
                     </c>
                     <c ca="center">
                        <p>0.1727</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>&#947;-MYN</p>
                     </c>
                     <c ca="center">
                        <p>23.93%</p>
                     </c>
                     <c ca="center">
                        <p>0.0681</p>
                     </c>
                     <c ca="center">
                        <p>0.6462</p>
                     </c>
                     <c ca="center">
                        <p>0.1266</p>
                     </c>
                     <c ca="center">
                        <p>26.28%</p>
                     </c>
                     <c ca="center">
                        <p>0.0733</p>
                     </c>
                     <c ca="center">
                        <p>0.4962</p>
                     </c>
                     <c ca="center">
                        <p>0.1713</p>
                     </c>
                     <c ca="center">
                        <p>24.29%</p>
                     </c>
                     <c ca="center">
                        <p>0.0962</p>
                     </c>
                     <c ca="center">
                        <p>0.6227</p>
                     </c>
                     <c ca="center">
                        <p>0.1620</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="13" ca="center">
                        <p>human-mouse orthologs</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>GY</p>
                     </c>
                     <c ca="center">
                        <p>25.86%</p>
                     </c>
                     <c ca="center">
                        <p>0.0901</p>
                     </c>
                     <c ca="center">
                        <p>0.7163</p>
                     </c>
                     <c ca="center">
                        <p>0.1291</p>
                     </c>
                     <c ca="center">
                        <p>25.65%</p>
                     </c>
                     <c ca="center">
                        <p>0.0961</p>
                     </c>
                     <c ca="center">
                        <p>0.7118</p>
                     </c>
                     <c ca="center">
                        <p>0.1390</p>
                     </c>
                     <c ca="center">
                        <p>25.52%</p>
                     </c>
                     <c ca="center">
                        <p>0.1128</p>
                     </c>
                     <c ca="center">
                        <p>0.7422</p>
                     </c>
                     <c ca="center">
                        <p>0.1543</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>YN</p>
                     </c>
                     <c ca="center">
                        <p>26.07%</p>
                     </c>
                     <c ca="center">
                        <p>0.0877</p>
                     </c>
                     <c ca="center">
                        <p>0.7002</p>
                     </c>
                     <c ca="center">
                        <p>0.1344</p>
                     </c>
                     <c ca="center">
                        <p>25.69%</p>
                     </c>
                     <c ca="center">
                        <p>0.0923</p>
                     </c>
                     <c ca="center">
                        <p>0.6904</p>
                     </c>
                     <c ca="center">
                        <p>0.1465</p>
                     </c>
                     <c ca="center">
                        <p>25.31%</p>
                     </c>
                     <c ca="center">
                        <p>0.1091</p>
                     </c>
                     <c ca="center">
                        <p>0.7439</p>
                     </c>
                     <c ca="center">
                        <p>0.1564</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>MYN</p>
                     </c>
                     <c ca="center">
                        <p>24.83%</p>
                     </c>
                     <c ca="center">
                        <p>0.0863</p>
                     </c>
                     <c ca="center">
                        <p>0.8321</p>
                     </c>
                     <c ca="center">
                        <p>0.1157</p>
                     </c>
                     <c ca="center">
                        <p>26.93%</p>
                     </c>
                     <c ca="center">
                        <p>0.0940</p>
                     </c>
                     <c ca="center">
                        <p>0.6527</p>
                     </c>
                     <c ca="center">
                        <p>0.1566</p>
                     </c>
                     <c ca="center">
                        <p>25.24%</p>
                     </c>
                     <c ca="center">
                        <p>0.1090</p>
                     </c>
                     <c ca="center">
                        <p>0.7734</p>
                     </c>
                     <c ca="center">
                        <p>0.1526</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>&#947;-MYN</p>
                     </c>
                     <c ca="center">
                        <p>24.91%</p>
                     </c>
                     <c ca="center">
                        <p>0.0894</p>
                     </c>
                     <c ca="center">
                        <p>0.9501</p>
                     </c>
                     <c ca="center">
                        <p>0.1058</p>
                     </c>
                     <c ca="center">
                        <p>27.10%</p>
                     </c>
                     <c ca="center">
                        <p>0.0980</p>
                     </c>
                     <c ca="center">
                        <p>0.7390</p>
                     </c>
                     <c ca="center">
                        <p>0.1468</p>
                     </c>
                     <c ca="center">
                        <p>25.35%</p>
                     </c>
                     <c ca="center">
                        <p>0.1140</p>
                     </c>
                     <c ca="center">
                        <p>0.8934</p>
                     </c>
                     <c ca="center">
                        <p>0.1398</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="13" ca="center">
                        <p>human-chimp orthologs</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>GY</p>
                     </c>
                     <c ca="center">
                        <p>25.47%</p>
                     </c>
                     <c ca="center">
                        <p>0.0273</p>
                     </c>
                     <c ca="center">
                        <p>0.0663</p>
                     </c>
                     <c ca="center">
                        <p>0.5118</p>
                     </c>
                     <c ca="center">
                        <p>25.18%</p>
                     </c>
                     <c ca="center">
                        <p>0.0262</p>
                     </c>
                     <c ca="center">
                        <p>0.0579</p>
                     </c>
                     <c ca="center">
                        <p>0.5685</p>
                     </c>
                     <c ca="center">
                        <p>25.98%</p>
                     </c>
                     <c ca="center">
                        <p>0.0228</p>
                     </c>
                     <c ca="center">
                        <p>0.0420</p>
                     </c>
                     <c ca="center">
                        <p>0.4364</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>YN</p>
                     </c>
                     <c ca="center">
                        <p>25.79%</p>
                     </c>
                     <c ca="center">
                        <p>0.0302</p>
                     </c>
                     <c ca="center">
                        <p>0.0646</p>
                     </c>
                     <c ca="center">
                        <p>0.5237</p>
                     </c>
                     <c ca="center">
                        <p>25.33%</p>
                     </c>
                     <c ca="center">
                        <p>0.0297</p>
                     </c>
                     <c ca="center">
                        <p>0.0564</p>
                     </c>
                     <c ca="center">
                        <p>0.5790</p>
                     </c>
                     <c ca="center">
                        <p>23.81%</p>
                     </c>
                     <c ca="center">
                        <p>0.0506</p>
                     </c>
                     <c ca="center">
                        <p>0.0595</p>
                     </c>
                     <c ca="center">
                        <p>0.3893</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>MYN</p>
                     </c>
                     <c ca="center">
                        <p>24.34%</p>
                     </c>
                     <c ca="center">
                        <p>0.0299</p>
                     </c>
                     <c ca="center">
                        <p>0.0719</p>
                     </c>
                     <c ca="center">
                        <p>0.4792</p>
                     </c>
                     <c ca="center">
                        <p>26.87%</p>
                     </c>
                     <c ca="center">
                        <p>0.0307</p>
                     </c>
                     <c ca="center">
                        <p>0.0516</p>
                     </c>
                     <c ca="center">
                        <p>0.6312</p>
                     </c>
                     <c ca="center">
                        <p>23.72%</p>
                     </c>
                     <c ca="center">
                        <p>0.0493</p>
                     </c>
                     <c ca="center">
                        <p>0.0601</p>
                     </c>
                     <c ca="center">
                        <p>0.3863</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>&#947;-MYN</p>
                     </c>
                     <c ca="center">
                        <p>24.34%</p>
                     </c>
                     <c ca="center">
                        <p>0.0305</p>
                     </c>
                     <c ca="center">
                        <p>0.0741</p>
                     </c>
                     <c ca="center">
                        <p>0.4768</p>
                     </c>
                     <c ca="center">
                        <p>26.89%</p>
                     </c>
                     <c ca="center">
                        <p>0.0324</p>
                     </c>
                     <c ca="center">
                        <p>0.0530</p>
                     </c>
                     <c ca="center">
                        <p>0.6309</p>
                     </c>
                     <c ca="center">
                        <p>23.72%</p>
                     </c>
                     <c ca="center">
                        <p>0.0515</p>
                     </c>
                     <c ca="center">
                        <p>0.0626</p>
                     </c>
                     <c ca="center">
                        <p>0.3847</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>The results showed a few interesting trends. First, GY performs in a similar way as YN does as compared to MYN and <it>&#947;</it>-MYN; it is consistent with our previous simulation results, as they share a common consideration of transition/transversion rate bias and nucleotide frequencies bias <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B42">42</abbr></abbrgrp>. Second, the trends of &#969; estimates with the three methods, YN, MYN and <it>&#947;</it>-MYN, are consistent with our simulation results. In the cases of |&#954;<sub>R </sub>- &#954;<sub>Y</sub>| &#8804; 0.4 and &#954;<sub>R </sub>- &#954;<sub>Y </sub>> 0.4, when &#954;<sub>R </sub>= 4, 5, 6, 7, 8, 9, and 10, the order of estimated values of &#969; is: <it>&#947;</it>-MYN &lt; MYN &lt; YN. When confined to &#954;<sub>R </sub>- &#954;<sub>Y </sub>&lt; -0.4, when &#954;<sub>R </sub>= 1, 2, and 3, YN underestimates &#969; and MYN overestimates &#969; as compared to <it>&#947;</it>-MYN. Taking &#969; estimates as an example, they are 0.1695, 0.1800, and 0.1713 for human-dog orthologs, 0.1465, 0.1566, and 0.1468 for human-mouse orthologs, and 0.5790, 0.6312 and 0.6309 for human-chimp orthologs, calculated with YN, MYN, and <it>&#947;</it>-MYN, respectively. These findings are in agreement with our simulation studies. Third, the orders of S% estimates with the three methods (YN, MYN and <it>&#947;</it>-MYN) are also consistent with our simulation results. For example, when &#954;<sub>R </sub>- &#954;<sub>Y </sub>> 0.4, &#954;<sub>R </sub>= 5, 6, 7, 8, 9 and 10, YN overestimates S% and MYN underestimates S% as compared to <it>&#947;</it>-MYN. In the case of &#954;<sub>R </sub>- &#954;<sub>Y </sub>&lt; -0.4, when &#954;<sub>R </sub>= 1, 2, and 3, the order of estimated values of S% is: YN &lt; MYN &lt;<it>&#947;</it>-MYN. Fourth, we took one gene as an example to show the outperformance of our new method over others. Among the human-chimp orthologs, &#969; values of an immunoglobulin interleukin-1-related receptor (NP_068577) are listed as 1.02406, 0.622611, 0.59999, and 0.843755, when <it>&#947;</it>-MYN, MYN, YN, and GY are used for the calculation, respectively. Obviously, only <it>&#947;</it>-MYN is able to detect positive selection for this gene, and others failed. This gene has been studied previously based on a population genetics analysis of extended-haplotype-homozygosity in Northeast Asians, and a possible positive selection scheme was proposed for it <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. This result is in accordance with the result of large-scale scanning on positively selected genes between human and chimpanzee genomes <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Program availability and performance</p>
            </st>
            <p>A C++ program implementing <it>&#947;</it>-MYN method is included in the updated KaKs_Calculator <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>, available upon request. And we tested the running time with YN, MYN, <it>&#947;</it>-MYN, and GY, using the three testing datasets (14,725 human-dog, 16,368 human-mouse, and 15,646 human-chimp gene pairs). Table <tblr tid="T5">5</tblr> shows the time consumption for each method to compute Ka/Ks ratios from the three datasets and their average running time. On average, <it>&#947;</it>-MYN takes 600 folds less time than GY does, and YN, MYN, and <it>&#947;</it>-MYN perform similarly in time consumption. We believe that <it>&#947;</it>-MYN may become a useful tool for large-scale studies, when ML-based methods (such as GY) are deemed time-consuming.</p>
            <tbl id="T5">
               <title>
                  <p>Table 5</p>
               </title>
               <caption>
                  <p>Timing comparisons on YN, MYN, &#947;-MYN and GY methods</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="center">
                        <p>Method</p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>Time Required, seconds (hr:min:sec)</p>
                     </c>
                     <c ca="center">
                        <p>Average</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="3">
                        <hr/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>human-dog</p>
                     </c>
                     <c ca="center">
                        <p>human-mouse</p>
                     </c>
                     <c ca="center">
                        <p>human-chimp</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>YN</p>
                     </c>
                     <c ca="center">
                        <p>332(0:5:32)</p>
                     </c>
                     <c ca="center">
                        <p>389(0:6:29)</p>
                     </c>
                     <c ca="center">
                        <p>280(0:4:40)</p>
                     </c>
                     <c ca="center">
                        <p>334(0:5:34)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>MYN</p>
                     </c>
                     <c ca="center">
                        <p>529(0:8:49)</p>
                     </c>
                     <c ca="center">
                        <p>641(0:10:41)</p>
                     </c>
                     <c ca="center">
                        <p>396(0:6:36)</p>
                     </c>
                     <c ca="center">
                        <p>522(0:8:42)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>&#947;-MYN</p>
                     </c>
                     <c ca="center">
                        <p>533(0:8:53)</p>
                     </c>
                     <c ca="center">
                        <p>639(0:10:39)</p>
                     </c>
                     <c ca="center">
                        <p>395(0:6:35)</p>
                     </c>
                     <c ca="center">
                        <p>522(0:8:42)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>GY</p>
                     </c>
                     <c ca="center">
                        <p>154309(42:51:49)</p>
                     </c>
                     <c ca="center">
                        <p>233899(64:58:19)</p>
                     </c>
                     <c ca="center">
                        <p>602381(167:19:41)</p>
                     </c>
                     <c ca="center">
                        <p>330196(91:43:16)</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The analyses were performed on IBM HS21, INTEL 5335 2.0GHz, memory of 16GB, ROCKS LINUX 4.3 X86-64 platform.</p>
               </tblfn>
            </tbl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <sec>
            <st>
               <p>Why should we continue developing Ka/Ks methods?</p>
            </st>
            <p>A major limitation of Ka/Ks methods, mentioned in literatures, is their poor ability for detecting positive selection (adaptive selection) <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr></abbrgrp>. To detect positive selection at sites requires that &#969; value averaged over all branches is >1 and to detect positive selection along lineages requires &#969; value averaged over all sites is >1 <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Therefore, Ka/Ks methods are only useful to weight average selection pressure over sites and branches. They may not be able to detect positive selection for some highly conserved proteins that are mostly invariable but become fragile when a single site alters. Other detrimental cases include transmembrane domains where high variability may not change its physiochemical property. To overcome the weakness, there have been methods developed, such as Likelihood Ratio Test (LRT) <abbrgrp><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr></abbrgrp> implemented in PAML <abbrgrp><abbr bid="B57">57</abbr></abbrgrp> and Hyphy software <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>, to identify positive selection <abbrgrp><abbr bid="B45">45</abbr><abbr bid="B59">59</abbr><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr></abbrgrp>, which tend to be qualitative. An obvious pitfall of these methods is that they do not weigh the relative degree of two genes under negative selection. Ka/Ks methods can perform better in this regard <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B62">62</abbr><abbr bid="B63">63</abbr><abbr bid="B64">64</abbr></abbrgrp> as they tend to be more quantative. In addition, Ka/Ks methods can be readily extended from our current work to detect the sequence alternations that lead to protein structure changes and positive selection, in combination with other techniques, such as ancestral sequence reconstruction <abbrgrp><abbr bid="B65">65</abbr><abbr bid="B66">66</abbr><abbr bid="B67">67</abbr></abbrgrp> and primary <abbrgrp><abbr bid="B68">68</abbr><abbr bid="B69">69</abbr></abbrgrp> or tertiary windowing <abbrgrp><abbr bid="B70">70</abbr><abbr bid="B71">71</abbr></abbrgrp>. Therefore, we believe that the two different lines of methods (LRT-like methods and Ka/Ks methods) should also be useful under appropriate conditions.</p>
         </sec>
         <sec>
            <st>
               <p>Why should we introduce new parameters?</p>
            </st>
            <p>With the introduction of the parameter <it>&#945;</it>, our method <it>&#947;</it>-MYN shows significant improvements when compared with the other two related methods in both simulation and tests on real data. As <it>&#947;</it>-MYN assumes that evolutionary rate at each site follows <it>&#947;</it>-distribution, we found that the parameter &#945; has observable effects under different evolution rates. For instance, when &#969; >= 1, <it>&#947;</it>-MYN remains stable. We also observed that selective pressure can overwhelm the variable substitution rates across sites and it becomes the most influential factor when increasing dramatically. Therefore, when we consider strong positive selection and neutral evolution, the effect of variable substitution rates across sites can be somewhat neglected. In addition, more parameters often lead to increase of complexity of an algorithm, resulting in the decrease of efficiency. However, we hold the view that proper introduction of parameters is worthwhile.</p>
            <p>It has been noticed that the majority of the evolutionary selections are actually negative in nature, and the statement is confirmed by our analyses on real data. When &#969; varies from 0.1 to 1, we selected optimal <it>&#945; </it>to minimize biases and found that <it>&#947;</it>-MYN is very sensitive when <it>&#945; </it>changes under negative selection. Furthermore, the optimal &#945; becomes smaller when &#969; becomes larger under negative selection, and the effects of various substitution rates across sites become evident under negative selection, emphasizing the importance of our method in the calculation under negative selection.</p>
            <p>As to &#947;-Tamura-Nei model, it usually leads to higher variations, especially in phylogeny analyses. However, we found some multi-level observations in both simulation and real data testing (see in additional file <supplr sid="S1">1</supplr>). The difference of variations of &#969; between YN and &#947;-MYN (or MYN) seems to be correlated to the value of &#954;<sub>R </sub>- &#954;<sub>Y</sub>, when results in Tables S2-S6 were examined together. For instance, when &#954;<sub>R</sub>-&#954;<sub>Y </sub>&lt; 0, &#969; values vary more when &#947;-MYN is used than YN is. As &#954;<sub>R</sub>-&#954;<sub>Y </sub>increases, the &#969; variation values from &#947;-MYN decrease, leading to lower numbers than those of YN. The variations of &#969; calculated with <it>&#947;</it>-MYN are slight higher than those yielded from MYN in most cases but not all. These results reflect the distinction between the usage of &#947;-Tamura-Nei model (or Tamura-Nei model) in KaKs computation and that of phylogeny reconstruction.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p><b>Standard deviations of &#969; and supplementary method.</b> Additional file 1 contains: (I) the standard deviations of &#969; values that are used in Figure <figr fid="F1">1</figr>, Figure <figr fid="F2">2</figr>, Figure <figr fid="F4">4</figr>, Figure <figr fid="F5">5</figr> and Table <tblr tid="T4">4</tblr>; (II) human codon and rice codon frequencies; and (III) an approach of determining the optimal &#945; values in each test.</p>
               </text>
               <file name="1745-6150-4-20-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>How the variable substitution rates influence the Ka/Ks calculations?</p>
            </st>
            <p>We begin our discussion with how <it>&#945; </it>parameter in <it>&#947;</it>-MYN improves MYN method. It is known that ignoring rate variation among sites leads to underestimation of both the sequence distance and the transition/transversion rate ratio &#954; (both &#954;<sub>R </sub>and &#954;<sub>Y</sub>) <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. &#954; is used not only in estimating S and N but also in generating a transition probability matrix for estimating S<sub>d </sub>and N<sub>d</sub>. If we derive an approximate formula for &#969; = Ka/Ks &#8776; (N<sub>d</sub>/N)/(S<sub>d</sub>/S) (the symbol of "&#8776;" is used to emphasize the absence of correction for multiple substitutions), &#969; is composed of two parts: N<sub>d</sub>/N and S<sub>d</sub>/S. For purifying selection, synonymous substitutions occur more frequently than nonsynonymous ones so we should only focus on Ks (S<sub>d</sub>/S). Since &#954; decrease is related to the reduction of substitution rate between two codons, underestimation of &#954; leads to underestimation of S<sub>d</sub>. In addition, nucleotide transitions between two codons are more likely to be synonymous especially at the third codon positions, underestimation of &#954; leads to underestimation of S. However, the influence of &#954; on S<sub>d </sub>is significantly stronger than that on S. As a consequence, an underestimated &#954;, when used in MYN, may give rise to underestimation of S<sub>d</sub>/S, resulting in overestimation of &#969; as compared with <it>&#947;</it>-MYN. Our theoretical deductions are consistent with both simulation (Figure <figr fid="F2">2</figr>) and real data (Table <tblr tid="T4">4</tblr>).</p>
            <p>In addition, we found the optimal <it>&#945; </it>values fall between 1 and 5 (Figure <figr fid="F3">3</figr>). In these cases, the distribution of gamma values is bell-shaped, meaning that most sites have intermediate rates around 1 whereas a few sites have either very low or very high rates <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. When selection pressure increases, the number of sites of intermediate rates decreases (Figure <figr fid="F3">3</figr>). In particular, when &#945; approaches infinity, the distribution diminishes into the model of a single rate for all sites, which is used in MYN method. If <it>&#945; </it>&#8804; 1, the distribution has a highly skewed L-shape, suggesting that most sites have either very low rates of substitution or are nearly "invariable" with possible substitution hotspots. Furthermore, estimates of <it>&#945; </it>from real data in many species over multiple sequences show increases from 0.26 to 3.0, and this relatively wide window allows us to explore the spectrum of different substitution rates over different sites <abbrgrp><abbr bid="B4">4</abbr></abbrgrp></p>
         </sec>
         <sec>
            <st>
               <p>Applications of our new method</p>
            </st>
            <p>Divergence time (<it>t</it>) is another parameter important for the estimation of Ka and Ks. When divergence time reaches the extremes, the compared sequences among genes often vary considerably and their corresponding protein structures may changed over greater evolutionary time scale. Therefore, under such conditions it may become meaningless to calculate the substitution rates of such genes. However, most methods for calculating Ka and Ks use homologous genes for estimating substitution rates among closely-related species or within close lineages, and observable selections are mostly negative. Since our method <it>&#947;</it>-MYN has better performance than other methods when &#969; &lt; 1, it provides a useful alternative for more comprehensive Ka and Ks calculations.</p>
            <p>Our past work has testified that methods for estimating Ka and Ks should be used cautiously and one should not draw simple conclusions on gene evolution from Ka and Ks analyses based on a single method. Therefore, we recommend a method based on model selection and model averaging <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B42">42</abbr></abbrgrp>, and <it>&#947;</it>-MYN has just brought a new choice into such endeavours. Our method does not challenge other methods such as GY (Goldman-Yang) method, a typical maximum likelihood (ML) that has been always considered to be the method of choice <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B33">33</abbr></abbrgrp>. It has been suggested in the literature that GY and YN both give rise to similar estimates on Ka and Ks primarily due to the fact that they both take account of major dynamic features of DNA sequence evolution, including transition/transversion rate and nucleotide/codon frequency biases <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B42">42</abbr></abbrgrp>. As <it>&#947;</it>-MYN performs better than other methods under certain conditions, despite the fact that its advantages seemed less obvious under other conditions, we believe that <it>&#947;</it>-MYN may become a useful tool for large-scale sequence analysis when ML-based methods are deemed time-consuming.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We compared <it>&#947;</it>-MYN with two other methods, YN and MYN, by examining long sequences, performing computer simulations, and analyzing real datasets. Since neglecting the variation of substitution rates among different sites may lead to biased estimates, our new method has minimal deviations when parameters vary within normal ranges defined by empirical data. <it>&#947;</it>-MYN performs better when genes are under strong purifying selection and comparable to the other two methods when genes are under positive selection or remain neutral. In addition, we showed that biased estimates of Ka and Ks primarily originate not only from biased estimates of &#954;&#8211;or both &#954;<sub>R </sub>and &#954;<sub>Y</sub>&#8211;but also from the neglect of variable substitution rates.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Mutation model</p>
            </st>
            <p>In Markov-chain models of codon substitution, the codon triplet is considered as the unit of evolution, and a Markov chain is used to describe substitutions from one codon to another codon <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. In detail, the state space of the chain is the sense codons with regard to the canonical genetic code. Stop codons are not allowed inside a functional protein and are not considered. Although there are several mutation (substitution) models that take different sequence variation features into account, in this report we limit our discussions to the Tamura-Nei Models <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> (see Table S1 in additional file <supplr sid="S2">2</supplr> for details).</p>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p><b><b>Description of the Tamura-Nei model and the detailed derivations of &#954;<sub>R </sub>and &#954;<sub>Y</sub></b>.</b> Additional file 2 contains supplementary tables and derivation process in this study. This file contains two sections: section I shows the description of the Tamura-Nei model and section II details the procedures for deducing &#954;<sub>R </sub>and &#954;<sub>Y</sub>.</p>
               </text>
               <file name="1745-6150-4-20-S2.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p><it>&#947;</it>-MYN also needs a transition probability matrix similar to YN and MYN. We assigned the substitution rate q<sub>ij </sub>from any codon i to j (i &#8800; j) to generate a transition probability matrix as follows:</p>
            <p>
               <display-formula id="M1">
                  <graphic file="1745-6150-4-20-i1.gif"/>
               </display-formula>
            </p>
            <p>The diagonal elements of the transition probability matrix, <it>Q </it>= {<it>q</it><sub><it>ij</it></sub>}, are determined based on the mathematical requirement that the row sums equal to zero. The matrix is normalized with the result that the sum over non-diagonal terms is 1.</p>
         </sec>
         <sec>
            <st>
               <p>Estimating &#954;<sub>R </sub>and &#954;<sub>Y</sub></p>
            </st>
            <p>To generate the transition probability matrix, we need to estimate &#954;<sub>R </sub>and &#954;<sub>Y</sub>. Similar to YN and MYN, we calculated four nucleotide frequencies (g<sub>T</sub>, g<sub>C</sub>, g<sub>A</sub>, g<sub>G</sub>), proportions of transitional differences between purines (T<sub>R</sub>), and between pyrimidines (T<sub>Y</sub>), and the proportion of transversional differences (V) from compared sequences:</p>
            <p>
               <display-formula id="M2">
                  <graphic file="1745-6150-4-20-i2.gif"/>
               </display-formula>
            </p>
            <p>where g<sub>R </sub>= g<sub>A </sub>+ g<sub>G </sub>and g<sub>Y </sub>= g<sub>T </sub>+ g<sub>C</sub>. Note that &#945; is the square of the inverse of the variation coefficient in the gamma function.</p>
            <p>We then used equation 3 to estimate &#954;<sub>R </sub>and &#954;<sub>Y</sub>.</p>
            <p>
               <display-formula id="M3">
                  <graphic file="1745-6150-4-20-i3.gif"/>
               </display-formula>
            </p>
            <p>
               <display-formula id="M4">
                  <graphic file="1745-6150-4-20-i4.gif"/>
               </display-formula>
            </p>
            <p>The detailed procedures for deducing &#954;<sub>R </sub>and &#954;<sub>Y </sub>were summarized in additional file <supplr sid="S2">2</supplr>. We also made other modifications accordingly, such as using &#954;<sub>R </sub>and &#954;<sub>Y </sub>to estimate S and N, generating relevant transition probability matrix (Equation 1), considering different transitional evolution pathways to count S<sub>d </sub>and N<sub>d</sub>, and correcting for multiple substitutions when estimating Ka and Ks (Equation 4; <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>).</p>
         </sec>
         <sec>
            <st>
               <p>The algorithm</p>
            </st>
            <p>When compared to MYN and YN, our <it>&#947;</it>-MYN method considers that the rate of nucleotide substitution <it>&#955; </it>approximately follows <it>&#947;</it>-distribution. In fact, if the rate of nucleotide substitution <it>&#955; </it>is the same for all sites considered, the model becomes the model that is used in MYN.</p>
            <p><it>&#947;</it>-MYN uses an iterative approach to estimate Ka and Ks. Before iteration, <it>&#947;</it>-MYN computes nucleotide frequencies (regarding to the three codon positions), &#954;<sub>R</sub>, &#954;<sub>Y</sub>, S, and N from the sequences to be analyzed. Based on the F3 &#215; 4 model <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>, codon frequencies are calculated by multiplying each nucleotide frequencies. &#954;<sub>R </sub>and &#954;<sub>Y </sub>are estimated from four-fold degenerate sites at the third codon position and non-degenerate sites. S and N are calculated by using &#954;<sub>R</sub>, &#954;<sub>Y</sub>, and codon frequencies. <it>&#947;</it>-MYN chooses initial values for <it>t </it>and &#969; as starting point for iteration. It generates a transition probability matrix that represents substitution probabilities from one codon to another by using &#969;, <it>t</it>, &#954;, and codon frequencies. This transition probability matrix is subsequently used to deduce S<sub>d </sub>and N<sub>d </sub>and for new estimates of &#969; and <it>t</it>. <it>&#947;</it>-MYN repeats the calculation for another transition probability matrix, until the algorithm converges.</p>
         </sec>
         <sec>
            <st>
               <p>Comparative analysis on Ka and Ks estimations</p>
            </st>
            <p>We used simulated sequences generated from hypothetical common ancestral sequences for our comparative analysis by randomly choosing codons (61 excluding stop codons) from the ancestral sequences according to codon frequencies that were derived from three empirical datasets: (1) equal codon frequencies <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, (2) human codon frequencies deduced from 39,420 human protein-coding genes from ENSEMBL database (Release 35; <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>) and (3) rice codon frequencies deduced from 19,079 rice protein-coding genes <abbrgrp><abbr bid="B38">38</abbr></abbrgrp> (see in additional file <supplr sid="S1">1</supplr>).</p>
            <p>In addition to codon frequencies, we also have to fix or choose ranges of other parameters for the simulation, including sequence length, divergence time (<it>t</it>), two ratios of transitional rate between purines (&#954;<sub>R</sub>) and between pyrimidines (&#954;<sub>Y</sub>) to transversional rate, and selective pressure &#969;. Although &#969; varies from gene to gene, &#969; = 1 and 3 can be regarded as "typical values" for neutral mutation and positive selection, respectively <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B13">13</abbr><abbr bid="B67">67</abbr></abbrgrp>, which are observable from real datasets. Since most calculated &#969; values indicate negative selection and variation of parameter &#945; has stronger influence under negative selection, we analyzed the variation of &#969; in a range of 0.1 to 0.9 for the evaluation of effects of &#945; on &#969;. To accurately examine the effect of one parameter and to avoid stochastic errors arising from other factors, we generated 2,000 pairs of sequences. Three orthologous gene sets were downloaded from NCBI's HomoloGene database (Build 61), which contained 14,725 human-dog, 16,368 human-mouse, and 15,646 human-chimp gene pairs <abbrgrp><abbr bid="B72">72</abbr></abbrgrp>. We considered "NA" occurrence (in any of Ka, Ks or &#969;) as unreliable data and filtered the orthologous pairs (extremes in sequence homology) that have such labels, and 14,323 human-dog, 16,066 human-mouse, and 12,351 human-chimp gene pairs were remained. The datasets were used for comparing <it>&#947;</it>-MYN with other methods.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The authors declare that they have no competing interests.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>DPW deduced formulas and analyzed the data. DPW, HLW and SZ drafted the manuscript. SZ programmed this new method. DPW, HLW and SZ carried out computer simulations. JY supervised the research and revised the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
      <sec>
         <st>
            <p>Reviewers' comments</p>
         </st>
         <sec>
            <st>
               <p>Reviewer 1</p>
            </st>
            <p>Kateryna Makova (assisted by Mr. Chungoo Park), Center for Comparative Genomics and Bioinformatics, Department of Biology, 305 Wartik Lab, Penn State University, University Park, PA, 16802</p>
            <p>In this paper, the authors suggest a new algorithm, called &#947;-MYN, to accurately estimate nonsynonymous and synonymous substitution rates (Ka and Ks) of protein-coding DNA.</p>
            <p>The &#947;-MYN considers the variation of substitution rates among different sites in a sequence, which is overlooked by existing methods, and shows that their unequal substitution rates affect Ka and Ks.</p>
            <sec>
               <st>
                  <p>Specific comments</p>
               </st>
               <p>1. Authors should highlight standard deviations of &#969; (as well as Ka and Ks) in all tests and show significance of difference in all comparisons.</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>Agreed. We added the standard deviations for the calculations of &#969;, Ka, Ks and S% in key analyses in the supplementary materials</it>.</p>
               <p>2. To represent distinct codon usages by two genomes (human and rice), authors should reveal their codon usage difference in the paper.</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>Agreed. We added the tables for human codon frequencies and rice codon frequencies to the supplementary materials</it>.</p>
               <p>3. Why were some tests carried out for rice genome, while the other tests for human genome? For example, rice sequences (and not human sequences) were used to study the effect of sequence lengths.</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>Human and rice were selected to represent the genomes of animals and plants in this research. In the part of "testing the effect of codon frequencies", we did not observe any significant differences when use different codon frequencies. Under most conditions, we chose to use human codon frequencies</it>.</p>
               <p>4. How are optimal &#945; values statistically determined in each test?</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>We added the details to the supplementary materials</it>.</p>
               <p>5. The authors find that human-chimp orthologs have higher &#969; values than human-dog (or human-mouse) orthologs and thus claim that many genes (i.e., 1075 from human-chimp orthologs; 25 and 14 for human-dog and human-mouse, respectively) evolve under strong positive selection. However, this is incongruent with Bakewell et al. (2007; PMID: 17449636) in terms of the number of genes under positive selection and &#969; values. Authors should discuss the difference in detail.</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>Bakewell et al</it><abbrgrp><abbr bid="B44">44</abbr></abbrgrp><it>indeed identified two gene sets, 154 and 233 positive selected genes or PSG for human and chimpanzee lineages, respectively, the authors also claimed that the branch-site likelihood method was not able to detect all PSGs according to their results from computer simulation. Therefore, we speculate that both their and our estimates about the numbers of PSG are not thorough enough, limited by our methodology. In our calculation, we did not distinguish the two lineages, only computed the average values across the lineages, and did not consider the common ancestor of human and chimpanzee. We believe that only an in-depth population genetic analysis may resolve such issues. As far as &#969; is concerned, our methods are based on the raw definition of Ka/Ks and their methods are based on branch-site test. Interestingly, most of the functional categories of PSG genes in both studies overlap significantly, especially in "protein metabolism &amp; modification" and "stress response and immunity"</it>.</p>
               <p>6. Regardless of the number of codons and divergence time (t), why do all three methods (&#947;-MYN, MYN, and YN) overestimate &#969; values?</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>We believe that this is perhaps due to similar assumptions with the same parameter settings (i.e. &#954;<sub>R </sub>= 10, &#954;<sub>Y </sub>= 1) in the two calculations. However, the differences are also obvious as we tried to demonstrate throughout our manuscript</it>.</p>
               <p>7. It is not clear how the "unreliable data" from three orthologous gene sets were excluded.</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>We addressed this in our revised manuscript</it>.</p>
               <p>8. It is not clear how multiple splicing variants for each gene were handled to obtain codon frequencies.</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>We did not distinguish splicing variants in our study. In other words, we assumed that codon frequencies are the same for all possible alternatively spliced forms</it>.</p>
               <p>9. It is surprising that a set of human-dog gene pairs took longer to compute Ka/Ks ratios than that of human-chimp gene pairs(533 sec versus 395 sec for &#947;-MYN), even though the number of human-dog gene pairs was smaller than that of human-chimp gene pairs (14,725 versus 15,646).</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>As the sequence variation of individual gene pairs governs the time for the calculation, the required time is not proportional to the number of gene pairs but the number of effective sites</it>.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Reviewer 2</p>
            </st>
            <p>David A. Liberles, Department of Molecular Biology, University of Wyoming, Laramie WY 82071, USA</p>
            <p>"&#947;-MYN: A new algorithm for estimating Ka and Ks with consideration of variable substitution rates" by Wang, Wan, Zhang, and Yu describes a more model rich version of the original Yang and Nielsen 2000 model <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Previous work added parameters to differentiate between purine transitions and pyrimidine transitions <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. The current work adds a gamma distribution on top of the previously described work.</p>
            <p>The stepwise addition of parameters to the Yang and Nielsen approach reflects an attempt to add increasing layers of biological realism. Differentiating between purine and pyrimidine transitions is driven by potential underlying forces like codon bias to the extent that it is correlated across codons in a gene and the chemistry (specificity) of DNA damaging agents, DNA polymerase, and DNA repair enzymes (see <abbrgrp><abbr bid="B73">73</abbr></abbrgrp> for example). The biological link to the gamma distribution is somewhat less clear in the way that it has been applied. Nucleotide sequences as well as amino acid sequences typically show support for a gamma distribution characterizing rates across sites. At the nucleotide level, this is typically related to two components: differences in substitution rate in the different codon positions due to the nature of the genetic code as well as amino acid level constraint on the protein. The former category is already modeled with &#969;, potentially creating some degree of redundancy between the &#969; and &#945; parameters. Modeling &#945; at the amino acid level (translated codons) would not suffer from the redundancy and likely accounts for the improvement in performance by &#947;-MYN.</p>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>We agree with reviewer's excellent explanation why &#947;-MYN is capable of improving the performance of omega calculation. Modeling &#945; at the amino acid level avoids suffering from the parameter redundancy especially when genes are subjected to negative selection as in one of our unpublished results, we found that the interplay of &#945; parameter and other evolutionary features may show some degrees of redundancy</it>.</p>
               <p>The authors show the improved performance of &#947;-MYN on simulated data, where the correct answer is known. This is necessary but not sufficient to support the use of additional parameters on real data, as the model is effectively recovering itself on simulated data. The authors do apply &#947;-MYN to mammalian comparative genomic data. However, it should be possible to evaluate the likelihood of the sequence data given the model and its parameterization for &#947;-MYN compared to simpler models and to evaluate the performance of the models with AIC, even if the methods are approximate rather than proper likelihood estimates.</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>In our previous work, we did incorporate AIC to KaKs calculations </it><abbrgrp><abbr bid="B42">42</abbr></abbrgrp><it>and found that the selected models in the calculation did not depend on combinations of various parameters. We speculate that &#947;-MYN perhaps may not be the best choice under certain conditions, when the smallest AIC is considered as the criteria. We will incorporate AIC into our new model and the updated KaKs Calculator (the software through model selection and model averaging)</it>.</p>
               <p>A more minor point is that the authors suggest that approximate methods need to average over all sites or all branches. Based upon earlier work using ancestral sequence reconstruction coupled with counting methods <abbrgrp><abbr bid="B65">65</abbr><abbr bid="B66">66</abbr><abbr bid="B67">67</abbr></abbrgrp>, approximate methods have been developed or can easily be extended from current work based upon primary windowing to detect selective sweeps <abbrgrp><abbr bid="B68">68</abbr><abbr bid="B69">69</abbr></abbrgrp> and tertiary windowing to detect structural covariation leading to positive selection <abbrgrp><abbr bid="B70">70</abbr><abbr bid="B71">71</abbr></abbrgrp>. This should probably be discussed when discussing the power of these methods.</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>We expanded our discussion into this issue. The power of detecting positive selection in KaKs methods certainly can be enhanced by introductions of ancestral sequence reconstruction and sliding windows. However, it is still an interesting question as which one is better when compared to the LRT methods</it>.</p>
               <p>Further development of models based upon mechanistic molecular and biological underpinnings is always a welcome addition to the literature. A number of problems from multiple sequence alignment to amino acid-based phylogeny to problems in detecting positive selection suffer from the divorce of common models from underlying processes. Well-performing mechanistic models will be broadly applicable across bioinformatics.</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>We fully agree with this comment. In the real world, sequence analysis can be complex and difficult. Therefore, models considering more biological parameters lay foundations for broader applications, especially in the field of molecular evolution (phylogeny tree reconstruction and mechanics of evolution dynamics)</it>.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Reviewer 3</p>
            </st>
            <p>Zhaolei Zhang, Banting &amp; Best Dept. of Medical Research (BBDMR), Department of Molecular Genetics, University of Toronto, 160 College St., Room 608, Donnelly CCBR Building, Toronto, ON M5S 3E1, Canada</p>
            <p>"<b>&#947; (gamma)-MYN: A new algorithm for estimating Ka and Ks with consideration of variable substitution rates</b>"</p>
            <p>Authors: Da-Peng Wang, Hao-Lei Wan, Song Zhang, and Jun Yu.</p>
            <sec>
               <st>
                  <p>General comments</p>
               </st>
               <p>This manuscript describes a new method to estimate the ratio of Ka/Ks taking into account the evolutionary rate variation. Ka/Ks ratio is commonly used as an indicator of selective pressure acting on protein-coding genes. Current methods mostly use simplified substitution models, which may have effect on the estimation of Ka/Ks. Here, based on their previous work, the authors present a new method that the evolutionary rates across sites are modeled by a gamma distribution. Using both simulated and real data, the authors show that the new method performs better than current methods under some conditions.</p>
               <p>The novelty of this manuscript is that this is the first Ka/Ks estimation method that considers the rate variation among sites. It is an important contribution to the scientific communities that use Ka/Ks in their research, and likely will open avenues for new researches in this area.</p>
            </sec>
            <sec>
               <st>
                  <p>Specific comments</p>
               </st>
               <p>I found the overall writing is clear, albeit a little verbose at some places.</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>We revised the manuscript again for clarity</it>.</p>
            </sec>
            <sec>
               <st>
                  <p>Concern of overfitting</p>
               </st>
               <p>Can the authors address the concern of over-fitting by introducing additional parameters?</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>We would like to address this issue as it has been a major concern all along. First, more complex models (e.g. allowing for the correlation of substitution rates at adjacent sites and thus parameter-rich) used in phylogenetic analyses usually produce similar results to simple gamma models </it><abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. <it>Second, the opposite is usually true also as some methods with simple assumptions often lead to similar results over complex ones. For instance, Nei </it><abbrgrp><abbr bid="B7">7</abbr></abbrgrp><it>developed a simple method (giving no weights to different types of codon substitutions) that gives essentially the same results as those more complicated methods (such as giving different evolutionary pathways different weights). Third, more parameters usually lead to higher sensitivities to sequence variations albeit untenable in certain cases, especially when testing some real data. However, setting more parameters, especially by estimating their optimal ranges, we should be able to assess the relationships between parameters and the characteristics of the real data as well as tradeoffs between parameters and models</it>.</p>
            </sec>
            <sec>
               <st>
                  <p>Specific examples</p>
               </st>
               <p>Is it possible for the authors to show one specific example (a gene) that the new method out-performs other methods, i.e. the conclusion is more biologically relevant?</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>Limited by the manuscript length, we decided to show our analysis on one real gene in the "testing real data" section</it>.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Reviewer 4</p>
            </st>
            <p>Shamil Sunyaev, Harvard Medical School, Boston, MA, United States</p>
            <p>This manuscript presents a new method to compute Ka and Ks. The authors incorporated Tamura-Nei model into the Yang-Nielsen approach. This extension of the method would be of interest to experts in molecular evolution. I have a few comments and suggestions.</p>
            <p>1) The manuscript would benefit from a much clearer justification of the method and discussion of its applicability. Tamura-Nei model was developed for the control region of the mitochondrial genome. Is the model with the uniform selective constraint (reflected by parameter omega) and raw mutation rates following gamma distribution realistic for nuclear protein coding genes? The dominant source of mutation rate variation in mammalian genes (used by the authors for testing the method) is likely to be the context-dependency, predominantly due to CpG contexts. Is the gamma distribution model capable of capturing this variation? Different rates for transitions between purines and pyrimidines imply strong strand bias. Is this a realistic assumption in nuclear genes and does it justify incorporation of additional parameters? Also, Tamura-Nei distance is known to have higher variance. How is this reflected in the performance of the method? I would suggest discussing these issues in the introduction and discussion sections.</p>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>We are grateful to the valuable suggestions and revised relevant text accordingly. After it was brought forth by Professor Ziheng Yang in researching globin genes </it><abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, <it>gamma distribution has been widely used in characteristic of variable substitution in coding genes </it><abbrgrp><abbr bid="B74">74</abbr></abbrgrp>, <it>especially in phylogeny analyses. In our method, we only computed the raw omega value (averaging all sites in a gene) based on raw mutation rates following gamma distribution. But this can be easily expanded to omega variations among sites in a manner using the sliding-window methods when necessary. It was proposed that nucleotide substitutions in both coding and noncoding regions are context-dependent in the sense that substitution rates depend on the identity of neighboring bases by adopting an approach of incorporating gamma distribution </it><abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. <it>Furthermore, models that allow for the correlation of substitution rates at adjacent sites were also developed </it><abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. <it>However, as these models tend to produce results similar to the simple gamma model and variations of &#945; can make the distribution suitable for accommodating different levels of rate variation in various data sets </it><abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, <it>we chose the simple gamma distribution as the depiction of raw various mutation rates. As to the difference between purine and pyrimidine transitions, they are driven by potential underlying forces such as codon bias to the extent that it is correlated across codons in a gene and the chemistry (specificity) of DNA synthesis, damaging agents, DNA polymerase, and DNA repair enzymes </it><abbrgrp><abbr bid="B73">73</abbr></abbrgrp>. <it>In our computer simulations, we found that the new method did not always have higher variations in related parameter estimations as in compared with other methods</it>.</p>
               <p>2) I suggest that the presentation of the manuscript will be improved. For example, it is not clear that by gamma distribution of substitution rates (and, in general, by variable substitution rates) the authors mean gamma distribution of raw mutation rate rather than gamma distribution of omega.</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>Done. We revised the text to clarify this point. We do mean &#947; distribution of the raw mutation rate rather than omega</it>.</p>
               <p>3) Tests on real data: I would suggest eliminating the discussion of positive selection between human and chimpanzee. Small number of substitutions and relaxation of selection due to small effective population size may easily lead to the observed increase in genes with Ka/Ks > 1. Also, I do not see why human-chimpanzee comparison would be a good test of the method because there are essentially no multiple hits, so any method including simple counting should be reliable. A good test would be the analysis of known examples of proteins evolving under positive or negative selection and demonstration that the new method has higher power to detect selection (e.g. using fewer species or partial sequences). I understand, however, that this may be a subject of a separate study.</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>Agreed. We have removed the discussion on positive selection on the human-chimp dataset. We added an example for an evaluation of our new method. We are performing a systematic study on sequences from diverse evolutionary distances and planned to publish the results in separate manuscripts</it>.</p>
               <p>4) Is the software implementation of the method available?</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>Yes. Since the new integrated version of KaKs_Calculator 2.0 is still being programmed, a simple C++ source code package (can be used in Linux) is available upon request from the authors now</it>.</p>
            </sec>
         </sec>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Dr. Zhang Zhang and Miss Yanyang Zhi for their constructive comments on this manuscript. We are also grateful to Prof. Songnian Hu for many thoughtful suggestions and thank all staffs in Beijing Institute of Genomics for their sincere supports. We thank four reviewers for several very good suggestions. This work was supported by the National Basic Research Program of China (2006CB910404) awarded to JY.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <aug>
               <au>
                  <snm>Gillespie</snm>
                  <fnm>JH</fnm>
               </au>
            </aug>
            <source>The Causes of Molecular Evolution</source>
            <publisher>Oxford University Press, USA</publisher>
            <pubdate>1991</pubdate>
         </bibl>
         <bibl id="B2">
            <aug>
               <au>
                  <snm>Kimura</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>The neutral theory of molecular evolution</source>
            <publisher>Cambridge, England, Cambridge University Press</publisher>
            <pubdate>1983</pubdate>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Molecular Evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>WH</fnm>
               </au>
            </aug>
            <publisher>Sunderland, Mass. Sinauer Associates</publisher>
            <pubdate>1997</pubdate>
         </bibl>
         <bibl id="B4">
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Computational Molecular Evolution</source>
            <publisher>Oxford University Press, USA</publisher>
            <pubdate>2006</pubdate>
         </bibl>
         <bibl id="B5">
            <title>
               <p>The Ka/Ks ratio: diagnosing the form of sequence evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Hurst</snm>
                  <fnm>LD</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <fpage>486</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(02)02722-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">12175810</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Statistical methods for detecting molecular adaptation</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Bielawski</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Trends Ecol Evol</source>
            <pubdate>2000</pubdate>
            <volume>15</volume>
            <fpage>496</fpage>
            <lpage>503</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0169-5347(00)01994-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">11114436</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions</p>
            </title>
            <aug>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gojobori</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1986</pubdate>
            <volume>3</volume>
            <fpage>418</fpage>
            <lpage>426</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">3444411</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>A new method for estimating synonymous and nonsynonymous rates of nucleotide substitution considering the relative likelihood of nucleotide and codon changes</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>WH</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>CI</fnm>
               </au>
               <au>
                  <snm>Luo</snm>
                  <fnm>CC</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1985</pubdate>
            <volume>2</volume>
            <fpage>150</fpage>
            <lpage>174</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">3916709</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Unbiased estimation of the rates of synonymous and nonsynonymous substitution</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>WH</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1993</pubdate>
            <volume>36</volume>
            <fpage>96</fpage>
            <lpage>99</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF02407308</pubid>
                  <pubid idtype="pmpid">8433381</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Evolution of the Zfx and Zfy genes: rates and interdependence between the genes</p>
            </title>
            <aug>
               <au>
                  <snm>Pamilo</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bianchi</snm>
                  <fnm>NO</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1993</pubdate>
            <volume>10</volume>
            <fpage>271</fpage>
            <lpage>281</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8487630</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Comparison of three methods for estimating rates of synonymous and nonsynonymous nucleotide substitutions</p>
            </title>
            <aug>
               <au>
                  <snm>Tzeng</snm>
                  <fnm>YH</fnm>
               </au>
               <au>
                  <snm>Pan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>WH</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2004</pubdate>
            <volume>21</volume>
            <fpage>2290</fpage>
            <lpage>2298</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msh242</pubid>
                  <pubid idtype="pmpid" link="fulltext">15329386</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Evaluation of six methods for estimating synonymous and nonsynonymous substitution rates</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genomics Proteomics Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>4</volume>
            <fpage>173</fpage>
            <lpage>181</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1672-0229(06)60030-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">17127215</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Estimating synonymous and nonsynonymous substitution rates under realistic evolutionary models</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2000</pubdate>
            <volume>17</volume>
            <fpage>32</fpage>
            <lpage>43</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10666704</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Computing Ka and Ks with a consideration of unequal transitional substitutions</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>BMC Evol Biol</source>
            <pubdate>2006</pubdate>
            <volume>6</volume>
            <fpage>44</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1552089</pubid>
                  <pubid idtype="pmpid" link="fulltext">16740169</pubid>
                  <pubid idtype="doi">10.1186/1471-2148-6-44</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>The estimate of total nucleotide substitutions from pairwise differences is biased</p>
            </title>
            <aug>
               <au>
                  <snm>Fitch</snm>
                  <fnm>WM</fnm>
               </au>
            </aug>
            <source>Philos Trans R Soc Lond B Biol Sci</source>
            <pubdate>1986</pubdate>
            <volume>312</volume>
            <fpage>317</fpage>
            <lpage>324</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1098/rstb.1986.0010</pubid>
                  <pubid idtype="pmpid" link="fulltext">2870524</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>A method for estimating the number of invariant amino acid coding positions in a gene using cytochrome c as a model case</p>
            </title>
            <aug>
               <au>
                  <snm>Fitch</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Margoliash</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Biochem Genet</source>
            <pubdate>1967</pubdate>
            <volume>1</volume>
            <fpage>65</fpage>
            <lpage>71</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00487738</pubid>
                  <pubid idtype="pmpid">5610702</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>An improved method for determining codon variability in a gene and its application to the rate of fixation of mutations in evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Fitch</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Markowitz</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Biochem Genet</source>
            <pubdate>1970</pubdate>
            <volume>4</volume>
            <fpage>579</fpage>
            <lpage>593</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00486096</pubid>
                  <pubid idtype="pmpid">5489762</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>The spatial distribution of fixed mutations within genes coding for proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Holmquist</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Goodman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Conroy</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Czelusniak</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1983</pubdate>
            <volume>19</volume>
            <fpage>437</fpage>
            <lpage>448</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF02102319</pubid>
                  <pubid idtype="pmpid">6317874</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Fitting discrete probability distributions to evolutionary events</p>
            </title>
            <aug>
               <au>
                  <snm>Uzzell</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Corbin</snm>
                  <fnm>KW</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1971</pubdate>
            <volume>172</volume>
            <fpage>1089</fpage>
            <lpage>1096</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.172.3988.1089</pubid>
                  <pubid idtype="pmpid" link="fulltext">5574514</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Substitution rate variation among sites in hypervariable region 1 of human mitochondrial DNA</p>
            </title>
            <aug>
               <au>
                  <snm>Wakeley</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1993</pubdate>
            <volume>37</volume>
            <fpage>613</fpage>
            <lpage>623</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00182747</pubid>
                  <pubid idtype="pmpid">8114114</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Variations of substitution rates and estimation of evolutionary distances of DNA sequence</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>PhD Thesis</source>
            <publisher>Beijing Agricultural University</publisher>
            <pubdate>1992</pubdate>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Limitations of the evolutionary parsimony method of phylogenetic analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Jin</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1990</pubdate>
            <volume>7</volume>
            <fpage>82</fpage>
            <lpage>102</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">2299983</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Molecular phylogeny of Rodentia, Lagomorpha, Primates, Artiodactyla, and Carnivora and molecular clocks</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>WH</fnm>
               </au>
               <au>
                  <snm>Gouy</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sharp</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>O'HUigin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>YW</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1990</pubdate>
            <volume>87</volume>
            <fpage>6703</fpage>
            <lpage>6707</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">54605</pubid>
                  <pubid idtype="pmpid">2395871</pubid>
                  <pubid idtype="doi">10.1073/pnas.87.17.6703</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees</p>
            </title>
            <aug>
               <au>
                  <snm>Tamura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1993</pubdate>
            <volume>10</volume>
            <fpage>512</fpage>
            <lpage>526</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8336541</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Maximum-likelihood estimation of phylogeny from DNA sequences when substitution rates differ over sites</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1993</pubdate>
            <volume>10</volume>
            <fpage>1396</fpage>
            <lpage>1401</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8277861</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: approximate methods</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1994</pubdate>
            <volume>39</volume>
            <fpage>306</fpage>
            <lpage>314</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00160154</pubid>
                  <pubid idtype="pmpid">7932792</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Comparison of models for nucleotide substitution used in maximum-likelihood phylogenetic estimation</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Goldman</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Friday</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1994</pubdate>
            <volume>11</volume>
            <fpage>316</fpage>
            <lpage>324</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8170371</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Dating of the human-ape splitting by a molecular clock of mitochondrial DNA</p>
            </title>
            <aug>
               <au>
                  <snm>Hasegawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kishino</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yano</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1985</pubdate>
            <volume>22</volume>
            <fpage>160</fpage>
            <lpage>174</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF02101694</pubid>
                  <pubid idtype="pmpid">3934395</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Phylogenetic estimation of context-dependent substitution rates by maximum likelihood</p>
            </title>
            <aug>
               <au>
                  <snm>Siepel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2004</pubdate>
            <volume>21</volume>
            <fpage>468</fpage>
            <lpage>488</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msh039</pubid>
                  <pubid idtype="pmpid" link="fulltext">14660683</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>A Hidden Markov Model approach to variation among sites in rate of evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Felsenstein</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Churchill</snm>
                  <fnm>GA</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1996</pubdate>
            <volume>13</volume>
            <fpage>93</fpage>
            <lpage>104</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8583911</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Among-site rate variation and its impact on phylogenetic analyses</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Trends in Ecology &amp; Evolution</source>
            <pubdate>1996</pubdate>
            <volume>11</volume>
            <fpage>367</fpage>
            <lpage>372</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/0169-5347(96)10041-0</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>A method for estimating the numbers of synonymous and nonsynonymous substitutions per site</p>
            </title>
            <aug>
               <au>
                  <snm>Comeron</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1995</pubdate>
            <volume>41</volume>
            <fpage>1152</fpage>
            <lpage>1159</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00173196</pubid>
                  <pubid idtype="pmpid">8587111</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>A codon-based model of nucleotide substitution for protein-coding DNA sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Goldman</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1994</pubdate>
            <volume>11</volume>
            <fpage>725</fpage>
            <lpage>736</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">7968486</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Kimura</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1980</pubdate>
            <volume>16</volume>
            <fpage>111</fpage>
            <lpage>120</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF01731581</pubid>
                  <pubid idtype="pmpid">7463489</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>A likelihood approach for comparing synonymous and nonsynonymous nucleotide substitution rates, with application to the chloroplast genome</p>
            </title>
            <aug>
               <au>
                  <snm>Muse</snm>
                  <fnm>SV</fnm>
               </au>
               <au>
                  <snm>Gaut</snm>
                  <fnm>BS</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1994</pubdate>
            <volume>11</volume>
            <fpage>715</fpage>
            <lpage>724</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">7968485</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>PAML: a program package for phylogenetic analysis by maximum likelihood</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1997</pubdate>
            <volume>13</volume>
            <fpage>555</fpage>
            <lpage>556</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9367129</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Ensembl 2005</p>
            </title>
            <aug>
               <au>
                  <snm>Hubbard</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Andrews</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Caccamo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cameron</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Clamp</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Clarke</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Coates</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Cunningham</snm>
                  <fnm>F</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>D447</fpage>
            <lpage>453</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540092</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608235</pubid>
                  <pubid idtype="doi">10.1093/nar/gki138</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>The Genomes of Oryza sativa: a history of duplications</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ni</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Dong</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zeng</snm>
                  <fnm>C</fnm>
               </au>
               <etal/>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>e38</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">546038</pubid>
                  <pubid idtype="pmpid" link="fulltext">15685292</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0030038</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Evolution of protein molecules</p>
            </title>
            <aug>
               <au>
                  <snm>Jukes</snm>
                  <fnm>TH</fnm>
               </au>
               <au>
                  <snm>Cantor</snm>
                  <fnm>CR</fnm>
               </au>
            </aug>
            <source>Mammalian Protein Metabolism</source>
            <pubdate>1969</pubdate>
            <volume>3</volume>
            <fpage>21</fpage>
            <lpage>132</lpage>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Models of molecular evolution and phylogeny</p>
            </title>
            <aug>
               <au>
                  <snm>Lio</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Goldman</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>1998</pubdate>
            <volume>8</volume>
            <fpage>1233</fpage>
            <lpage>1244</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9872979</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Initial sequence of the chimpanzee genome and comparison with the human genome</p>
            </title>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>437</volume>
            <fpage>69</fpage>
            <lpage>87</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04072</pubid>
                  <pubid idtype="pmpid" link="fulltext">16136131</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>KaKs_Calculator: calculating Ka and Ks through model selection and model averaging</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>XQ</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genomics Proteomics Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>4</volume>
            <fpage>259</fpage>
            <lpage>263</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1672-0229(07)60007-2</pubid>
                  <pubid idtype="pmpid">17531802</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Evidence for natural selection on leukocyte immunoglobulin-like receptors for HLA class I in Northeast Asians</p>
            </title>
            <aug>
               <au>
                  <snm>Hirayasu</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ohashi</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Tanaka</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kashiwase</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ogawa</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Takanashi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Satake</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jia</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Chimge</snm>
                  <fnm>NO</fnm>
               </au>
               <au>
                  <snm>Sideltseva</snm>
                  <fnm>EW</fnm>
               </au>
               <etal/>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2008</pubdate>
            <volume>82</volume>
            <fpage>1075</fpage>
            <lpage>1083</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2427302</pubid>
                  <pubid idtype="pmpid" link="fulltext">18439545</pubid>
                  <pubid idtype="doi">10.1016/j.ajhg.2008.03.012</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>More genes underwent positive selection in chimpanzee evolution than in human evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Bakewell</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Shi</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <fpage>7489</fpage>
            <lpage>7494</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1863478</pubid>
                  <pubid idtype="pmpid" link="fulltext">17449636</pubid>
                  <pubid idtype="doi">10.1073/pnas.0701705104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>A scan for positively selected genes in the genomes of humans and chimpanzees</p>
            </title>
            <aug>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bustamante</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Glanowski</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sackton</snm>
                  <fnm>TB</fnm>
               </au>
               <au>
                  <snm>Hubisz</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Fledel-Alon</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Tanenbaum</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Civello</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>TJ</fnm>
               </au>
               <etal/>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>e170</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1088278</pubid>
                  <pubid idtype="pmpid" link="fulltext">15869325</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0030170</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Bayesian estimation of positively selected sites</p>
            </title>
            <aug>
               <au>
                  <snm>Huelsenbeck</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Dyer</snm>
                  <fnm>KA</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2004</pubdate>
            <volume>58</volume>
            <fpage>661</fpage>
            <lpage>672</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00239-004-2588-9</pubid>
                  <pubid idtype="pmpid" link="fulltext">15461423</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>A Dirichlet process model for detecting positive selection in protein-coding DNA sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Huelsenbeck</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Jain</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Frost</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Pond</snm>
                  <fnm>SL</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <fpage>6263</fpage>
            <lpage>6268</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1458866</pubid>
                  <pubid idtype="pmpid" link="fulltext">16606848</pubid>
                  <pubid idtype="doi">10.1073/pnas.0508279103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>A novel method for estimating substitution rate variation among sites in a large dataset of homologous DNA sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Pesole</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Saccone</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2001</pubdate>
            <volume>157</volume>
            <fpage>859</fpage>
            <lpage>865</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1461530</pubid>
                  <pubid idtype="pmpid">11157002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Site-to-site variation of synonymous substitution rates</p>
            </title>
            <aug>
               <au>
                  <snm>Pond</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Muse</snm>
                  <fnm>SV</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <fpage>2375</fpage>
            <lpage>2385</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msi232</pubid>
                  <pubid idtype="pmpid" link="fulltext">16107593</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Codon-substitution models for detecting molecular adaptation at individual sites along specific lineages</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <fpage>908</fpage>
            <lpage>917</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12032247</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Codon-substitution models for heterogeneous selection pressure at amino acid sites</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Goldman</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>AM</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2000</pubdate>
            <volume>155</volume>
            <fpage>431</fpage>
            <lpage>449</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1461088</pubid>
                  <pubid idtype="pmpid">10790415</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Codon-substitution models to detect adaptive evolution that account for heterogeneous selective pressures among site classes</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Swanson</snm>
                  <fnm>WJ</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <fpage>49</fpage>
            <lpage>57</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11752189</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Accuracy and power of the likelihood ratio test in detecting adaptive molecular evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Anisimova</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bielawski</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2001</pubdate>
            <volume>18</volume>
            <fpage>1585</fpage>
            <lpage>1592</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11470850</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Maximum likelihood methods for detecting adaptive evolution after gene duplication</p>
            </title>
            <aug>
               <au>
                  <snm>Bielawski</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>J Struct Funct Genomics</source>
            <pubdate>2003</pubdate>
            <volume>3</volume>
            <fpage>201</fpage>
            <lpage>212</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/A:1022642807731</pubid>
                  <pubid idtype="pmpid" link="fulltext">12836699</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene</p>
            </title>
            <aug>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1998</pubdate>
            <volume>148</volume>
            <fpage>929</fpage>
            <lpage>936</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1460041</pubid>
                  <pubid idtype="pmpid">9539414</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1998</pubdate>
            <volume>15</volume>
            <fpage>568</fpage>
            <lpage>573</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9580986</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>PAML 4: phylogenetic analysis by maximum likelihood</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2007</pubdate>
            <volume>24</volume>
            <fpage>1586</fpage>
            <lpage>1591</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msm088</pubid>
                  <pubid idtype="pmpid" link="fulltext">17483113</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>HyPhy: hypothesis testing using phylogenies</p>
            </title>
            <aug>
               <au>
                  <snm>Pond</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Frost</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Muse</snm>
                  <fnm>SV</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <fpage>676</fpage>
            <lpage>679</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti079</pubid>
                  <pubid idtype="pmpid" link="fulltext">15509596</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Inferring nonneutral evolution from human-chimp-mouse orthologous gene trios</p>
            </title>
            <aug>
               <au>
                  <snm>Clark</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Glanowski</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>PD</fnm>
               </au>
               <au>
                  <snm>Kejariwal</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Todd</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Tanenbaum</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Civello</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Murphy</snm>
                  <fnm>B</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>302</volume>
            <fpage>1960</fpage>
            <lpage>1963</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1088821</pubid>
                  <pubid idtype="pmpid" link="fulltext">14671302</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Genes under positive selection in Escherichia coli</p>
            </title>
            <aug>
               <au>
                  <snm>Petersen</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Bollback</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Dimmic</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hubisz</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2007</pubdate>
            <volume>17</volume>
            <fpage>1336</fpage>
            <lpage>1343</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1950902</pubid>
                  <pubid idtype="pmpid" link="fulltext">17675366</pubid>
                  <pubid idtype="doi">10.1101/gr.6254707</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Comparative genome sequencing of Drosophila pseudoobscura: chromosomal, gene, and cis-element evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Richards</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Bettencourt</snm>
                  <fnm>BR</fnm>
               </au>
               <au>
                  <snm>Hradecky</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Letovsky</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Thornton</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hubisz</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Meisel</snm>
                  <fnm>RP</fnm>
               </au>
               <etal/>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>1</fpage>
            <lpage>18</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540289</pubid>
                  <pubid idtype="pmpid" link="fulltext">15632085</pubid>
                  <pubid idtype="doi">10.1101/gr.3059305</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>A complete mitochondrial genome sequence of the wild two-humped camel (Camelus bactrianus ferus): an evolutionary history of camelidae</p>
            </title>
            <aug>
               <au>
                  <snm>Cui</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ji</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ding</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Qi</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gao</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Meng</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>241</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1939714</pubid>
                  <pubid idtype="pmpid" link="fulltext">17640355</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-8-241</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Genome analysis of the platypus reveals unique signatures of evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Warren</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Hillier</snm>
                  <fnm>LW</fnm>
               </au>
               <au>
                  <snm>Marshall Graves</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ponting</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Grutzner</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Belov</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Clarke</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Chinwalla</snm>
                  <fnm>AT</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2008</pubdate>
            <volume>453</volume>
            <fpage>175</fpage>
            <lpage>183</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature06936</pubid>
                  <pubid idtype="pmpid" link="fulltext">18464734</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>On the nature of human housekeeping genes</p>
            </title>
            <aug>
               <au>
                  <snm>Zhu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2008</pubdate>
            <volume>24</volume>
            <fpage>481</fpage>
            <lpage>484</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2008.08.004</pubid>
                  <pubid idtype="pmpid" link="fulltext">18786740</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>Post-genomic science: converting primary structure into physiological function</p>
            </title>
            <aug>
               <au>
                  <snm>Benner</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Trabesinger</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Schreiber</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Adv Enzyme Regul</source>
            <pubdate>1998</pubdate>
            <volume>38</volume>
            <fpage>155</fpage>
            <lpage>180</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0065-2571(97)00019-8</pubid>
                  <pubid idtype="pmpid">9762352</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Evaluation of methods for determination of a reconstructed history of gene sequence evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Liberles</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2001</pubdate>
            <volume>18</volume>
            <fpage>2040</fpage>
            <lpage>2047</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11606700</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>Episodic adaptive evolution of primate lysozymes</p>
            </title>
            <aug>
               <au>
                  <snm>Messier</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Stewart</snm>
                  <fnm>CB</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1997</pubdate>
            <volume>385</volume>
            <fpage>151</fpage>
            <lpage>154</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/385151a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">8990116</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>A sliding window-based method to detect selective constraints in protein-coding genes and its application to RNA viruses</p>
            </title>
            <aug>
               <au>
                  <snm>Fares</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Elena</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Ortiz</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Moya</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Barrio</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2002</pubdate>
            <volume>55</volume>
            <fpage>509</fpage>
            <lpage>521</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00239-002-2346-9</pubid>
                  <pubid idtype="pmpid" link="fulltext">12399925</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>A simple covarion-based approach to analyse nucleotide substitution rates</p>
            </title>
            <aug>
               <au>
                  <snm>Siltberg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Liberles</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>Journal of Evolutionary Biology</source>
            <pubdate>2002</pubdate>
            <volume>15</volume>
            <fpage>588</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1046/j.1420-9101.2002.00416.x</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B70">
            <title>
               <p>Tertiary windowing to detect positive diversifying selection</p>
            </title>
            <aug>
               <au>
                  <snm>Berglund</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Wallner</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Elofsson</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Liberles</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2005</pubdate>
            <volume>60</volume>
            <fpage>499</fpage>
            <lpage>504</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00239-004-0223-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">15883884</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B71">
            <title>
               <p>Three-dimensional window analysis for detecting positive selection at structural regions of proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Suzuki</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2004</pubdate>
            <volume>21</volume>
            <fpage>2352</fpage>
            <lpage>2359</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msh249</pubid>
                  <pubid idtype="pmpid" link="fulltext">15356273</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B72">
            <title>
               <p>NCBI HomoloGene</p>
            </title>
            <url>ftp://ftp.ncbi.nih.gov/pub/HomoloGene/</url>
         </bibl>
         <bibl id="B73">
            <title>
               <p>Estimating changes in mutational mechanisms of evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Ota</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Penny</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2003</pubdate>
            <volume>57</volume>
            <issue>Suppl 1</issue>
            <fpage>S233</fpage>
            <lpage>240</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00239-003-0032-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">15008420</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B74">
            <title>
               <p>Patterns of nucleotide substitution in mitochondrial protein coding genes of vertebrates</p>
            </title>
            <aug>
               <au>
                  <snm>Kumar</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1996</pubdate>
            <volume>143</volume>
            <fpage>537</fpage>
            <lpage>548</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1207285</pubid>
                  <pubid idtype="pmpid" link="fulltext">8722802</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
