<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1745-6150-3-9</ui>
   <ji>1745-6150</ji>
   <fm>
      <dochead>Discovery notes</dochead>
      <bibl>
         <title>
            <p>Endogenous retroviruses of the chicken genome</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Huda</snm>
               <fnm>Ahsan</fnm>
               <insr iid="I1"/>
               <email>ahsan.huda@gatech.edu</email>
            </au>
            <au id="A2">
               <snm>Polavarapu</snm>
               <fnm>Nalini</fnm>
               <insr iid="I1"/>
               <email>nalini@gatech.edu</email>
            </au>
            <au id="A3" ca="yes">
               <snm>Jordan</snm>
               <fnm>I King</fnm>
               <insr iid="I1"/>
               <email>king.jordan@biology.gatech.edu</email>
            </au>
            <au id="A4">
               <snm>McDonald</snm>
               <mi>F</mi>
               <fnm>John</fnm>
               <insr iid="I1"/>
               <email>john.mcdonald@biology.gatech.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>School of Biology, Georgia Institute of Technology, Atlanta, GA, USA</p>
            </ins>
         </insg>
         <source>Biology Direct</source>
         <issn>1745-6150</issn>
         <pubdate>2008</pubdate>
         <volume>3</volume>
         <issue>1</issue>
         <fpage>9</fpage>
         <url>http://www.biology-direct.com/content/3/1/9</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18361801</pubid>
               <pubid idtype="doi">10.1186/1745-6150-3-9</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>06</day>
               <month>3</month>
               <year>2008</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>24</day>
               <month>3</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>24</day>
               <month>3</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Huda et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p/>
               </st>
               <p>We analyzed the chicken (<it>Gallus gallus</it>) genome sequence to search for previously uncharacterized endogenous retrovirus (ERV) sequences using <it>ab initio </it>and combined evidence approaches. We discovered 11 novel families of ERVs that occupy more than 21 million base pairs, approximately 2%, of the chicken genome. These novel families include a number of recently active full-length elements possessing identical long terminal repeats (LTRs) as well as intact <it>gag </it>and <it>pol </it>open reading frames. The abundance and diversity of chicken ERVs we discovered underscore the utility of an approach that combines multiple methods for the identification of interspersed repeats in vertebrate genomes.</p>
            </sec>
            <sec>
               <st>
                  <p>Reviewers</p>
               </st>
               <p>This article was reviewed by Igor Zhulin and Itai Yanai.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Findings</p>
         </st>
         <p>Chicken, a modern descendant of the dinosaurs, is the first avian to have its genome sequenced <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Phylogenetically, its position between fish and mammals provides valuable insight into the evolution of vertebrates. The chicken genome has a size of 1.2 billion bases, approximately one third of the size of the human genome.</p>
         <p>The overall interspersed repeat, <it>i.e. </it>transposable element (TE), content of the chicken genome was determined to be less than 9% <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. This fraction is considerably lower than that of mammalian genomes, where TEs account for 40&#8211;50% of genomic sequences <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. While chicken has long been a model system for the study of retroviruses <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, a mere 1.3% of the chicken genome can be classified as endogenous retroviruses (ERVs) compared to about 5% in humans <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. Nevertheless, protein coding sequences still make up only a minor fraction of the chicken genome, leaving a substantial portion that has yet to be accounted for. The authors of the initial analysis of the chicken genome posited that much of the uncharacterized sequence was likely to be derived from unrecognized TEs <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Indeed, novel or previously uncharacterized TE sequences may be missed by homology-based methods for the detection of repeats, such as the widely used RepeatMasker program <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, which rely on the comparison of genomic sequences to libraries of known repeat consensus sequences. <it>Ab initio </it>methods, on the other hand, identify repeats by virtue of their structural characteristics without regard to any sequence similarity to known elements. We used a combination of <it>ab initio </it>detection, sequence similarity searches, motif identification and evaluation of element structural (repeat) features to search for novel ERVs that may have been missed in the initial analysis of the chicken genome.</p>
         <p>LTR_STRUC was the first <it>ab initio </it>program designed to detect long terminal repeat (LTR) containing elements, such as ERVs, in genomic sequence <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Briefly, LTR_STRUC works by sliding a window along genomic sequence and looking for direct repeats that are spaced apart within a specified size range (<it>e.g. </it>5&#8211;10 kb). After identifying putative LTRs in this way, it searches for other characteristic features of LTR elements such as target site duplications, inverted repeats at LTR termini, primer binding sites and polypurine tracts. Based on these features, it predicts the orientation of the LTR element and provides the corresponding three-frame translation of the reverse transcriptase (RT) sequence in the internal region of the element. LTR_STRUC has proven effective at identifying novel LTR elements, including ERVs, in chimpanzee <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, mouse <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> and rice genome sequences <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>.</p>
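         <p>The windowed search for spaced direct repeats described above can be illustrated with a minimal Python sketch. This is not the LTR_STRUC implementation: the function name, the exact-match seeding strategy and the parameters <it>seed_len</it>, <it>min_gap</it> and <it>max_gap</it> are assumptions made for illustration, and the default values are scaled for a toy sequence rather than for the kilobase spacing mentioned in the text.</p>

```python
def find_direct_repeat_pairs(seq, seed_len=8, min_gap=20, max_gap=100):
    """Return (left_start, right_start) pairs of identical seeds.

    A pair is reported when the same seed_len-mer occurs twice and the
    distance between the two start positions lies in [min_gap, max_gap].
    Exact seed matching is an illustrative simplification; real LTR
    pairs may have diverged and need approximate matching.
    """
    seq = seq.upper()
    seen = {}   # seed -> start positions already visited
    pairs = []
    for i in range(len(seq) - seed_len + 1):
        seed = seq[i:i + seed_len]
        for j in seen.get(seed, []):
            distance = i - j
            if max_gap >= distance >= min_gap:
                pairs.append((j, i))
        seen.setdefault(seed, []).append(i)
    return pairs


# Toy element: two identical LTR copies around a short internal region.
toy_ltr = "TGTACGGATCCGCA"
toy_internal = "ATATGCGCTTAAGGCCTTAGCAGTCAAGTC"
toy_seq = "GG" + toy_ltr + toy_internal + toy_ltr + "CC"
toy_pairs = find_direct_repeat_pairs(toy_seq)
print(toy_pairs[:3])
```

         <p>Each reported pair marks a candidate 5' and 3' repeat copy. A real search would extend the seeds into full alignments and then test the candidates for the additional LTR element features listed above.</p>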
         <p>LTR_STRUC was run on the 2004 build of the chicken genome sequence, <it>i.e. </it>the v1.0 draft assembly from the Washington University Genome Sequencing Center <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> distributed on the UCSC Genome Browser <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, resulting in the detection of 39 putative full-length LTR elements. RT homologous sequences were identified in these elements and used as queries in TBLASTN <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> searches against the chicken genome sequence. The BLAST output and flanking genomic sequences were visually inspected to look for ERV characteristic features such as LTRs, target site repeats and terminal inverted repeats. LTRs are direct repeats at the 5' and 3' termini of the ERVs that are ~200&#8211;350 bp in length. Characteristic dinucleotide terminal inverted repeats are found at the beginning (TG) and end (CA) of ERV LTRs. Target site repeats are short (4&#8211;6 bp) direct repeats found immediately upstream and downstream of ERV insertions; they result from repair of the staggered break that is made when an element integrates into the genome. We identified a total of 89 putative ERVs in the genome using the combined <it>ab initio</it>, sequence similarity and element feature detection approach. The presence of intact open reading frames encoding sequences with significant similarity to RT, together with the canonical RT catalytic motif <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, was used to validate 61 of these cases as intact full-length ERVs.</p>
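         <p>The element features described above were inspected visually in this study. As a hedged illustration of what those checks amount to, the following Python sketch tests an LTR for the TG and CA terminal dinucleotides and looks for a 4 to 6 bp target site repeat flanking an insertion. The function names, the half-open coordinate convention and the requirement of exact flank identity are assumptions made for this example and are not part of the original analysis.</p>

```python
def has_tg_ca_termini(ltr):
    """True if the LTR starts with TG and ends with CA."""
    ltr = ltr.upper()
    return ltr.startswith("TG") and ltr.endswith("CA")


def target_site_repeat(genome, element_start, element_end,
                       min_len=4, max_len=6):
    """Return the longest identical flank of 4 to 6 bp, or None.

    element_start is the index of the first element base and
    element_end is one past the last element base (half-open).
    """
    genome = genome.upper()
    for size in range(max_len, min_len - 1, -1):
        fits_left = element_start - size >= 0
        fits_right = len(genome) >= element_end + size
        if fits_left and fits_right:
            upstream = genome[element_start - size:element_start]
            downstream = genome[element_end:element_end + size]
            if upstream == downstream:
                return upstream
    return None


# Toy insertion: flank + repeat + element + repeat + flank.
demo_ltr = "TGTACGGATCCGCA"
demo_element = demo_ltr + "ATATGCGCTTAAGG" + demo_ltr
demo_repeat = "ACGTA"
demo_genome = "CCGG" + demo_repeat + demo_element + demo_repeat + "TTAA"
demo_start = 4 + len(demo_repeat)
demo_end = demo_start + len(demo_element)

print(has_tg_ca_termini(demo_ltr))
print(target_site_repeat(demo_genome, demo_start, demo_end))
```

         <p>In practice, element boundaries are rarely known to the base, so a usable check would scan a few positions around the predicted termini rather than rely on exact coordinates.</p>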
         <p>Phylogenetic analysis of an RT nucleotide sequence alignment was used to classify the chicken ERV sequences that we identified. ERV phylogenies were built using the neighbor-joining and maximum parsimony methods implemented in the program MEGA <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> and maximum likelihood using the program PhyML <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. For neighbor-joining and maximum parsimony, 1,000 bootstrap replicates were run to assess the support for internal branches on the phylogeny, and the approximate likelihood ratio test <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> was used to evaluate the support for branches along the maximum likelihood tree. The ERV phylogeny shows a number of well resolved groups that correspond to 14 distinct families of chicken ERVs, 11 of which are described here for the first time (Figure <figr fid="F1">1</figr>). In the absence of a standard naming convention for viral families in the chicken genome, we named the families using GGERVNN, for <it>Gallus gallus </it>endogenous retrovirus followed by the family number. We also reported the new families to Repbase <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> where they constitute nearly half (8 out of 17) of all the ERV families known for the chicken genome.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Chicken endogenous retrovirus families</p>
            </caption>
            <text>
               <p><b>Chicken endogenous retrovirus families</b>. Phylogenetic analysis of an RT multiple sequence alignment for all full-length elements was used to delineate chicken ERV families. The neighbor-joining phylogeny is shown; maximum parsimony and maximum likelihood trees were also reconstructed. The names of the taxa (ERV sequences) correspond to the chicken chromosome number, strand, start and end coordinates from the May 2006 build, v2.1 draft assembly from the Washington University Genome Sequencing Center, found on the UCSC Genome Browser. Family names and characteristics for the 11 novel ERV families discovered here are shown below the tree. Family copy numbers are indicated along with the family averages of intra-element percent identity between 5' and 3' LTRs and their age ranges (lower-to-upper bounds). For each family, percent support values are shown for the internal branch that subtends the family based on bootstrap analysis, for neighbor-joining and maximum parsimony, and the approximate likelihood ratio test for maximum likelihood.</p>
            </text>
            <graphic file="1745-6150-3-9-1"/>
         </fig>
         <p>The 11 new ERV families we discovered using LTR_STRUC and BLAST analysis include 48 full-length elements and 1,542 fragmented sequences, most of which are solo LTRs that result from intra-element LTR-LTR recombination. When representatives of the 11 novel families were used to search the chicken genome for homologous sequences using the RepeatMasker program <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, they hit ~21 megabases of ERV sequence, or 2.0% of the genome. Together, the previously characterized and newly characterized ERVs represent more than 30 megabases of sequence and 2.9% of the chicken genome, a substantial increase over the previous figure of 1.3% of ERV sequences.</p>
         <p>GGERV21, GGERV22 and GGERV30 are the most abundant lineages and account for more than half of all the viral sequences in the genome. However, only a few full-length elements were found for these abundant families; most of their sequences exist as fragments or solo LTRs. These abundant families are most closely related to the Birdawg and Kronos LTR elements previously identified as high copy number elements using cot-based sequencing and analysis of the chicken genome <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. However, we did not identify any full-length elements corresponding to the Hitchcock or Soprano LTR elements identified in the same study.</p>
         <p>The LTRs at the 5' and 3' ends of a full-length ERV genomic sequence are generated from a single template during reverse transcription of RNA into DNA <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. Therefore, at the moment that a full-length ERV integrates into the genome, its 5' and 3' LTRs are expected to be identical in sequence, and intra-element sequence differences between LTRs can be used to estimate the time that has elapsed since an element was active <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. The ages of chicken ERVs were estimated in this way using the formula <it>t </it>= <it>d</it>/2<it>r</it>, where <it>t </it>is the time since insertion, <it>d </it>is the nucleotide sequence divergence per site between 5' and 3' LTRs of a single element and <it>r </it>is the rate of nucleotide substitution per site per million years. The value of <it>r </it>used here, 7.5 &#215; 10<sup>-4</sup>, is based on comparisons of nuclear genes among four avian taxa <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>.</p>
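         <p>As a worked illustration of the formula above, the following Python sketch computes the divergence between two aligned LTRs and converts it to an insertion age. Treating <it>d </it>as the uncorrected proportion of differing sites is an assumption made here for simplicity (the text does not state whether a corrected distance was used), and the function names are hypothetical. The default rate is the value of <it>r </it>given in the text.</p>

```python
def p_distance(ltr5, ltr3):
    """Proportion of differing sites between two aligned LTRs.

    Both sequences must have the same aligned length; columns that
    contain a gap character in either sequence are skipped.
    """
    if len(ltr5) != len(ltr3):
        raise ValueError("LTR sequences must be aligned to equal length")
    columns = [(a, b) for a, b in zip(ltr5.upper(), ltr3.upper())
               if a != "-" and b != "-"]
    if not columns:
        raise ValueError("no comparable sites")
    mismatches = sum(1 for a, b in columns if a != b)
    return mismatches / len(columns)


def insertion_age_my(d, r=7.5e-4):
    """Time since insertion in million years, t = d / 2r."""
    return d / (2 * r)


# Hypothetical 300 bp LTR pair that differs at a single site.
d_example = 1 / 300
print(round(insertion_age_my(d_example), 2))
```

         <p>The factor of 2 reflects that both LTR copies accumulate substitutions independently after integration, so the observed divergence between them is twice the per-lineage change.</p>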
         <p>Age ranges for the 11 novel ERV families we detected are shown in Figure <figr fid="F1">1</figr>. The youngest family of chicken ERVs is GGERV10, which includes 10 full-length elements with 5' and 3' LTRs that are either identical or differ by only 1 bp. Members of the GGERV10 family integrated 0&#8211;3 million years ago. Full-length GGERV10 family members encode a ~1,600 base pair intact <it>gag </it>open reading frame (ORF) and a ~3,300 base pair <it>pol </it>ORF that encodes a polyprotein with homology to the protease, RT, RNase H and integrase enzymes required for retroviral replication and integration. In other words, GGERV10 family members are potentially active ERVs that were integrated into the chicken genome very recently. Incidentally, the GGERV10 family is substantially younger than the GGERVLA (Figure <figr fid="F1">1</figr>) family that was previously described as the most recently active family in the genome <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>.</p>
         <p>The next youngest family is GGERV29, with elements that inserted 0&#8211;17.9 million years ago, and the oldest family we identified is GGERV24 at 136.5&#8211;143.9 million years old. This wide range of ages encompasses all newly discovered and previously characterized chicken ERVs. Even though the <it>ab initio </it>approach we used is best suited to find relatively young elements with readily identifiable structural features (<it>i.e. </it>LTRs), it was able to detect families that were active more than one hundred million years apart.</p>
         <p>Using a combined evidence approach that integrates <it>ab initio </it>element detection with sequence similarity searches, motif identification and evaluation of element features, we detected 11 novel ERV families covering more than 21 megabases of previously uncharacterized chicken genome sequence. Several of these families are fairly ancient, consistent with the expectation that degenerated element sequences may be missed by homology-based detection methods. However, a number of the ERVs we identified are members of young families that have been active very recently in the chicken genome. These results underscore the importance of integrating multiple methods <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> for the detection of interspersed repeats in eukaryotic genomes.</p>
      </sec>
      <sec>
         <st>
            <p>Reviewers' comments</p>
         </st>
         <sec>
            <st>
               <p>Reviewer's report 1</p>
            </st>
            <p>
               <it>Igor Zhulin, University of Tennessee and Oak Ridge National Laboratory</it>
            </p>
            <p>This is an interesting discovery of novel viral families in the chicken genome, which accounts for more than 2% of the genome sequence. I do not have any major concerns regarding this paper and support its publication; however, I would like to offer some comments for authors' consideration, mainly regarding the clarity and presentation.</p>
         </sec>
         <sec>
            <st>
               <p>Authors' response</p>
            </st>
            <p>We are grateful to the reviewer Dr. Igor Zhulin for providing a number of very specific and constructive comments regarding the clarity and presentation of the manuscript. We revised the paper according to his suggestions.</p>
         </sec>
         <sec>
            <st>
               <p>Reviewer's report 2</p>
            </st>
            <p>
               <it>Itai Yanai, Harvard University</it>
            </p>
            <p>I support publication of this manuscript.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The author(s) declare that they have no competing interests.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>AH and NP implemented LTR_STRUC on the chicken genome. AH performed all other sequence analyses and the phylogenetic analysis under the supervision of JFM and IKJ. AH and IKJ drafted the manuscript. All authors reviewed and approved the final version of the manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>AH, NP and IKJ are supported by the School of Biology, Georgia Institute of Technology. JFM and NP were supported by a grant from the Georgia Tech Research Foundation. We would like to thank members of the McDonald lab and Jordan lab for their support and technical assistance.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Consortium</snm>
                  <fnm>ICGS</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>432</volume>
            <fpage>695</fpage>
            <lpage>716</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature03154</pubid>
                  <pubid idtype="pmpid" link="fulltext">15592404</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Initial sequence of the chimpanzee genome and comparison with the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Consortium</snm>
                  <fnm>CSaA</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>437</volume>
            <fpage>69</fpage>
            <lpage>87</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04072</pubid>
                  <pubid idtype="pmpid" link="fulltext">16136131</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Initial sequencing and analysis of the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Linton</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Birren</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Nusbaum</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Zody</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Baldwin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Devon</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dewar</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Doyle</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>FitzHugh</snm>
                  <fnm>W</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>409</volume>
            <fpage>860</fpage>
            <lpage>921</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35057062</pubid>
                  <pubid idtype="pmpid" link="fulltext">11237011</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Initial sequencing and comparative analysis of the mouse genome</p>
            </title>
            <aug>
               <au>
                  <snm>Waterston</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Lindblad-Toh</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Abril</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Agarwal</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Agarwala</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ainscough</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Alexandersson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>An</snm>
                  <fnm>P</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>420</volume>
            <fpage>520</fpage>
            <lpage>562</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01262</pubid>
                  <pubid idtype="pmpid" link="fulltext">12466850</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>A transmissible avian neoplasm</p>
            </title>
            <aug>
               <au>
                  <snm>Rous</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>J Exp Med</source>
            <pubdate>1910</pubdate>
            <volume>12</volume>
            <fpage>696</fpage>
            <lpage>705</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1084/jem.12.5.696</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>RepeatMasker Open-3.0</p>
            </title>
            <url>http://www.repeatmasker.org</url>
         </bibl>
         <bibl id="B7">
            <title>
               <p>LTR_STRUC: a novel search and identification program for LTR retrotransposons</p>
            </title>
            <aug>
               <au>
                  <snm>McCarthy</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>McDonald</snm>
                  <fnm>JF</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <fpage>362</fpage>
            <lpage>367</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btf878</pubid>
                  <pubid idtype="pmpid" link="fulltext">12584121</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Identification, characterization and comparative genomics of chimpanzee endogenous retroviruses</p>
            </title>
            <aug>
               <au>
                  <snm>Polavarapu</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Bowen</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>McDonald</snm>
                  <fnm>JF</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>R51</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1779541</pubid>
                  <pubid idtype="pmpid" link="fulltext">16805923</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Long terminal repeat retrotransposons of Mus musculus</p>
            </title>
            <aug>
               <au>
                  <snm>McCarthy</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>McDonald</snm>
                  <fnm>JF</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>R14</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">395764</pubid>
                  <pubid idtype="pmpid" link="fulltext">15003117</pubid>
                  <pubid idtype="doi">10.1186/gb-2004-5-3-r14</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Long terminal repeat retrotransposons of Oryza sativa</p>
            </title>
            <aug>
               <au>
                  <snm>McCarthy</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lizhi</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>McDonald</snm>
                  <fnm>JF</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>RESEARCH0053</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">134482</pubid>
                  <pubid idtype="pmpid" link="fulltext">12372141</pubid>
                  <pubid idtype="doi">10.1186/gb-2002-3-10-research0053</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>The human genome browser at UCSC</p>
            </title>
            <aug>
               <au>
                  <snm>Kent</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Sugnet</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Furey</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Roskin</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Pringle</snm>
                  <fnm>TH</fnm>
               </au>
               <au>
                  <snm>Zahler</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <fpage>996</fpage>
            <lpage>1006</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">186604</pubid>
                  <pubid idtype="pmpid" link="fulltext">12045153</pubid>
                  <pubid idtype="doi">10.1101/gr.229102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Schaffer</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <fpage>3389</fpage>
            <lpage>3402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">146917</pubid>
                  <pubid idtype="pmpid" link="fulltext">9254694</pubid>
                  <pubid idtype="doi">10.1093/nar/25.17.3389</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Origin and evolution of retroelements based upon their reverse transcriptase sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Xiong</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Eickbush</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Embo J</source>
            <pubdate>1990</pubdate>
            <volume>9</volume>
            <fpage>3353</fpage>
            <lpage>3362</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">552073</pubid>
                  <pubid idtype="pmpid">1698615</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>MEGA3: Integrated software for Molecular Evolutionary Genetics Analysis and sequence alignment</p>
            </title>
            <aug>
               <au>
                  <snm>Kumar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tamura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Brief Bioinform</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>150</fpage>
            <lpage>163</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bib/5.2.150</pubid>
                  <pubid idtype="pmpid" link="fulltext">15260895</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>PHYML Online &#8211; a web server for fast maximum likelihood-based phylogenetic inference</p>
            </title>
            <aug>
               <au>
                  <snm>Guindon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lethiec</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Duroux</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Gascuel</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>W557</fpage>
            <lpage>W559</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1160113</pubid>
                  <pubid idtype="pmpid" link="fulltext">15980534</pubid>
                  <pubid idtype="doi">10.1093/nar/gki352</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Approximate likelihood-ratio test for branches: A fast, accurate, and powerful alternative</p>
            </title>
            <aug>
               <au>
                  <snm>Anisimova</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gascuel</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Syst Biol</source>
            <pubdate>2006</pubdate>
            <volume>55</volume>
            <fpage>539</fpage>
            <lpage>552</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/10635150600755453</pubid>
                  <pubid idtype="pmpid" link="fulltext">16785212</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Repbase Update, a database of eukaryotic repetitive elements</p>
            </title>
            <aug>
               <au>
                  <snm>Jurka</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kapitonov</snm>
                  <fnm>VV</fnm>
               </au>
               <au>
                  <snm>Pavlicek</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Klonowski</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kohany</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Walichiewicz</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Cytogenet Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>110</volume>
            <fpage>462</fpage>
            <lpage>467</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1159/000084979</pubid>
                  <pubid idtype="pmpid" link="fulltext">16093699</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>The repetitive landscape of the chicken genome</p>
            </title>
            <aug>
               <au>
                  <snm>Wicker</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Robertson</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Schulze</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Feltus</snm>
                  <fnm>FA</fnm>
               </au>
               <au>
                  <snm>Magrini</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Morrison</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Mardis</snm>
                  <fnm>ER</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Peterson</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Paterson</snm>
                  <fnm>AH</fnm>
               </au>
               <au>
                  <snm>Ivarie</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>126</fpage>
            <lpage>136</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540276</pubid>
                  <pubid idtype="pmpid" link="fulltext">15256510</pubid>
                  <pubid idtype="doi">10.1101/gr.2438005</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>The steps of reverse transcription of Drosophila mobile dispersed genetic elements and U3-R-U5 structure of their LTRs</p>
            </title>
            <aug>
               <au>
                  <snm>Arkhipova</snm>
                  <fnm>IR</fnm>
               </au>
               <au>
                  <snm>Mazo</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Cherkasova</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Gorelova</snm>
                  <fnm>TV</fnm>
               </au>
               <au>
                  <snm>Schuppe</snm>
                  <fnm>NG</fnm>
               </au>
               <au>
                  <snm>Ilyin</snm>
                  <fnm>YV</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1986</pubdate>
            <volume>44</volume>
            <fpage>555</fpage>
            <lpage>563</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0092-8674(86)90265-5</pubid>
                  <pubid idtype="pmpid">2418981</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Nested retrotransposons in the intergenic regions of the maize genome</p>
            </title>
            <aug>
               <au>
                  <snm>SanMiguel</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Tikhonov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Jin</snm>
                  <fnm>YK</fnm>
               </au>
               <au>
                  <snm>Motchoulskaia</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Zakharov</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Melake-Berhan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Springer</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>Edwards</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Avramova</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Bennetzen</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1996</pubdate>
            <volume>274</volume>
            <fpage>765</fpage>
            <lpage>768</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.274.5288.765</pubid>
                  <pubid idtype="pmpid" link="fulltext">8864112</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Continental breakup and the ordinal diversification of birds and mammals</p>
            </title>
            <aug>
               <au>
                  <snm>Hedges</snm>
                  <fnm>SB</fnm>
               </au>
               <au>
                  <snm>Parker</snm>
                  <fnm>PH</fnm>
               </au>
               <au>
                  <snm>Sibley</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Kumar</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1996</pubdate>
            <volume>381</volume>
            <fpage>226</fpage>
            <lpage>229</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/381226a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">8622763</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Combined evidence annotation of transposable elements in genome sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Quesneville</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Bergman</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Andrieu</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Autard</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Nouaud</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Anxolabehere</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>PLoS Comput Biol</source>
            <pubdate>2005</pubdate>
            <volume>1</volume>
            <fpage>166</fpage>
            <lpage>175</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1185648</pubid>
                  <pubid idtype="pmpid" link="fulltext">16110336</pubid>
                  <pubid idtype="doi">10.1371/journal.pcbi.0010022</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
