<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1745-6150-4-29</ui>
   <ji>1745-6150</ji>
   <fm>
      <dochead>Hypothesis</dochead>
      <bibl>
         <title>
            <p>Prokaryotic homologs of Argonaute proteins are predicted to function as key components of a novel system of defense against mobile genetic elements</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Makarova</snm>
               <mi>S</mi>
               <fnm>Kira</fnm>
               <insr iid="I1"/>
               <email>makarova@ncbi.nlm.nih.gov</email>
            </au>
            <au id="A2">
               <snm>Wolf</snm>
               <mi>I</mi>
               <fnm>Yuri</fnm>
               <insr iid="I1"/>
               <email>wolf@ncbi.nlm.nih.gov</email>
            </au>
            <au id="A3">
               <snm>van der Oost</snm>
               <fnm>John</fnm>
               <insr iid="I2"/>
               <email>john.vanderoost@wur.nl</email>
            </au>
            <au ca="yes" id="A4">
               <snm>Koonin</snm>
               <mi>V</mi>
               <fnm>Eugene</fnm>
               <insr iid="I1"/>
               <email>koonin@ncbi.nlm.nih.gov</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>National Center for Biotechnology Information, NLM, National Institutes of Health, Bethesda, Maryland 20894, USA</p>
            </ins>
            <ins id="I2">
               <p>Laboratory of Microbiology, Department of Agrotechnology and Food Sciences, Wageningen University, Dreijenplein 10, 6703 HB Wageningen, Netherlands</p>
            </ins>
         </insg>
         <source>Biology Direct</source>
         <issn>1745-6150</issn>
         <pubdate>2009</pubdate>
         <volume>4</volume>
         <issue>1</issue>
         <fpage>29</fpage>
         <url>http://www.biology-direct.com/content/4/1/29</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/1745-6150-4-29</pubid>
               <pubid idtype="pmpid">19706170</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>21</day>
               <month>8</month>
               <year>2009</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>25</day>
               <month>8</month>
               <year>2009</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>25</day>
               <month>8</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Makarova et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>In eukaryotes, RNA interference (RNAi) is a major mechanism of defense against viruses and transposable elements as well as of regulating translation of endogenous mRNAs. The RNAi systems recognize the target RNA molecules via small guide RNAs that are completely or partially complementary to a region of the target. Key components of the RNAi systems are proteins of the Argonaute-PIWI family, some of which function as slicers, the nucleases that cleave the target RNA that is base-paired to a guide RNA. Numerous prokaryotes possess the CRISPR-associated system (CASS) of defense against phages and plasmids that is, in part, mechanistically analogous but not homologous to eukaryotic RNAi systems. Many prokaryotes also encode homologs of Argonaute-PIWI proteins, but their functions remain unknown.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We present a detailed analysis of Argonaute-PIWI protein sequences and the genomic neighborhoods of the respective genes in prokaryotes. Whereas eukaryotic Ago/PIWI proteins always contain PAZ (oligonucleotide binding) and PIWI (active or inactivated nuclease) domains, the prokaryotic Argonaute homologs (pAgos) fall into two major groups in which the PAZ domain is either present or absent. The monophyly of each group is supported by a phylogenetic analysis of the conserved PIWI-domains. Almost all pAgos that lack a PAZ domain appear to be inactivated, and the respective genes are associated with a variety of predicted nucleases in putative operons. An additional, uncharacterized domain that is fused to various nucleases appears to be a unique signature of operons encoding the short (lacking PAZ) pAgo form. By contrast, almost all PAZ-domain containing pAgos are predicted to be active nucleases. Some proteins of this group (e.g., that from <it>Aquifex aeolicus</it>) have been experimentally shown to possess nuclease activity, and are not typically associated with genes for other (putative) nucleases. Given these observations, the apparent extensive horizontal transfer of pAgo genes, and their common, statistically significant over-representation in genomic neighborhoods enriched in genes encoding proteins involved in the defense against phages and/or plasmids, we hypothesize that pAgos are key components of a novel class of defense systems. The PAZ-domain containing pAgos are predicted to directly destroy virus or plasmid nucleic acids via their nuclease activity, whereas the apparently inactivated, PAZ-lacking pAgos could be structural subunits of protein complexes that contain, as active moieties, the putative nucleases that we predict to be co-expressed with these pAgos. All these nucleases are predicted to be DNA endonucleases, so it seems most probable that the putative novel phage/plasmid-defense system targets phage DNA rather than mRNAs. 
Given that in eukaryotic RNAi systems, the PAZ domain binds a guide RNA and positions it on the complementary region of the target, we further speculate that pAgos function on a similar principle (the guide being either DNA or RNA), and that the uncharacterized domain found in putative operons with the short forms of pAgos is a functional substitute for the PAZ domain.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The hypothesis that pAgos are key components of a novel prokaryotic immune system that employs guide RNA or DNA molecules to degrade nucleic acids of invading mobile elements implies a functional analogy with the prokaryotic CASS and a direct evolutionary connection with eukaryotic RNAi. The predictions of the hypothesis, including both the activities of pAgos and those of the associated endonucleases, are readily amenable to experimental tests.</p>
            </sec>
            <sec>
               <st>
                  <p>Reviewers</p>
               </st>
               <p>This article was reviewed by Daniel Haft, Martijn Huynen, and Chris Ponting.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification id="endnote" subtype="user_supplied_xml" type="bmc"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The discovery of elaborate and versatile systems of RNA-mediated gene silencing in eukaryotes is one of the pivotal advances in biology of the last decade <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. There are three major, distinct forms of regulatory small RNAs involved in eukaryotic gene silencing: small interfering (si) RNAs, micro (mi) RNAs, and PIWI-associated (pi) RNAs (previously referred to as rasiRNAs) <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. The siRNAs are derived from double-stranded RNAs of viruses and transposable elements, which are processed by Dicer, one of the essential components of the RNA-Induced Silencing Complexes (RISCs) <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. Dicer cleaves long dsRNA molecules into short, 21&#8211;22 nucleotide duplexes; these are subsequently unwound, and the guide strand is loaded on another crucial component of RISC, the Argonaute (Ago) slicer nuclease. The Ago-siRNA complex then binds to the target mRNA, which is cleaved by the PIWI domain of Ago, after which the mRNA fragments are released and the RISC-siRNA catalytic complex is recycled <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>.</p>
         <p>Variant, paralogous Dicers and Argonautes are involved in the mechanisms of the other classes of small RNA, such as miRNA and piRNA <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Unlike the siRNAs, 21&#8211;25 nt-long miRNAs are encoded in eukaryotic genomes and are either perfectly (in plants) or imperfectly (in animals) complementary to sequences in the 3'-untranslated regions of specific endogenous mRNAs <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Base-pairing of miRNAs with the target mRNAs, which is mediated by a distinct form of RISC, results either in RNA cleavage or in down-regulation of translation without cleavage <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Evidence is rapidly accumulating that numerous miRNAs in animals and plants are major players in developmental regulation and chromatin remodeling <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>.</p>
         <p>Dicer and Argonaute are the core components of RISCs. Dicer is a multi-domain protein that typically consists of a DEXD/H-type helicase domain fused with an RNA-binding PAZ domain, two RNAse III domains, and in some cases a dsRNA-binding domain <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. The Argonaute protein is composed of four domains, including the PAZ RNA-binding domain and the PIWI family endonuclease, and performs the slicer function <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. Both Dicer and Argonaute are represented by variable numbers of paralogs in eukaryotes, and different paralogs are included in RISCs with distinct functions <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>.</p>
         <p>Prokaryotes possess apparent functional counterparts to the miRNA system, that is, regulation of bacterial gene expression by small antisense RNAs. The best characterized of these pathways employ the RNA-binding protein Hfq for small RNA presentation and RNAse E for target degradation <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. <it>Escherichia coli </it>appears to encode ~60 microRNA genes <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>, and comparable numbers of expressed, small antisense RNAs have been detected in the archaea <it>Archaeoglobus fulgidus </it><abbrgrp><abbr bid="B20">20</abbr></abbrgrp> and <it>Sulfolobus solfataricus </it><abbrgrp><abbr bid="B8">8</abbr></abbrgrp> suggesting an important role of this regulatory mechanism in prokaryotic physiology. In addition, small antisense RNAs have been shown to regulate plasmid replication and to kill plasmid-free bacterial cells by silencing specific plasmid genes <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>.</p>
         <p>The recently discovered major prokaryotic phage/plasmid defense system, the CRISPR-associated system (CASS) [<abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>; Waters, 2009], also relies on guide RNA that apparently targets invader DNA <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. The hallmark of the CASS is that this system encompasses a still poorly understood mechanism for integrating fragments of bacteriophage DNA into a specific site within the CRISPR repeat cassette; at least in part, integration of these fragments is probably mediated by the Cas1 protein, which has been predicted <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B25">25</abbr></abbrgrp> and more recently experimentally demonstrated to possess DNAse activity <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. The unique, phage/plasmid-specific CRISPR inserts are then transcribed and processed to guide RNAs that are directed to the target DNA by the Cascade complex, which (in <it>Escherichia coli </it>K12) consists of 5 Cas proteins and seems to be a functional analog of the RISC <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. Despite general functional analogies, the molecular mechanisms of CASS and eukaryotic RNAi are distinct, and the protein components of the two systems are not homologous <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B28">28</abbr></abbrgrp>.</p>
         <p>Many archaea and bacteria do encode homologs of the major protein components of eukaryotic RNAi, in particular, Argonaute-PIWI family proteins and the helicase and RNAse III domains of Dicer, although the fusion of these domains in a single protein appears to be a eukaryotic signature <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. The crystal structures of Argonaute homologs from two thermophilic bacteria <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp> and two archaea <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp> have been solved, and the structures appear to be very similar to those of eukaryotic Argonautes <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. However, the functions of the prokaryotic Argonaute homologs (hereinafter pAgo) remain obscure, despite the <it>in vitro </it>demonstration of the RNAse H-like ribonuclease activity (cleavage of RNA in a DNA/RNA duplex) of the pAgos from the bacteria <it>Aquifex aeolicus </it><abbrgrp><abbr bid="B35">35</abbr></abbrgrp> and <it>Thermus thermophilus </it><abbrgrp><abbr bid="B36">36</abbr></abbrgrp>.</p>
         <p>Here, we apply comparative genomics and in-depth computational analysis of Argonaute-PIWI family proteins and other proteins that are typically encoded in their genomic neighborhoods to predict the biological functions of pAgo. We present a hypothesis that the prokaryotic Argonautes are key components of a novel class of virus/plasmid defense systems.</p>
      </sec>
      <sec>
         <st>
            <p>Results and Discussion</p>
         </st>
         <sec>
            <st>
               <p>Prokaryotic Argonaute homologs belong to two major groups based on the presence or absence of the PAZ domain</p>
            </st>
            <p>To identify all prokaryotic Argonaute homologs, we performed a PSI-BLAST search against the NCBI non-redundant protein sequence database using the sequence of the PIWI domain (the most highly conserved domain in the Argonaute family proteins) from the <it>Thermus thermophilus </it>HB27 pAgo (TT_P0026, pdb: <ext-link ext-link-id="3DLB" ext-link-type="pdb">3DLB</ext-link>; PIWI domain at amino acid positions 415&#8211;685). The search was run until convergence (after the 3<sup>rd </sup>iteration) and identified 100 sequences, some of which were fragmented or truncated proteins; additional searches started with some of the detected proteins showed that this sequence set represents the full complement of PIWI-domain proteins (pAgos) encoded in currently available prokaryotic genomes. For more detailed analysis, we selected 85 sequences from 80 genomes (the genomes of the bacterium <it>Parvularcula bermudensis </it>HTCC2503 and the archaeon <it>Halorubrum lacusprofundi </it>ATCC 49239 encode three pAgo proteins each, and the genome of <it>Acidobacterium capsulatum </it>ATCC 51196 encodes two pAgos) (see Additional File <supplr sid="S1">1</supplr>).</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p><b>The list of all identified PIWI domain containing proteins and their closest neighborhood</b>. The data provided represent the list of all identified PIWI domain containing proteins that were further analyzed in this work.</p>
               </text>
               <file name="1745-6150-4-29-S1.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>Comparative sequence analysis of the identified pAgos showed that the conserved, alignable region shared by all these sequences approximately corresponded to the L2, Mid and PIWI domains, as inferred from the crystal structures of the pAgos from the hyperthermophilic bacterium <it>Aquifex aeolicus </it>(AaAgo; pdb: <ext-link ext-link-id="1YVU" ext-link-type="pdb">1YVU</ext-link><abbrgrp><abbr bid="B35">35</abbr></abbrgrp>), <it>Thermus thermophilus </it>(TtAgo; pdb: <ext-link ext-link-id="3DLB" ext-link-type="pdb">3DLB</ext-link><abbrgrp><abbr bid="B31">31</abbr><abbr bid="B36">36</abbr></abbrgrp>), as well as the archaea <it>Pyrococcus furiosus </it>(PfAgo; pdb: <ext-link ext-link-id="1Z25" ext-link-type="pdb">1Z25</ext-link><abbrgrp><abbr bid="B33">33</abbr></abbrgrp>) and <it>Archaeoglobus fulgidus </it>(AfAgo; pdb: <ext-link ext-link-id="1W9H" ext-link-type="pdb">1W9H</ext-link><abbrgrp><abbr bid="B37">37</abbr></abbrgrp>) (Figure <figr fid="F1">1</figr>; see also Additional File <supplr sid="S2">2</supplr>). In addition to the three conserved domains, both pAgos whose structures have been solved contain an N-terminal domain, an L1 domain, and a PAZ domain that, as in eukaryotic Argonaute, binds the 3' end of a siRNA guide and positions the middle of the siRNA guide, bound to the target mRNA, in the catalytic pocket of the PIWI nuclease <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>. However, among the identified pAgos, more than half lack the N-terminal, L1 and PAZ domains, although several instead contain an N-terminal fusion with predicted nucleases of the Sir2 family (Figure <figr fid="F1">1</figr>; see details below).</p>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p><b>Multiple alignment for full length PIWI domain containing proteins</b>. The provided alignment shows distinct groups of PIWI proteins.</p>
               </text>
               <file name="1745-6150-4-29-S2.ali">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Domain architecture variation in homologs of Argonaute from prokaryotes (pAgos) and eukaryotes (Ago)</p>
               </caption>
               <text>
                  <p><b>Domain architecture variation in homologs of Argonaute from prokaryotes (pAgos) and eukaryotes (Ago)</b>. Structural domains (N-term, L1, PAZ, L2, Mid, PIWI) are projected from the tertiary structure of AaAgo (pdb: <ext-link ext-link-type="pdb" ext-link-id="1YVU">1YVU</ext-link><abbrgrp><abbr bid="B35">35</abbr></abbrgrp>). Red bars show the inactivated catalytic sites of the PIWI domain. Sir2, predicted Sir2 family nuclease domain. APAZ, a domain identified in this work that is associated with pAgos. The domains are shown roughly to scale.</p>
               </text>
               <graphic file="1745-6150-4-29-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>PIWI domain is inactivated in numerous pAgos</p>
            </st>
            <p>The PIWI domain of Argonaute proteins belongs to the RNAse H fold and shares the divalent cation-binding motif DDE (aspartate, aspartate, glutamate) involved in catalysis with many other nucleases that cleave both RNA and DNA <url>http://scop.mrc-lmb.cam.ac.uk/scop/data/scop.b.d.hh.html</url><abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. The two aspartates are essential for the slicer activity of eukaryotic Argonautes, whereas the third catalytic residue can be glutamate, histidine, aspartate or lysine <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. Another conserved feature of Argonautes is the presence of a basic residue (in most instances, arginine) that is located in the catalytic site <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. Some eukaryotic Argonaute proteins appear to be inactive (hence denoted non-slicer Argonautes), especially in nematodes <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. Apparently, non-slicer Argonautes interfere with translation through binding rather than cleavage of mRNA <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. Examination of the multiple alignment of the catalytic cores of prokaryotic PIWI domains strongly suggests that the majority of these domains are inactivated, as indicated by the replacement of two or all three acidic residues required for catalysis; this apparent abrogation of the nuclease activity is particularly common in those pAgo proteins that lack the PAZ domain (Figure <figr fid="F2">2</figr>).</p>
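            <p>The reasoning about catalytic residues can be illustrated with a short Python sketch. It assumes that the three residues aligned to the catalytic positions have already been extracted from a multiple alignment; the function name and the example residue triplets are hypothetical. The rule is a deliberately strict simplification (any replacement of the two aspartates, or a non-permitted residue at the third position, is flagged), not the procedure used to produce Figure 2.</p>

```python
# Residues permitted at the third catalytic position (from the text:
# glutamate, histidine, aspartate or lysine).
THIRD_POSITION_ALLOWED = frozenset("EHDK")


def predict_piwi_state(catalytic_residues: str) -> str:
    """Classify a PIWI domain from its three aligned catalytic residues.

    Simplified, assumed rule: both aspartates must be present, and the
    third position must carry one of the permitted residues.
    """
    if len(catalytic_residues) != 3:
        raise ValueError("expected exactly three aligned residues")
    first, second, third = catalytic_residues.upper()
    aspartates_intact = first == "D" and second == "D"
    third_intact = third in THIRD_POSITION_ALLOWED
    if aspartates_intact and third_intact:
        return "predicted active"
    return "apparently inactivated"


# Invented residue triplets, for illustration only.
for triplet in ("DDH", "DDE", "GSK", "DAN"):
    print(triplet, predict_piwi_state(triplet))
```

            <p>A prediction of this kind only flags candidates: whether a given pAgo is an active nuclease has to be established experimentally.</p>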
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Prokaryotic PIWI-domains: predicted active nucleases and apparently inactivated forms</p>
               </caption>
               <text>
                  <p><b>Prokaryotic PIWI-domains: predicted active nucleases and apparently inactivated forms</b>. The multiple sequence alignment includes the core motifs of PIWI domains encompassing the amino acid residues that comprise the (D/E)-(D/E)XK active site. The sequences are denoted by their GI numbers and species names. The positions of the first and the last residues of the aligned region in the corresponding protein are indicated for each sequence. The numbers within the alignment represent poorly conserved inserts that are not shown. The catalytic residues of the D-RD-EXK active site are shown in reverse shading and shown underneath the secondary structure, which corresponds to the solved structure for Pf-Ago (PDB: <ext-link ext-link-type="pdb" ext-link-id="1Z25">1Z25</ext-link>); 'H' indicates &#945;-helix, 'E' indicates extended conformation (&#946;-strand). Sequence identifiers for pAgos that are not associated with other proteins in putative operons are highlighted in bold. The coloring is based on the consensus shown underneath the alignment; 'h' indicates hydrophobic residues (WFYMLIVACTH), 'p' indicates polar residues (EDKRNQHTS), 's' indicates small residues (ACDGNPSTV).</p>
               </text>
               <graphic file="1745-6150-4-29-2"/>
            </fig>
            <p>The AfAgo protein, which does not contain a PAZ domain, also lacks the catalytic aspartates but has been shown to bind dsRNA <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B40">40</abbr></abbrgrp>. Structural analysis of AfAgo complexed with a siRNA-like duplex showed that in this protein a Cd<sup>2+ </sup>ion bound to the carboxy-terminal carboxylate and several amino acid residues in the middle (MID) domain are involved in the recognition of the unpaired 5' nucleotide of siRNA <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B40">40</abbr></abbrgrp>. In contrast, a structural and biochemical study of AaAgo, which contains the PAZ domain and the conserved catalytic residues, showed that this protein is an active RNAse H with a preference for a DNA/RNA hybrid as a substrate, suggesting that some pAgos employ small guide DNA molecules to cleave mRNA <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. The detailed study of the <it>Thermus thermophilus </it>pAgo corroborated the findings on AaAgo by revealing the details of interactions with the 5'-phosphorylated 21-base DNA guide strand and the DNA-guided RNA cleavage by this protein <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B36">36</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic analysis of the Argonaute family suggests extensive horizontal gene transfer in prokaryotes</p>
            </st>
            <p>We constructed a phylogenetic tree of the PIWI domains from all the detected pAgos (after excluding sequences that were fragmented or truncated due to poor annotation) and a subset of eukaryotic Argonautes (Figure <figr fid="F3">3</figr>). The majority of the PIWI domains from pAgos that lack a PAZ domain form a distinct clade, although a few of these short forms cluster within the other clade that consists mostly of full-size, PAZ-containing pAgos. Within the latter clade, the short proteins do not form a distinct group (Figure <figr fid="F3">3</figr>), suggesting that the N-terminal part of pAgo was lost independently in several lineages. Consistent with the similarity of domain architectures and with the results of previous analyses <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, eukaryotic Argonautes belong to a well-supported clade together with a distinct subset of archaeal pAgos, in particular the structurally characterized <it>Pyrococcus furiosus </it>protein, which is considered to be the model for Argonaute functioning in eukaryotes <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Other archaeal proteins are scattered in the tree, suggesting multiple horizontal gene transfers (HGT) between bacteria and archaea (Figure <figr fid="F3">3</figr>). Despite the existence of several small lineage-specific groups (alpha proteobacteria, gamma proteobacteria, bacteroides and cyanobacteria), the results of our phylogenetic analysis strongly suggest that pAgo genes were mostly disseminated by HGT; the patchy distribution of these genes makes it unlikely that they perform indispensable functions in any bacteria or archaea (Figure <figr fid="F3">3</figr>).</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Phylogenetic analysis of PIWI-domains and organization of the predicted pAgo operons</p>
               </caption>
               <text>
                  <p><b>Phylogenetic analysis of PIWI-domains and organization of the predicted pAgo operons</b>. The ML tree is rooted between the (predominantly) PAZ-domain-containing and PAZ-domain-lacking branches. The RELL bootstrap values are indicated (%) for selected major branches. Color code: gray, Eukaryota; orange, Archaea; blue, Proteobacteria; green, Firmicutes; black, other lineages of bacteria. Each organism is denoted by the full systematic name and the Gene Identifier (GI) number. The PDB ID is indicated for those sequences for which the tertiary structure has been solved. Sequences of short PIWI proteins that have lost the N-terminal part, including the PAZ domain, but belong to the branch that consists mostly of full-size sequences are indicated by the "#" symbol. For those PIWI-domain proteins that are associated with genes encoding a nuclease domain, the domain architectures of the pAgo-associated proteins are shown.</p>
               </text>
               <graphic file="1745-6150-4-29-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>The pAgos are contextually linked to at least three distinct families of predicted nucleases</p>
            </st>
            <p>We further examined the genomic context of the pAgo genes; analysis of genomic context has been established as a powerful approach for prediction of the biological functions of prokaryotic genes using the "guilt by association" principle <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>. In many cases, these genes form potential operons with a variety of genes encoding uncharacterized proteins (neighbor genes were predicted to be encoded in a potential operon with pAgos if they were located upstream or downstream of the respective pAgo gene on the same DNA strand and if the intergenic distances in such an array of co-directional genes were shorter than 100 nt; see Additional File <supplr sid="S1">1</supplr>). We performed an in-depth analysis of the sequences of the proteins encoded in the genes co-localized with pAgos using PSI-BLAST, HHpred and CDD search (see Methods). This analysis resulted in the identification of four protein families that are predicted to be co-expressed and thus functionally linked with the pAgos.</p>
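            <p>The neighborhood rule described above (co-directional genes separated by intergenic distances shorter than 100 nt) can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the pipeline used in this work: the Gene record, the function names, and the example coordinates are all hypothetical.</p>

```python
from dataclasses import dataclass
from typing import List

MAX_GAP_NT = 100  # intergenic distances must be shorter than this (from the text)


@dataclass(frozen=True)
class Gene:
    """Hypothetical gene record; coordinates are 1-based and inclusive."""
    name: str
    start: int
    end: int
    strand: str  # "+" or "-"


def intergenic_gap(left: Gene, right: Gene) -> int:
    """Number of nucleotides between two genes; overlapping genes give 0."""
    return max(0, right.start - left.end - 1)


def linked(left: Gene, right: Gene) -> bool:
    """True if two adjacent genes are co-directional and close enough."""
    return left.strand == right.strand and MAX_GAP_NT > intergenic_gap(left, right)


def predict_operon(genes: List[Gene], anchor: str) -> List[str]:
    """Grow an array of linked genes on both sides of the anchor gene."""
    ordered = sorted(genes, key=lambda g: g.start)
    idx = next(i for i, g in enumerate(ordered) if g.name == anchor)
    lo = hi = idx
    while lo - 1 >= 0 and linked(ordered[lo - 1], ordered[lo]):
        lo -= 1
    while len(ordered) > hi + 1 and linked(ordered[hi], ordered[hi + 1]):
        hi += 1
    return [g.name for g in ordered[lo:hi + 1]]


# Invented coordinates, for illustration only.
example = [
    Gene("geneA", 100, 900, "-"),
    Gene("nuclease", 1000, 2200, "+"),
    Gene("pAgo", 2250, 4300, "+"),
    Gene("geneB", 4500, 5000, "+"),
]
print(predict_operon(example, "pAgo"))
```

            <p>In this invented example, geneA is excluded because it lies on the opposite strand, and geneB is excluded because its distance from the pAgo gene (199 nt) exceeds the threshold, so the predicted array contains only the nuclease gene and the pAgo gene.</p>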
            <p>The first family is typified by the xccb100_3097 protein from <it>Xanthomonas campestris </it>B100, the only protein among the pAgo neighbors that, in the current sequence databases, is annotated as a "putative Sir2-family regulator" rather than a "hypothetical protein". Indeed, CDD search detected statistically significant similarity between the N-terminal domain of this protein and the SIR2 domain (cl00195, E-value = 5 &#215; 10<sup>-5</sup>). The Sir2 proteins, also known as sirtuins, are a well characterized family of NAD<sup>+</sup>-dependent histone deacetylases in eukaryotes where they play key roles in the regulation of gene silencing, DNA repair, metabolic enzymes, and life span <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp>. Representatives of this family also have been identified in both bacteria and archaea, and the structures of several Sir2 family proteins have been solved <abbrgrp><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr></abbrgrp>. So far all experimentally characterized Sir2 family proteins have been shown to possess protein deacetylase activity <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. However, a distinct family of prokaryotic sirtuins is associated with DNA-pumping ATPases of the FtsK-HerA family <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. Because in numerous other instances the FtsK-like ATPases are associated with known nucleases, both functionally and in terms of the operon structure, it was hypothesized that this particular family of sirtuins could function as nucleases, and a conserved DxH motif was implicated in the predicted nuclease activity <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. 
The majority of the xccb100_3097-like proteins contain only one of these residues, namely, the aspartate in the loop between strand 7 and helix 11 (according to the crystal structure of human Sirt2 histone deacetylase, pdb: <ext-link ext-link-id="1j8f" ext-link-type="pdb">1j8f</ext-link><abbrgrp><abbr bid="B51">51</abbr></abbrgrp>) but instead have an additional aspartate in strand 2 that is conserved within this family (Figure <figr fid="F4">4A</figr>). Similarly to Sir2 proteins associated with the FtsK-like ATPases, xccb100_3097-like proteins lack the Zn-ribbon insert between strand 4 and helix 10 that is characteristic of most sirtuins, but retain all NAD<sup>+</sup>-binding site residues, suggesting that these proteins are active enzymes (Figure <figr fid="F4">4A</figr>).</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Multiple alignment of predicted nuclease domains found in the genomic neighborhoods of pAgo genes</p>
               </caption>
               <text>
                  <p><b>Multiple alignment of predicted nuclease domains found in the genomic neighborhoods of pAgo genes</b>. A. Predicted nucleases of the Sir2 family. Numbering of the secondary structure elements corresponds to that reported for PDB: <ext-link ext-link-type="pdb" ext-link-id="1j8f">1j8f</ext-link><abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. B. (D/E)-(D/E)XK family nucleases. The designations are as in Figure 1. Additional coloring is 'o', hydroxyl-group containing residues (ST); '@', aromatic residues (YWF).</p>
               </text>
               <graphic file="1745-6150-4-29-4"/>
            </fig>
            <p>For the C-terminal domain of xccb100_3097, we failed to detect any statistically significant similarities to known domains using CDD search or HHpred. However, a PSI-BLAST search with xccb100_3097 used as the query revealed many homologs with similar domain architectures, all of which are associated with pAgos in putative operons; moreover, several multidomain proteins (e.g. GIs: 91783256, 218130589, 229435559) comprise fusions of xccb100_3097-like and PIWI domains (see the alignment of this domain in Additional File <supplr sid="S3">3</supplr>).</p>
            <suppl id="S3">
               <title>
                  <p>Additional file 3</p>
               </title>
               <text>
                  <p><b>Multiple alignment of uncharacterized C-terminal domain of proteins also containing N-terminal nuclease domain and associated with PIWI proteins</b>. The provided alignment shows the previously undetected domain associated with PIWI proteins.</p>
               </text>
               <file name="1745-6150-4-29-S3.ali">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>The second family of PIWI-associated proteins is typified by the mlr6203 (GI: 13475182) protein from <it>Mesorhizobium loti</it>. The HHpred search convincingly shows that the N-terminal domain of these proteins belongs to the Mrr family of restriction endonucleases, with the hallmark (D/E)-(D/E)XK active site <abbrgrp><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr></abbrgrp> (for example, the best hit is to pdb: <ext-link ext-link-id="2ost" ext-link-type="pdb">2ost</ext-link>, homing endonuclease from <it>Synechocystis sp</it>., E-value = 0.04; followed by a hit to pfam04471, Restriction endonuclease, E-value = 0.04). All experimentally characterized superfamily representatives are site-specific endonucleases that cleave dsDNA and possess an enormous variety of recognition sites <abbrgrp><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr></abbrgrp>. The active site residues are conserved in all mlr6203 homologs (Figure <figr fid="F4">4B</figr>), so this domain probably is an active DNA endonuclease. As with the xccb100_3097 family proteins, no similarity to the C-terminal domain of the mlr6203 was detected in CDD and HHpred searches. However, the PSI-BLAST search identified 17 homologous proteins with the same domain architecture and predicted operon organization (see Additional File <supplr sid="S1">1</supplr>).</p>
            <p>A typical representative of the third family is RHECIAT_PB0000019 (GI: 190894000) from <it>Rhizobium etli</it>. This protein contains an N-terminal TIR domain that was easily detected by HHpred (the best hit is to pdb: <ext-link ext-link-id="2js7" ext-link-type="pdb">2js7</ext-link>, TIR domain of myeloid differentiation primary response protein MYD88 from human, E-value of 1.1 &#215; 10<sup>-30</sup>). The TIR domain mediates protein-protein interactions and belongs to the STIR superfamily that includes mostly eukaryotic proteins involved in diverse signaling pathways as well as a variety of poorly characterized multidomain proteins from bacteria and archaea with large genomes (that also have been implicated in transcription regulation and signaling <abbrgrp><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr></abbrgrp>). Notably, TIR domains play important roles in disease and stress resistance in plants <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>. Similarly, in mammals, TIR-domains are key components of the immune system-based antimicrobial and antiviral response, and the programmed cell death (PCD) system <abbrgrp><abbr bid="B59">59</abbr><abbr bid="B60">60</abbr></abbrgrp>. Analysis of domain architectures led to the hypothesis that prokaryotic TIR-domain proteins also could be involved in PCD <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. All closely related homologs of the RHECIAT_PB0000019 protein contain the TIR domain (see Additional File <supplr sid="S3">3</supplr>), whereas several proteins in this family (e.g. GI: 162145848) also contain an additional N-terminal domain that belongs to the PD-(D/E)XK nuclease superfamily (a vast assemblage of nucleases that includes, among others, the restriction endonucleases) with all catalytic residues typically conserved (Figure <figr fid="F4">4B</figr>). 
The C-terminal domain of these proteins is not similar to any known domain, but does show a weak sequence similarity (with statistical significance difficult to demonstrate) to the C-terminal domain of the mlr6203-like family. Considering the similar sizes of the corresponding domains in both families and, most importantly, the genomic association with predicted nucleases and pAgos, we strongly suspect that these domains are homologous; examination of their multiple alignment indeed shows several distinct, conserved motifs (see Additional File <supplr sid="S3">3</supplr>). The predicted secondary structure indicates that this is a globular domain; however, the pattern of amino acid residue conservation does not seem to suggest an enzymatic function. Given that the proteins containing this domain are found exclusively in the same neighborhoods with pAgos that lack the PAZ domain, it is tempting to speculate that this uncharacterized domain is functionally analogous to the PAZ domain, that is, involved in binding a guide nucleic acid molecule (hereinafter we refer to this domain as APAZ, after Analog of PAZ).</p>
            <p>The fourth family of pAgo-associated proteins is linked to full-size, PAZ-domain-containing Argonaute homologs and can be typified by the protein PTH_0722 (GI: 147677057) from <it>Pelotomaculum thermopropionicum</it>. This protein contains a C-terminal domain that belongs to the PD-(D/E)XK nuclease superfamily (HHpred detects similarity to SfsA: Sugar fermentation stimulation protein, which contains a PD-(D/E)XK nuclease domain, with E-value = 0.022) and contains all the catalytic residues (Figure <figr fid="F4">4B</figr>); this putative nuclease is clearly distinct from and only very distantly related to the restriction endonuclease domain of the mlr6203-like family proteins. The N-terminal domain of this protein does not show similarity to any characterized domains, has a predicted predominantly &#945;-helical structure and is present only in close homologs of PTH_0722 (see Additional File <supplr sid="S4">4</supplr>). In the GobsU_24486 protein of <it>Gemmata obscuriglobus</it>, the nuclease domain is replaced by the apparently functionally unrelated SEFIR domain of the STIR superfamily, which is only distantly related to the TIR domain but is also involved in various signaling pathways <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>.</p>
            <suppl id="S4">
               <title>
                  <p>Additional file 4</p>
               </title>
               <text>
                  <p><b>Multiple alignment of uncharacterized N-terminal domain of proteins also containing C-terminal nuclease domain and associated with PIWI proteins</b>. The provided alignment shows the previously undetected domain associated with PIWI proteins.</p>
               </text>
               <file name="1745-6150-4-29-S4.ali">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>Several other genomic neighbors of pAgos are worth mentioning (Figure <figr fid="F3">3</figr>). Two genes that encode PAZ-domain-containing but, apparently, inactivated pAgos (in the bacteria <it>Pedobacter heparinus </it>and <it>Spirosoma linguale</it>) are associated with predicted Sir2 family nucleases (Figure <figr fid="F4">4A</figr>). Furthermore, three long forms of pAgos (one inactivated, in the bacterium <it>Dehalococcoides </it>sp., and two apparently active ones in <it>Microcystis aeruginosa </it>and <it>Clostridium bartlettii</it>) are associated with PD-(D/E)XK nucleases of a distinct subfamily related to Cas4 (COG1468), which is mostly represented within CASS <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. Most conspicuously, as noticed previously, in the archaeon <it>Methanopyrus kandleri</it>, the pAgo is encoded within an operon that otherwise encodes components of the CASS <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>.</p>
            <p>A potentially important pattern revealed by this analysis of the genomic context of prokaryotic PIWI-domain proteins is that, almost without exception, pAgos with an apparently inactivated catalytic PIWI domain are associated with a predicted nuclease in a putative operon (Figures <figr fid="F2">2</figr>, <figr fid="F3">3</figr> and see Additional File <supplr sid="S1">1</supplr>). This observation suggests the possibility of functional complementarity between the nuclease activity of PIWI domains of pAgos and other nucleases, in particular, homologs of restriction endonucleases (see discussion below).</p>
         </sec>
         <sec>
            <st>
               <p>Statistical analysis of the genomic neighborhoods of pAgos reveals a significant link to phage resistance systems</p>
            </st>
            <p>Considering (i) the central role of Argonaute proteins in siRNA-based antiviral response in eukaryotes, (ii) the contextual links between pAgos and nucleases (in particular, restriction endonucleases) that are involved in phage/plasmid defense in prokaryotes, and (iii) links to the TIR domain that also functions in antimicrobial response in eukaryotes, it is tempting to hypothesize that an important if not the principal function of the pAgos has to do with phage defense (or, more generally, defense against viruses, plasmids, and other mobile elements). Phage defense systems in prokaryotes are notably prone to HGT (the CASS being the prime showcase), and phylogenetic analysis of the pAgos clearly indicates that HGT shapes the evolution of pAgo-encoding genes as well (Figure <figr fid="F3">3</figr>). In addition, phage defense systems are often encoded in genomic islands <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>. Therefore we sought to statistically test the hypothesis that pAgo genes are non-randomly associated with known phage resistance genes in prokaryotic genomes. To this end, we identified 4 classes of phage defense systems (some of which are also involved in a broader range of stress response reactions) in a representative set of 45 prokaryotic genomes and computed the fractions of these genes throughout the genomes and in the vicinity of pAgo genes (see Methods for details). The Fisher Omnibus test <abbrgrp><abbr bid="B63">63</abbr><abbr bid="B64">64</abbr></abbrgrp> reveals a statistically highly significant enrichment of the pAgo genomic neighborhoods (see Methods for details) for different combinations of 4 classes of phage defense genes used as a target set (Table <tblr tid="T1">1</tblr>). 
As a control, we performed the same analysis for pAgo genes and typical components of the bacterial mobilome including transposases and various phage-derived genes; no statistically significant association was found between pAgos and these mobile genes (p = 0.63; see Additional Files <supplr sid="S5">5</supplr> and <supplr sid="S6">6</supplr>).</p>
            <suppl id="S5">
               <title>
                  <p>Additional file 5</p>
               </title>
               <text>
                  <p><b>The list of all COGs implicated in antiphage defense</b>. The data provided represent the list of phage defense COGs of the four distinct systems used for the Fisher Omnibus test.</p>
               </text>
               <file name="1745-6150-4-29-S5.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S6">
               <title>
                  <p>Additional file 6</p>
               </title>
               <text>
                  <p><b>The data used for the Fisher Omnibus test</b>. The file contains data and calculations for the Fisher Omnibus test. Each worksheet corresponds to the analysis of a distinct set of phage defense COGs (see also AF3_Ph_def_COGs.xls). On the left-hand side are calculations for the whole set of genomes. On the right-hand side, highlighted in yellow, are calculations for a representative set of genomes (closely related genomes were excluded).</p>
               </text>
               <file name="1745-6150-4-29-S6.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Results of the Fisher Omnibus test for the genomic association of pAgo genes with four classes of phage defense/stress response systems</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="center">
                        <p>RM</p>
                     </c>
                     <c ca="center">
                        <p>ABI</p>
                     </c>
                     <c ca="center">
                        <p>CASS</p>
                     </c>
                     <c ca="center">
                        <p>TA</p>
                     </c>
                     <c ca="center">
                        <p>Combined p-value</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>5.1 &#215; 10<sup>-7</sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>2.9 &#215; 10<sup>-13</sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>5.8 &#215; 10<sup>-10</sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>4.6 &#215; 10<sup>-16</sup></p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>RM, Restriction-modification related COGs; ABI, abortive infection related COGs; CASS, CASS-associated systems; TA, toxin-antitoxin systems related COGs. The phage defense systems that were included in the target genes combination in each of the 4 analyses with the Fisher Omnibus test are shown by "+" (for instance, the first row shows the results of statistical analysis for RM and ABI systems).</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Hypothesis: pAgo is a key component of a novel prokaryotic immune system in which it functions either as a nuclease or as a structural subunit of nuclease complexes that utilizes guide RNAs or DNAs to degrade virus/plasmid genomes</p>
            </st>
            <p>Several convergent lines of evidence point to defense against invading mobile elements as the primary function of pAgos. (1). The analogy to eukaryotic Argonautes many of which are dedicated to the defense against viruses and transposable elements. (2). The guide-DNA-dependent nuclease activity of AaAgo and TtAgo. (3). Extensive HGT of pAgos which is best compatible with a stress-response related function. (4). Preferential location of pAgo genes in genomic neighborhoods significantly enriched in known phage-defense genes. (5). Co-localization of PIWI-domain protein genes with genes encoding other (predicted) nucleases. (6). The near perfect complementarity between the predicted nuclease and guide-binding activities of pAgos and co-localization with other putative nucleases: the inactivated pAgos that lack the PAZ domain are associated with genes encoding predicted nucleases whereas the apparently active, PAZ-containing pAgos are not (Figure <figr fid="F3">3</figr>). The latter observation suggests that pAgos function within nuclease complexes, in some cases as their catalytic subunits, and in other cases, as structural subunits interacting with the actual nucleases.</p>
            <p>Additional functional clues allow us to tentatively propose more specific mechanisms for the functions of pAgos in the defense of prokaryotes against mobile elements (Figure <figr fid="F5">5</figr>). In eukaryotic Argonautes, the PAZ domain binds the small guide RNA and facilitates its hybridization with the complementary region of the target mRNA. Most of the pAgos that are predicted to be active nucleases also contain PAZ domains, suggesting that they function via a similar mechanism, in agreement with the experimental data for AaAgo and TtAgo <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B36">36</abbr><abbr bid="B63">63</abbr><abbr bid="B64">64</abbr></abbrgrp>. The apparently inactivated pAgos lack PAZ domains but are co-localized with genes encoding predicted nucleases and the APAZ domain (Figure <figr fid="F1">1</figr>, <figr fid="F2">2</figr>). The (so far) exclusive presence of the APAZ domain within predicted operons encoding inactivated pAgos leads us to speculate that, similarly to PAZ domains, the APAZ domains bind guide molecules and target the putative nuclease complex to phage nucleic acids.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Possible mechanisms of the hypothetical novel prokaryotic systems of defense against mobile elements centered around pAgo compared to the mechanisms of CASS and eukaryotic RNAi</p>
               </caption>
               <text>
                  <p><b>Possible mechanisms of the hypothetical novel prokaryotic systems of defense against mobile elements centered around pAgo compared to the mechanisms of CASS and eukaryotic RNAi</b>. Currently, models (3) and/or (4) are the most likely functional mechanisms for pAgo (see text) but the eukaryotic Ago-like (1) and the prokaryotic CASS-like (2) models cannot be ruled out at this stage. RNA molecules are shown in red and DNA molecules in blue. Circles denote the proteins that form complexes with the guide RNA or DNA. Arrows indicate the directions of the respective processes.</p>
               </text>
               <graphic file="1745-6150-4-29-5"/>
            </fig>
            <p>The PD-(D/E)XK superfamily nucleases, to which the predicted nucleases associated with the majority of pAgos are homologous, so far have been shown to cleave exclusively dsDNA. Thus, it seems most likely that the predicted pAgo-based defense systems directly target invader dsDNA genomes rather than mRNAs (Figure <figr fid="F5">5</figr>). On the other hand, as stated above, in vitro analyses have revealed that AaAgo and TtAgo are most active as DNA-guided ribonucleases, suggesting that RNA may be a target as well <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. The guide molecule could be either a small RNA (with the implication that the respective nuclease cleaves an RNA-DNA hybrid) or a small DNA, as suggested by the studies of AaAgo <abbrgrp><abbr bid="B63">63</abbr><abbr bid="B64">64</abbr></abbrgrp> and TtAgo <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B36">36</abbr></abbrgrp>.</p>
            <p>The proposed model for the pAgo-based phage defense shows functional analogies to both CASS and the eukaryotic RNAi (Figure <figr fid="F5">5</figr>). Given the phylogenetic affinity of a distinct family of apparently active archaeal pAgos and eukaryotic Argonautes (Figure <figr fid="F3">3</figr>), this hypothetical defense system is the probable evolutionary progenitor of the eukaryotic RNAi. The spread of RNA viruses in eukaryotes that was accompanied by the displacement of the majority of DNA viruses <abbrgrp><abbr bid="B65">65</abbr></abbrgrp> could have been the driving force behind the switch of the specificity of this defense system from DNA to RNA.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The functions of the pAgos to some extent have been characterized <it>in vitro </it>(Yuan 2005)<abbrgrp><abbr bid="B31">31</abbr><abbr bid="B36">36</abbr></abbrgrp> but remain to be determined <it>in vivo</it>. The convergence of several lines of evidence discussed here seems to strongly support the hypothesis that pAgos are key components of a novel class of immune systems that employ guide DNA or RNA molecules to destroy virus and plasmid DNA (or mRNA). These proposed mechanisms of action suggest functional parallels between the predicted pAgo-based defense systems and CASS, and a direct evolutionary link between the former and eukaryotic RNAi. The predictions of the hypothesis, in particular, the nuclease activity catalyzed by PAZ-domain-containing but not by PAZ-domain-lacking pAgos, the complementary activities of associated putative nucleases, and guide DNA or RNA binding by the APAZ domains, are amenable to straightforward experimental validation.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Sequence analysis</p>
            </st>
            <p>All analyzed sequences were from the non-redundant protein sequence database at the NCBI. Database searches were performed using PSI-BLAST <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>, typically, with the inclusion threshold E = 0.01, and no composition-based statistics or low complexity filtering, or the HH search program available through the HHpred server <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>. Multiple alignments of protein sequences were constructed by combining the results obtained with the PROMALS program <abbrgrp><abbr bid="B68">68</abbr></abbrgrp> and the MUSCLE program <abbrgrp><abbr bid="B69">69</abbr></abbrgrp>, followed by a minimal manual correction on the basis of local alignments obtained using PSI-BLAST <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>. Protein secondary structure was predicted using the PSIPRED program <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>.</p>
            <p>Maximum likelihood (ML) phylogenetic trees were constructed from the alignment of PIWI domain region (only positions with less than 30% gaps were used for reconstruction &#8211; 258 altogether), by using the MOLPHY program <abbrgrp><abbr bid="B71">71</abbr></abbrgrp> with the JTT substitution matrix to perform local rearrangement of an original Fitch tree <abbrgrp><abbr bid="B72">72</abbr></abbrgrp>. The MOLPHY program was also used to compute RELL bootstrap values.</p>
         </sec>
         <sec>
            <st>
               <p>Fisher Omnibus test</p>
            </st>
            <p>Only 45 completely sequenced genomes were used for this analysis; the complete genome information was obtained from the FTP site of the RefSeq database (<url>ftp://ftp.ncbi.nih.gov/genomes/Bacteria/</url>;<abbrgrp><abbr bid="B73">73</abbr></abbrgrp>). Proteins in these genomes were assigned to COGs using a modified COGNITOR program <abbrgrp><abbr bid="B74">74</abbr></abbrgrp>. The target sets of phage defense proteins were obtained from the following sources: restriction-modification (RM) system-related proteins from REBASE <abbrgrp><abbr bid="B75">75</abbr></abbrgrp>; abortive infection (ABI)-related genes from the Chopin et al. review <abbrgrp><abbr bid="B76">76</abbr></abbrgrp>; CRISPR system-related genes from <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>; and toxin-antitoxin-related genes from <abbrgrp><abbr bid="B77">77</abbr></abbrgrp>. Proteins of the RM and ABI systems were assigned to COGs as indicated above; for the other systems, COG numbers had already been reported in the aforementioned papers (see the complete list of these COGs in Additional File <supplr sid="S5">5</supplr>).</p>
            <p>In each genome, we identified the genes that belong to each of the aforementioned four well-characterized phage defense systems and computed the gene counts for each system in the entire genome (<it>K </it>phage defense genes in a genome containing <it>N </it>genes) as well as within the window of size &#177; <it>w </it>= 10 surrounding each pAgo gene (<it>k </it>phage defense genes in the window). For each window, the probability of observing &#8805;<it>k </it>phage defense genes by chance was approximated using the binomial distribution:</p>
            <p>
               <display-formula>
                  <graphic file="1745-6150-4-29-i1.gif"/>
               </display-formula>
            </p>
            <p>The results obtained for multiple windows were combined using Bailey and Gribskov's variant of the Fisher Omnibus test <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>:</p>
            <p>
               <display-formula>
                  <graphic file="1745-6150-4-29-i2.gif"/>
               </display-formula>
            </p>
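            <p>The two steps above can be illustrated with a minimal Python sketch. It is not the code used in this study, and it rests on stated assumptions: the per-gene probability is taken as K/N, each window is assumed to contain 2w = 20 neighboring genes, and the combination step uses the standard closed-form tail probability of a product of n independent uniform p-values (the product multiplied by the sum of (-ln product)^i / i! for i from 0 to n-1), which may differ in detail from the displayed formulas. All function names and the example counts are hypothetical.</p>

```python
import math


def binomial_tail(k, n, p):
    """Probability of at least k successes among n independent trials with success rate p."""
    return sum(math.comb(n, i) * p**i * (1.0 - p) ** (n - i) for i in range(k, n + 1))


def combine_pvalues(pvalues):
    """Tail probability of the product of len(pvalues) independent uniform p-values."""
    log_product = sum(math.log(p) for p in pvalues)
    x = -log_product
    term, total = 1.0, 0.0
    for i in range(len(pvalues)):  # accumulates the sum of x**i / i! for i = 0 .. n-1
        total += term
        term *= x / (i + 1)
    return math.exp(log_product) * total


# Hypothetical genome: K = 40 phage defense genes among N = 4000 genes.
K, N = 40, 4000
window_genes = 20  # assumption: w = 10 genes on each side of the pAgo gene
# Hypothetical counts k of phage defense genes observed in three pAgo windows.
window_pvalues = [binomial_tail(k, window_genes, K / N) for k in (3, 2, 4)]
print(combine_pvalues(window_pvalues))
```

            <p>Working with the logarithm of the product rather than the product itself keeps the calculation numerically stable when many small per-window p-values are combined.</p>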
         </sec>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The authors declare that they have no competing interests.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>KSM and JVDO initiated the study; KSM performed sequence analysis and genome comparison; YIW devised and performed the statistical tests; KSM, JVDO and EVK interpreted the results and formulated the hypothesis; KSM and YIW wrote the first draft of the manuscript; EVK and JVDO wrote the final manuscript that was read and approved by all authors.</p>
      </sec>
      <sec>
         <st>
            <p>Reviewers' comments</p>
         </st>
         <sec>
            <st>
               <p>Reviewer 1</p>
            </st>
            <sec>
               <st>
                  <p>Daniel Haft, The J. Craig Venter Institute</p>
               </st>
               <p>Draft Public Comments</p>
               <p>"Emerging evidence about prokaryotic homologs of Argonaute (pAgo) makes it clear that these proteins are related to their eukaryotic counterparts not just in sequence and structure, but also in molecular function. They might be related as well in terms of biological process, perhaps with many or most serving a primary function of phage resistance rather than of host gene transcriptional regulation. The case made in this manuscript, as argued by the interpretation of protein domain architecture, is highly suggestive. However, the statistical test for genomic association of pAgo with other phage resistance systems is currently unconvincing in the absence of a negative control. Other possible roles for pAgos seem equally consistent with available data."</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
                <p><it>A negative control, namely, a test of the possible association of pAgos with mobile genes that are not involved in phage defense, is included in the revised manuscript </it>(see Additional File <supplr sid="S5">5</supplr>). <it>As the result of this test was indeed negative, we find the statistical evidence as convincing as it can be although the final proof, of course, can only be experimental</it>.</p>
               <p>"One alternate possibility is that most pAgos serve as machinery for boutique host regulatory systems. Anti-sense RNA expression in bacteria has been underappreciated; its prevalence likely is still underestimated. Some antisense RNA is cis-acting, through a mechanism of transcriptional interference, but some is trans-acting, through mechanisms of dsRNA formation. Since the trans-acting antisense RNAs themselves have won only a limiting understanding, it stands to reason that mechanisms acting downstream of dsRNA formation also are incompletely understood. A role for many pAgo proteins in the control of host gene expression seems quite likely."</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>The possibility that some pAgos are also involved in regulation of bacterial genes is certainly interesting and not implausible. However, the data presented in this paper suggest to us that the functions in defense against mobile elements are primary</it>.</p>
               <p>"A second possibility for these systems, supported by their apparent high degree of lateral transfer, is that most are selfish genetic elements. By analogy to transposons, homing endonucleases encoded within inteins, and temperate phage, these systems may carry out nuclease reactions simply to mediate their own spread. Some incidental benefit to host genomes is possible; any endogenous nuclease, it may be assumed, has some potential to cleave phage DNA or RNA, as in the example of ribonuclease HIII vs. RNA phage. But that level of phage resistance capability could be regarded as secondary."</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
                <p><it>All prokaryotic defense and stress response systems are to a large extent selfish, as discussed in detail for restriction-modification and toxin-antitoxin systems. We strongly suspect that this is indeed the case for the putative pAgo-centered system as well</it>.</p>
                <p>"The extreme selective pressures of phage/host warfare make it quite likely that the proposed role for pAgos in phage resistance in prokaryotes is at least occasionally true. The greater question is whether pAgo proteins represent a new, major player in prokaryotic resistance to phage attack, and whether most pAgo proteins have host defense as a primary role. This is a mirror to the question of whether CRISPR arrays might be co-opted to perform regulatory functions, given their extreme plasticity and their transcription into small RNAs &#8211; one might examine repeat arrays after phage-free serial passage of selected strains under extreme selection."</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>Cooperation of pAgo with the CRISPR system cannot be ruled out but appears unlikely. Of the 780 bacterial and archaeal genomes that we analyzed for the presence of CRISPR and pAgo, 291 encoded CRISPR and 51 encoded pAgo, with an overlap of only 28 genomes. Of course, the localization of the pAgo gene within the Cas gene array in Methanopyrus kandleri is suggestive, but so far this remains the only genome that shows such an association</it>.</p>
               <p>"Restriction enzyme systems, especially restriction/modification systems, discriminate self vs. non-self by recognizing short sequence signatures in phage that are either masked or missing in the host. CRISPR systems discriminate self from non-self by capture and expression of samples of exogenous DNA. Both abortive infection systems and toxin-antitoxin systems have the potential to shut down the host cell, in response to stress from phage infection, in order to block the phage life cycle. Each of these schemes provides a clear model of how defense mechanisms are triggered. The trickiest part of the model for pAgos in phage defense concerns the source of guide DNA or RNA. Is it DNA encoded on the host chromosome? Will it have a promoter and a terminator? It seems at least theoretically possible that CRISPR arrays themselves might be a source. If a typical CRISPR system targets phage DNA according to exact matches to spacer sequences, one might postulate a backup system in which the same small RNAs, with some tolerance for mismatches, silence phage mRNA. It therefore makes sense to ask &#8211; what fraction of pAgos-containing genomes have CRISPR systems, and is the prevalence significantly higher for any subgroup of pAgos?"</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>It is indeed true that we do not have any inkling of the source of the putative guide DNA or RNA that is employed by pAgo. The idea that pAgo might share the guide molecules with CRISPR is very interesting. The problem is that, as indicated above, there is no clear sign of cooperation between pAgo and CRISPR and, most damning for this provocative idea, the majority of the genomes that encode pAgo possess no CRISPR</it>.</p>
               <p><it>We attempted to search for sequence conservation and repetitive elements in the upstream and downstream regions of pAgo operons but failed to find anything suggestive. When more closely related genomes encoding pAgo become available, it will be necessary to repeat this attempt</it>.</p>
               <p>A reasonable view of genome organization is that some regions of a genome are more plastic than others. The more plastic regions would be expected to accumulate prophages, transposons, integrated plasmids, conjugation regions, pseudogenes, and "fitness factors" such as CASS, antibiotic resistance genes, virulence genes, and capsular polysaccharide genes, all in close proximity. In this view, genes encoding restriction systems and CRISPR systems likely would occur close to each other because the region tolerates insertion, not because both systems mediate host defense. The statistical argument, therefore, does not currently allow one to discriminate phage defense from other possible functions for these systems. If the statistical association with RM and CASS is not replicated by associations with secretion systems, pilus proteins, integrases and recombinases, plasmid partition proteins, capsular polysaccharide biosynthesis genes, etc., then it may become somewhat more convincing.</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>We appreciate this suggestion and sought to test the hypothesis that co-localization of pAgo genes with those for other systems of antiphage defence is a trivial consequence of the occurrence of all these genes in highly plastic regions of prokaryotic genomes. To this end, we examined the potential association of pAgo genes with typical components of the mobilome such as transposases, integrases, and various genes of apparent phage origin. As indicated in the revised text of the article and presented in detail in the </it>Additional Files <supplr sid="S5">5</supplr> and <supplr sid="S6">6</supplr>, <it>there was no significant association between pAgo and the elements of the mobilome. Thus we believe that the most parsimonious interpretation of the data is that there are indeed phage defence islands in prokaryotic genomes and pAgo genes show a strong association with these islands</it>.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Reviewer 2</p>
            </st>
            <sec>
               <st>
                  <p>Martijn Huynen, Radboud University, Nijmegen Medical Centre</p>
               </st>
               <p>The manuscript by Makarova and co-workers provides a compelling argument for the functional link between Bacterial and Archaeal Argonaute proteins and proteins that are involved in defense against "foreign" DNA.</p>
               <p>I only have a few comments:</p>
               <p>Studies on the value of the genomic association of genes for the prediction of functional links between proteins have gone to a great length to actually benchmark at which level of genomic association it not only becomes statistically significant, but also functionally meaningful in terms of predicting that proteins are actually involved in the same pathway. I cannot judge the level of "functional relevance" of the P-values provided in table <tblr tid="T1">1</tblr>.</p>
               <p>Along the same lines: can the authors give simple numbers of how often the four protein families were discovered in the vicinity of the 100 pAgos genes?</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>This information is now available in the new </it>Additional File <supplr sid="S6">6</supplr> <it>for the set of 45 genomes that were analyzed using the Fisher Omnibus test</it>.</p>
               <p>I take it that all genomes that were included in the significance study were phylogenetically distant enough to assure that gene order conservation was not trivial?</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>No, we did this analysis for all available genomes, since even in some closely related genomes the location of the pAgo operons is different. In response to these concerns, we have redone the analysis for distantly related genomes only. The results have not substantially changed; actually, even more significant p-values were obtained </it>(see the new Additional File <supplr sid="S6">6</supplr>).</p>
               <p>"This analysis resulted": I cannot find how this analysis was done. The Fisher Omnibus test mentioned in the Methods does not require genes to be part of the same potential operon, and "predicted to be co-expressed" thus cannot be concluded from it.</p>
            </sec>
            <sec>
               <st>
                  <p>Authors' response</p>
               </st>
               <p><it>In the revised manuscript, the criteria for calling potential operons are given explicitly</it>.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Reviewer 3</p>
            </st>
            <sec>
               <st>
                  <p>Chris Ponting, Oxford University</p>
               </st>
               <p>Makarova et al. have undertaken a thorough and illuminating analysis of prokaryotic Argonaute homologs. Their analysis consists first of detailed sequence analysis of PIWI domain homologs followed by investigation of putative operons. The manuscript ends with a nice demonstration that pAgo genomic regions are significantly enriched for phage defense genes. This allows them to pose an important and testable hypothesis which provides the major contribution of this paper. The manuscript is well written and its analyses are sound.</p>
            </sec>
         </sec>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>KSM, YIW and EVK are supported by intramural funds of the DHHS (National Library of Medicine, National Institutes of Health).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>RNAi: an ever-growing puzzle</p>
            </title>
            <aug>
               <au>
                  <snm>Denli</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Hannon</snm>
                  <fnm>GJ</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>2003</pubdate>
            <volume>28</volume>
            <issue>4</issue>
            <fpage>196</fpage>
            <lpage>201</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0968-0004(03)00058-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">12713903</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>RNA interference</p>
            </title>
            <aug>
               <au>
                  <snm>Hannon</snm>
                  <fnm>GJ</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>418</volume>
            <issue>6894</issue>
            <fpage>244</fpage>
            <lpage>251</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/418244a</pubid>
                  <pubid idtype="pmpid" link="fulltext">12110901</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Ribo-gnome: the big world of small RNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Zamore</snm>
                  <fnm>PD</fnm>
               </au>
               <au>
                  <snm>Haley</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>309</volume>
            <issue>5740</issue>
            <fpage>1519</fpage>
            <lpage>1524</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1111444</pubid>
                  <pubid idtype="pmpid" link="fulltext">16141061</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>On the road to reading the RNA-interference code</p>
            </title>
            <aug>
               <au>
                  <snm>Siomi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Siomi</snm>
                  <fnm>MC</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2009</pubdate>
            <volume>457</volume>
            <issue>7228</issue>
            <fpage>396</fpage>
            <lpage>404</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature07754</pubid>
                  <pubid idtype="pmpid" link="fulltext">19158785</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Small silencing RNAs: an expanding universe</p>
            </title>
            <aug>
               <au>
                  <snm>Ghildiyal</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zamore</snm>
                  <fnm>PD</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2009</pubdate>
            <volume>10</volume>
            <issue>2</issue>
            <fpage>94</fpage>
            <lpage>108</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg2504</pubid>
                  <pubid idtype="pmcid">2724769</pubid>
                  <pubid idtype="pmpid" link="fulltext">19148191</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Small RNAs in transcriptional gene silencing and genome defence</p>
            </title>
            <aug>
               <au>
                  <snm>Moazed</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2009</pubdate>
            <volume>457</volume>
            <issue>7228</issue>
            <fpage>413</fpage>
            <lpage>420</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature07756</pubid>
                  <pubid idtype="pmpid" link="fulltext">19158787</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>RNAi: the nuts and bolts of the RISC machine</p>
            </title>
            <aug>
               <au>
                  <snm>Filipowicz</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2005</pubdate>
            <volume>122</volume>
            <issue>1</issue>
            <fpage>17</fpage>
            <lpage>20</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2005.06.023</pubid>
                  <pubid idtype="pmpid" link="fulltext">16009129</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>siRNA and miRNA: an insight into RISCs</p>
            </title>
            <aug>
               <au>
                  <snm>Tang</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>2005</pubdate>
            <volume>30</volume>
            <issue>2</issue>
            <fpage>106</fpage>
            <lpage>114</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tibs.2004.12.007</pubid>
                  <pubid idtype="pmpid" link="fulltext">15691656</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Assembly and function of RNA silencing complexes</p>
            </title>
            <aug>
               <au>
                  <snm>Sontheimer</snm>
                  <fnm>EJ</fnm>
               </au>
            </aug>
            <source>Nat Rev Mol Cell Biol</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <issue>2</issue>
            <fpage>127</fpage>
            <lpage>138</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrm1568</pubid>
                  <pubid idtype="pmpid" link="fulltext">15654322</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>The role of RNAi and microRNAs in animal virus replication and antiviral immunity</p>
            </title>
            <aug>
               <au>
                  <snm>Umbach</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Cullen</snm>
                  <fnm>BR</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2009</pubdate>
            <volume>23</volume>
            <issue>10</issue>
            <fpage>1151</fpage>
            <lpage>1164</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gad.1793309</pubid>
                  <pubid idtype="pmpid" link="fulltext">19451215</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Viral and cellular messenger RNA targets of viral microRNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Cullen</snm>
                  <fnm>BR</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2009</pubdate>
            <volume>457</volume>
            <issue>7228</issue>
            <fpage>421</fpage>
            <lpage>425</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature07757</pubid>
                  <pubid idtype="pmpid" link="fulltext">19158788</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Origins and Mechanisms of miRNAs and siRNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Carthew</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Sontheimer</snm>
                  <fnm>EJ</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2009</pubdate>
            <volume>136</volume>
            <issue>4</issue>
            <fpage>642</fpage>
            <lpage>655</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2009.01.035</pubid>
                  <pubid idtype="pmcid">2675692</pubid>
                  <pubid idtype="pmpid" link="fulltext">19239886</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Slicer function of Drosophila Argonautes and its involvement in RISC formation</p>
            </title>
            <aug>
               <au>
                  <snm>Miyoshi</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Tsukumo</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Nagami</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Siomi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Siomi</snm>
                  <fnm>MC</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2005</pubdate>
            <volume>19</volume>
            <issue>23</issue>
            <fpage>2837</fpage>
            <lpage>2848</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gad.1370605</pubid>
                  <pubid idtype="pmcid">1315391</pubid>
                  <pubid idtype="pmpid" link="fulltext">16287716</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>A three-dimensional view of the molecular machinery of RNA interference</p>
            </title>
            <aug>
               <au>
                  <snm>Jinek</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Doudna</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2009</pubdate>
            <volume>457</volume>
            <issue>7228</issue>
            <fpage>405</fpage>
            <lpage>412</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature07755</pubid>
                  <pubid idtype="pmpid" link="fulltext">19158786</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Micros for microbes: non-coding regulatory RNAs in bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Gottesman</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>7</issue>
            <fpage>399</fpage>
            <lpage>404</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2005.05.008</pubid>
                  <pubid idtype="pmpid" link="fulltext">15913835</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Bacterial small RNA regulators</p>
            </title>
            <aug>
               <au>
                  <snm>Majdalani</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Vanderpool</snm>
                  <fnm>CK</fnm>
               </au>
               <au>
                  <snm>Gottesman</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Crit Rev Biochem Mol Biol</source>
            <pubdate>2005</pubdate>
            <volume>40</volume>
            <issue>2</issue>
            <fpage>93</fpage>
            <lpage>113</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/10409230590918702</pubid>
                  <pubid idtype="pmpid" link="fulltext">15814430</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Regulatory RNAs in bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Waters</snm>
                  <fnm>LS</fnm>
               </au>
               <au>
                  <snm>Storz</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2009</pubdate>
            <volume>136</volume>
            <issue>4</issue>
            <fpage>615</fpage>
            <lpage>628</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2009.01.043</pubid>
                  <pubid idtype="pmpid" link="fulltext">19239884</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Global analysis of small RNA and mRNA targets of Hfq</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wassarman</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Rosenow</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Tjaden</snm>
                  <fnm>BC</fnm>
               </au>
               <au>
                  <snm>Storz</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gottesman</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2003</pubdate>
            <volume>50</volume>
            <issue>4</issue>
            <fpage>1111</fpage>
            <lpage>1124</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.2003.03734.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">14622403</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Deep sequencing analysis of small noncoding RNA and mRNA targets of the global post-transcriptional regulator, Hfq</p>
            </title>
            <aug>
               <au>
                  <snm>Sittka</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lucchini</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Papenfort</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sharma</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Rolle</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Binnewies</snm>
                  <fnm>TT</fnm>
               </au>
               <au>
                  <snm>Hinton</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Vogel</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>PLoS Genet</source>
            <pubdate>2008</pubdate>
            <volume>4</volume>
            <issue>8</issue>
            <fpage>e1000163</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pgen.1000163</pubid>
                  <pubid idtype="pmcid">2515195</pubid>
                  <pubid idtype="pmpid" link="fulltext">18725932</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Identification of 86 candidates for small non-messenger RNAs from the archaeon Archaeoglobus fulgidus</p>
            </title>
            <aug>
               <au>
                  <snm>Tang</snm>
                  <fnm>TH</fnm>
               </au>
               <au>
                  <snm>Bachellerie</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Rozhdestvensky</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Bortolin</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Huber</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Drungowski</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Elge</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Brosius</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Huttenhofer</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <issue>11</issue>
            <fpage>7536</fpage>
            <lpage>7541</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.112047299</pubid>
                  <pubid idtype="pmcid">124276</pubid>
                  <pubid idtype="pmpid" link="fulltext">12032318</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>RNA antitoxins</p>
            </title>
            <aug>
               <au>
                  <snm>Gerdes</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Wagner</snm>
                  <fnm>EG</fnm>
               </au>
            </aug>
            <source>Curr Opin Microbiol</source>
            <pubdate>2007</pubdate>
            <volume>10</volume>
            <issue>2</issue>
            <fpage>117</fpage>
            <lpage>124</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.mib.2007.03.003</pubid>
                  <pubid idtype="pmpid" link="fulltext">17376733</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>A putative RNA-interference-based immune system in prokaryotes: computational analysis of the predicted enzymatic machinery, functional analogies with eukaryotic RNAi, and hypothetical mechanisms of action</p>
            </title>
            <aug>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Grishin</snm>
                  <fnm>NV</fnm>
               </au>
               <au>
                  <snm>Shabalina</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Biol Direct</source>
            <pubdate>2006</pubdate>
            <volume>1</volume>
            <issue>1</issue>
            <fpage>7</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1745-6150-1-7</pubid>
                  <pubid idtype="pmcid">1462988</pubid>
                  <pubid idtype="pmpid" link="fulltext">16545108</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>CRISPR &#8211; a widespread system that provides acquired resistance against phages in bacteria and archaea</p>
            </title>
            <aug>
               <au>
                  <snm>Sorek</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kunin</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Hugenholtz</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nat Rev Microbiol</source>
            <pubdate>2008</pubdate>
            <volume>6</volume>
            <issue>3</issue>
            <fpage>181</fpage>
            <lpage>186</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrmicro1793</pubid>
                  <pubid idtype="pmpid" link="fulltext">18157154</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>CRISPR interference limits horizontal gene transfer in staphylococci by targeting DNA</p>
            </title>
            <aug>
               <au>
                  <snm>Marraffini</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Sontheimer</snm>
                  <fnm>EJ</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2008</pubdate>
            <volume>322</volume>
            <issue>5909</issue>
            <fpage>1843</fpage>
            <lpage>1845</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1165771</pubid>
                  <pubid idtype="pmcid">2695655</pubid>
                  <pubid idtype="pmpid" link="fulltext">19095942</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>A DNA repair system specific for thermophilic Archaea and bacteria predicted by genomic context analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Grishin</snm>
                  <fnm>NV</fnm>
               </au>
               <au>
                  <snm>Rogozin</snm>
                  <fnm>IB</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <issue>2</issue>
            <fpage>482</fpage>
            <lpage>496</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/30.2.482</pubid>
                  <pubid idtype="pmcid">99818</pubid>
                  <pubid idtype="pmpid" link="fulltext">11788711</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Structural Basis for DNase Activity of a Conserved Protein Implicated in CRISPR-Mediated Genome Defense</p>
            </title>
            <aug>
               <au>
                  <snm>Wiedenheft</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Jinek</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Coyle</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Ma</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Doudna</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Structure</source>
            <pubdate>2009</pubdate>
            <volume>17</volume>
            <issue>6</issue>
            <fpage>904</fpage>
            <lpage>912</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.str.2009.03.019</pubid>
                  <pubid idtype="pmpid" link="fulltext">19523907</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Small CRISPR RNAs guide antiviral defense in prokaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Brouns</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Jore</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Lundgren</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Westra</snm>
                  <fnm>ER</fnm>
               </au>
               <au>
                  <snm>Slijkhuis</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Snijders</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Dickman</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Oost</snm>
                  <mnm>van der</mnm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2008</pubdate>
            <volume>321</volume>
            <issue>5891</issue>
            <fpage>960</fpage>
            <lpage>964</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1159689</pubid>
                  <pubid idtype="pmpid" link="fulltext">18703739</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>CRISPR-based adaptive and heritable immunity in prokaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Oost</snm>
                  <mnm>van der</mnm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jore</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Westra</snm>
                  <fnm>ER</fnm>
               </au>
               <au>
                  <snm>Lundgren</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brouns</snm>
                  <fnm>SJJ</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>2009</pubdate>
            <inpress/>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">19646880</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Origins and evolution of eukaryotic RNA interference</p>
            </title>
            <aug>
               <au>
                  <snm>Shabalina</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Trends Ecol Evol</source>
            <pubdate>2008</pubdate>
            <volume>23</volume>
            <issue>10</issue>
            <fpage>578</fpage>
            <lpage>587</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tree.2008.06.005</pubid>
                  <pubid idtype="pmcid">2695246</pubid>
                  <pubid idtype="pmpid" link="fulltext">18715673</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Structure of Aquifex aeolicus argonaute highlights conformational flexibility of the PAZ domain as a potential regulator of RNA-induced silencing complex function</p>
            </title>
            <aug>
               <au>
                  <snm>Rashid</snm>
                  <fnm>UJ</fnm>
               </au>
               <au>
                  <snm>Paterok</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Koglin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gohlke</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Piehler</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2007</pubdate>
            <volume>282</volume>
            <issue>18</issue>
            <fpage>13824</fpage>
            <lpage>13832</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M608619200</pubid>
                  <pubid idtype="pmpid" link="fulltext">17130125</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Structure of the guide-strand-containing argonaute silencing complex</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Sheng</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Juranek</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tuschl</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Patel</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2008</pubdate>
            <volume>456</volume>
            <issue>7219</issue>
            <fpage>209</fpage>
            <lpage>213</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature07315</pubid>
                  <pubid idtype="pmpid" link="fulltext">18754009</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Structural basis for 5'-end-specific recognition of guide RNA by the A. fulgidus Piwi protein</p>
            </title>
            <aug>
               <au>
                  <snm>Ma</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Yuan</snm>
                  <fnm>YR</fnm>
               </au>
               <au>
                  <snm>Meister</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Pei</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Tuschl</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Patel</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>434</volume>
            <issue>7033</issue>
            <fpage>666</fpage>
            <lpage>670</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature03514</pubid>
                  <pubid idtype="pmpid" link="fulltext">15800629</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Crystal structure of Argonaute and its implications for RISC slicer activity</p>
            </title>
            <aug>
               <au>
                  <snm>Song</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Hannon</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Joshua-Tor</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>305</volume>
            <issue>5689</issue>
            <fpage>1434</fpage>
            <lpage>1437</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1102514</pubid>
                  <pubid idtype="pmpid" link="fulltext">15284453</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>The Argonautes</p>
            </title>
            <aug>
               <au>
                  <snm>Joshua-Tor</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Cold Spring Harb Symp Quant Biol</source>
            <pubdate>2006</pubdate>
            <volume>71</volume>
            <fpage>67</fpage>
            <lpage>72</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/sqb.2006.71.048</pubid>
                  <pubid idtype="pmpid" link="fulltext">17381282</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Crystal structure of A. aeolicus argonaute, a site-specific DNA-guided endoribonuclease, provides insights into RISC-mediated mRNA cleavage</p>
            </title>
            <aug>
               <au>
                  <snm>Yuan</snm>
                  <fnm>YR</fnm>
               </au>
               <au>
                  <snm>Pei</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Ma</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Kuryavyi</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Zhadina</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Meister</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>HY</fnm>
               </au>
               <au>
                  <snm>Dauter</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Tuschl</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Patel</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Mol Cell</source>
            <pubdate>2005</pubdate>
            <volume>19</volume>
            <issue>3</issue>
            <fpage>405</fpage>
            <lpage>419</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.molcel.2005.07.011</pubid>
                  <pubid idtype="pmpid" link="fulltext">16061186</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Structure of an argonaute silencing complex with a seed-containing guide DNA and target RNA duplex</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Juranek</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Sheng</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Tuschl</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Patel</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2008</pubdate>
            <volume>456</volume>
            <issue>7224</issue>
            <fpage>921</fpage>
            <lpage>926</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature07666</pubid>
                  <pubid idtype="pmpid" link="fulltext">19092929</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Crystal structure of a PIWI protein suggests mechanisms for siRNA recognition and slicer activity</p>
            </title>
            <aug>
               <au>
                  <snm>Parker</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Roe</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Barford</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>EMBO J</source>
            <pubdate>2004</pubdate>
            <volume>23</volume>
            <issue>24</issue>
            <fpage>4727</fpage>
            <lpage>4737</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.emboj.7600488</pubid>
                  <pubid idtype="pmcid">535097</pubid>
                  <pubid idtype="pmpid" link="fulltext">15565169</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Recombining the structures of HIV integrase, RuvC and RNase H</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Steitz</snm>
                  <fnm>TA</fnm>
               </au>
            </aug>
            <source>Structure</source>
            <pubdate>1995</pubdate>
            <volume>3</volume>
            <issue>2</issue>
            <fpage>131</fpage>
            <lpage>134</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0969-2126(01)00142-3</pubid>
                  <pubid idtype="pmpid">7735828</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Slicer and the argonautes</p>
            </title>
            <aug>
               <au>
                  <snm>Tolia</snm>
                  <fnm>NH</fnm>
               </au>
               <au>
                  <snm>Joshua-Tor</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Nat Chem Biol</source>
            <pubdate>2007</pubdate>
            <volume>3</volume>
            <issue>1</issue>
            <fpage>36</fpage>
            <lpage>43</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nchembio848</pubid>
                  <pubid idtype="pmpid" link="fulltext">17173028</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Structural insights into mRNA recognition from a PIWI domain-siRNA guide complex</p>
            </title>
            <aug>
               <au>
                  <snm>Parker</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Roe</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Barford</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>434</volume>
            <issue>7033</issue>
            <fpage>663</fpage>
            <lpage>666</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature03462</pubid>
                  <pubid idtype="pmpid" link="fulltext">15800628</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Guilt by association: contextual information in genome analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <issue>8</issue>
            <fpage>1074</fpage>
            <lpage>1077</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.10.8.1074</pubid>
                  <pubid idtype="pmpid" link="fulltext">10958625</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Who's your neighbor? New computational approaches for functional genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2000</pubdate>
            <volume>18</volume>
            <issue>6</issue>
            <fpage>609</fpage>
            <lpage>613</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/76443</pubid>
                  <pubid idtype="pmpid" link="fulltext">10835597</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Predicting protein function by genomic context: quantitative evaluation and qualitative inferences</p>
            </title>
            <aug>
               <au>
                  <snm>Huynen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lathe</snm>
                  <fnm>W</fnm>
                  <suf>3rd</suf>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <issue>8</issue>
            <fpage>1204</fpage>
            <lpage>1210</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.10.8.1204</pubid>
                  <pubid idtype="pmcid">310926</pubid>
                  <pubid idtype="pmpid" link="fulltext">10958638</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Sir2: an NAD-dependent histone deacetylase that connects chromatin silencing, metabolism, and aging</p>
            </title>
            <aug>
               <au>
                  <snm>Imai</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>FB</fnm>
               </au>
               <au>
                  <snm>Marciniak</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>McVey</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>PU</fnm>
               </au>
               <au>
                  <snm>Guarente</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Cold Spring Harb Symp Quant Biol</source>
            <pubdate>2000</pubdate>
            <volume>65</volume>
            <fpage>297</fpage>
            <lpage>302</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/sqb.2000.65.297</pubid>
                  <pubid idtype="pmpid">12760043</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Sirtuins: Sir2-related NAD-dependent protein deacetylases</p>
            </title>
            <aug>
               <au>
                  <snm>North</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Verdin</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>5</issue>
            <fpage>224</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2004-5-5-224</pubid>
                  <pubid idtype="pmcid">416462</pubid>
                  <pubid idtype="pmpid" link="fulltext">15128440</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Sirtuin 1, stem cells, aging, and stem cell aging</p>
            </title>
            <aug>
               <au>
                  <snm>Mantel</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Broxmeyer</snm>
                  <fnm>HE</fnm>
               </au>
            </aug>
            <source>Curr Opin Hematol</source>
            <pubdate>2008</pubdate>
            <volume>15</volume>
            <issue>4</issue>
            <fpage>326</fpage>
            <lpage>331</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1097/MOH.0b013e3283043819</pubid>
                  <pubid idtype="pmcid">2653857</pubid>
                  <pubid idtype="pmpid" link="fulltext">18536570</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Conserved metabolic regulatory functions of sirtuins</p>
            </title>
            <aug>
               <au>
                  <snm>Schwer</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Verdin</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Cell Metab</source>
            <pubdate>2008</pubdate>
            <volume>7</volume>
            <issue>2</issue>
            <fpage>104</fpage>
            <lpage>112</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cmet.2007.11.006</pubid>
                  <pubid idtype="pmpid" link="fulltext">18249170</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>The structural basis of sirtuin substrate affinity</p>
            </title>
            <aug>
               <au>
                  <snm>Cosgrove</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Bever</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Avalos</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Muhammad</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Wolberger</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Biochemistry</source>
            <pubdate>2006</pubdate>
            <volume>45</volume>
            <issue>24</issue>
            <fpage>7511</fpage>
            <lpage>7521</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/bi0526332</pubid>
                  <pubid idtype="pmpid" link="fulltext">16768447</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Structure and substrate binding properties of cobB, a Sir2 homolog protein deacetylase from Escherichia coli</p>
            </title>
            <aug>
               <au>
                  <snm>Zhao</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Chai</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Marmorstein</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2004</pubdate>
            <volume>337</volume>
            <issue>3</issue>
            <fpage>731</fpage>
            <lpage>741</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jmb.2004.01.060</pubid>
                  <pubid idtype="pmpid" link="fulltext">15019790</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Comparative genomics of the FtsK-HerA superfamily of pumping ATPases: implications for the origins of chromosome segregation, cell division and viral capsid packaging</p>
            </title>
            <aug>
               <au>
                  <snm>Iyer</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>17</issue>
            <fpage>5260</fpage>
            <lpage>5279</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkh828</pubid>
                  <pubid idtype="pmcid">521647</pubid>
                  <pubid idtype="pmpid" link="fulltext">15466593</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Structure of the histone deacetylase SIRT2</p>
            </title>
            <aug>
               <au>
                  <snm>Finnin</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Donigian</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Pavletich</snm>
                  <fnm>NP</fnm>
               </au>
            </aug>
            <source>Nat Struct Biol</source>
            <pubdate>2001</pubdate>
            <volume>8</volume>
            <issue>7</issue>
            <fpage>621</fpage>
            <lpage>625</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/89668</pubid>
                  <pubid idtype="pmpid" link="fulltext">11427894</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Identification of novel restriction endonuclease-like fold families among hypothetical proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Kinch</snm>
                  <fnm>LN</fnm>
               </au>
               <au>
                  <snm>Ginalski</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Rychlewski</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Grishin</snm>
                  <fnm>NV</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <issue>11</issue>
            <fpage>3598</fpage>
            <lpage>3605</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gki676</pubid>
                  <pubid idtype="pmcid">1157100</pubid>
                  <pubid idtype="pmpid" link="fulltext">15972856</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Realm of PD-(D/E)XK nuclease superfamily revisited: detection of novel families with modified transitive meta profile searches</p>
            </title>
            <aug>
               <au>
                  <snm>Knizewski</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kinch</snm>
                  <fnm>LN</fnm>
               </au>
               <au>
                  <snm>Grishin</snm>
                  <fnm>NV</fnm>
               </au>
               <au>
                  <snm>Rychlewski</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Ginalski</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>BMC Struct Biol</source>
            <pubdate>2007</pubdate>
            <volume>7</volume>
            <fpage>40</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1472-6807-7-40</pubid>
                  <pubid idtype="pmcid">1913061</pubid>
                  <pubid idtype="pmpid" link="fulltext">17584917</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Restriction endonucleases: classification, properties, and applications</p>
            </title>
            <aug>
               <au>
                  <snm>Williams</snm>
                  <fnm>RJ</fnm>
               </au>
            </aug>
            <source>Mol Biotechnol</source>
            <pubdate>2003</pubdate>
            <volume>23</volume>
            <issue>3</issue>
            <fpage>225</fpage>
            <lpage>243</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1385/MB:23:3:225</pubid>
                  <pubid idtype="pmpid">12665693</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>The domains of death: evolution of the apoptosis machinery</p>
            </title>
            <aug>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Dixit</snm>
                  <fnm>VM</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>1999</pubdate>
            <volume>24</volume>
            <issue>2</issue>
            <fpage>47</fpage>
            <lpage>53</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0968-0004(98)01341-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">10098397</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Origin and evolution of eukaryotic apoptosis: the bacterial connection</p>
            </title>
            <aug>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Cell Death Differ</source>
            <pubdate>2002</pubdate>
            <volume>9</volume>
            <issue>4</issue>
            <fpage>394</fpage>
            <lpage>404</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.cdd.4400991</pubid>
                  <pubid idtype="pmpid" link="fulltext">11965492</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>The STIR-domain superfamily in signal transduction, development and immunity</p>
            </title>
            <aug>
               <au>
                  <snm>Novatchkova</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Leibbrandt</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Werzowa</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Neubuser</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Eisenhaber</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>2003</pubdate>
            <volume>28</volume>
            <issue>5</issue>
            <fpage>226</fpage>
            <lpage>229</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0968-0004(03)00067-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">12765832</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Signalling of toll-like receptors</p>
            </title>
            <aug>
               <au>
                  <snm>Brikos</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>O'Neill</snm>
                  <fnm>LA</fnm>
               </au>
            </aug>
            <source>Handb Exp Pharmacol</source>
            <pubdate>2008</pubdate>
            <issue>183</issue>
            <fpage>21</fpage>
            <lpage>50</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">18071653</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Building an immune system from nine domains</p>
            </title>
            <aug>
               <au>
                  <snm>Palsson-McDermott</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>O'Neill</snm>
                  <fnm>LA</fnm>
               </au>
            </aug>
            <source>Biochem Soc Trans</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <issue>Pt 6</issue>
            <fpage>1437</fpage>
            <lpage>1444</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1042/BST0351437</pubid>
                  <pubid idtype="pmpid" link="fulltext">18031241</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>The functions of plant TIR domains</p>
            </title>
            <aug>
               <au>
                  <snm>Burch-Smith</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Dinesh-Kumar</snm>
                  <fnm>SP</fnm>
               </au>
            </aug>
            <source>Sci STKE</source>
            <pubdate>2007</pubdate>
            <volume>2007</volume>
            <issue>401</issue>
            <fpage>pe46</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/stke.4012007pe46</pubid>
                  <pubid idtype="pmpid" link="fulltext">17726177</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>DNA-binding proteins and evolution of transcription regulation in the archaea</p>
            </title>
            <aug>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1999</pubdate>
            <volume>27</volume>
            <issue>23</issue>
            <fpage>4658</fpage>
            <lpage>4670</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/27.23.4658</pubid>
                  <pubid idtype="pmcid">148756</pubid>
                  <pubid idtype="pmpid" link="fulltext">10556324</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>Gene acquisition at the insertion site for SCCmec, the genomic island conferring methicillin resistance in Staphylococcus aureus</p>
            </title>
            <aug>
               <au>
                  <snm>Noto</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Kreiswirth</snm>
                  <fnm>BN</fnm>
               </au>
               <au>
                  <snm>Monk</snm>
                  <fnm>AB</fnm>
               </au>
               <au>
                  <snm>Archer</snm>
                  <fnm>GL</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2008</pubdate>
            <volume>190</volume>
            <issue>4</issue>
            <fpage>1276</fpage>
            <lpage>1283</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1128/JB.01128-07</pubid>
                  <pubid idtype="pmcid">2238224</pubid>
                  <pubid idtype="pmpid" link="fulltext">18083809</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Combining evidence using p-values: application to sequence homology searches</p>
            </title>
            <aug>
               <au>
                  <snm>Bailey</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Gribskov</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <issue>1</issue>
            <fpage>48</fpage>
            <lpage>54</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/14.1.48</pubid>
                  <pubid idtype="pmpid" link="fulltext">9520501</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>New insights in the molecular biology and physiology of Streptococcus thermophilus revealed by comparative genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Hols</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hancy</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Fontaine</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Grossiord</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Prozzi</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Leblond-Bourget</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Decaris</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bolotin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Delorme</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Dusko Ehrlich</snm>
                  <fnm>S</fnm>
               </au>
               <etal/>
            </aug>
            <source>FEMS Microbiol Rev</source>
            <pubdate>2005</pubdate>
            <volume>29</volume>
            <issue>3</issue>
            <fpage>435</fpage>
            <lpage>463</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.femsre.2005.04.008</pubid>
                  <pubid idtype="pmpid">16125007</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>The ancient Virus World and evolution of cells</p>
            </title>
            <aug>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Senkevich</snm>
                  <fnm>TG</fnm>
               </au>
               <au>
                  <snm>Dolja</snm>
                  <fnm>VV</fnm>
               </au>
            </aug>
            <source>Biol Direct</source>
            <pubdate>2006</pubdate>
            <volume>1</volume>
            <fpage>29</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1745-6150-1-29</pubid>
                  <pubid idtype="pmcid">1594570</pubid>
                  <pubid idtype="pmpid" link="fulltext">16984643</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Schaffer</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <issue>17</issue>
            <fpage>3389</fpage>
            <lpage>3402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/25.17.3389</pubid>
                  <pubid idtype="pmcid">146917</pubid>
                  <pubid idtype="pmpid" link="fulltext">9254694</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>The HHpred interactive server for protein homology detection and structure prediction</p>
            </title>
            <aug>
               <au>
                  <snm>Soding</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Biegert</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lupas</snm>
                  <fnm>AN</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <issue>33 Web Server</issue>
            <fpage>W244</fpage>
            <lpage>248</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gki408</pubid>
                  <pubid idtype="pmcid">1160169</pubid>
                  <pubid idtype="pmpid" link="fulltext">15980461</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>PROMALS3D: a tool for multiple protein sequence and structure alignments</p>
            </title>
            <aug>
               <au>
                  <snm>Pei</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>BH</fnm>
               </au>
               <au>
                  <snm>Grishin</snm>
                  <fnm>NV</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2008</pubdate>
            <volume>36</volume>
            <issue>7</issue>
            <fpage>2295</fpage>
            <lpage>2300</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkn072</pubid>
                  <pubid idtype="pmcid">2367709</pubid>
                  <pubid idtype="pmpid" link="fulltext">18287115</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>MUSCLE: multiple sequence alignment with high accuracy and high throughput</p>
            </title>
            <aug>
               <au>
                  <snm>Edgar</snm>
                  <fnm>RC</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>5</issue>
            <fpage>1792</fpage>
            <lpage>1797</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkh340</pubid>
                  <pubid idtype="pmcid">390337</pubid>
                  <pubid idtype="pmpid" link="fulltext">15034147</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B70">
            <title>
               <p>The PSIPRED protein structure prediction server</p>
            </title>
            <aug>
               <au>
                  <snm>McGuffin</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Bryson</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>DT</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <issue>4</issue>
            <fpage>404</fpage>
            <lpage>405</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/16.4.404</pubid>
                  <pubid idtype="pmpid" link="fulltext">10869041</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B71">
            <title>
               <p>MOLPHY: Programs for molecular phylogenetics</p>
            </title>
            <aug>
               <au>
                  <snm>Adachi</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hasegawa</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Computer Science Monographs 27</source>
            <publisher>Tokyo: Institute of Statistical Mathematics</publisher>
            <pubdate>1992</pubdate>
         </bibl>
         <bibl id="B72">
            <title>
               <p>Inferring phylogenies from protein sequences by parsimony, distance, and likelihood methods</p>
            </title>
            <aug>
               <au>
                  <snm>Felsenstein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Methods Enzymol</source>
            <pubdate>1996</pubdate>
            <volume>266</volume>
            <fpage>418</fpage>
            <lpage>427</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid">8743697</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B73">
            <title>
               <p>NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Pruitt</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Maglott</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <issue>35 Database</issue>
            <fpage>D61</fpage>
            <lpage>65</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkl842</pubid>
                  <pubid idtype="pmcid">1716718</pubid>
                  <pubid idtype="pmpid" link="fulltext">17130148</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B74">
            <title>
               <p>Clusters of orthologous genes for 41 archaeal genomes and implications for evolutionary genomics of archaea</p>
            </title>
            <aug>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Sorokin</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Novichkov</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Biol Direct</source>
            <pubdate>2007</pubdate>
            <volume>2</volume>
            <fpage>33</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1745-6150-2-33</pubid>
                  <pubid idtype="pmcid">2222616</pubid>
                  <pubid idtype="pmpid" link="fulltext">18042280</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B75">
            <title>
               <p>REBASE &#8211; enzymes and genes for DNA restriction and modification</p>
            </title>
            <aug>
               <au>
                  <snm>Roberts</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Vincze</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Posfai</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Macelis</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <issue>35 Database</issue>
            <fpage>D269</fpage>
            <lpage>270</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkl891</pubid>
                  <pubid idtype="pmcid">1899104</pubid>
                  <pubid idtype="pmpid" link="fulltext">17202163</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B76">
            <title>
               <p>Phage abortive infection in lactococci: variations on a theme</p>
            </title>
            <aug>
               <au>
                  <snm>Chopin</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Chopin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bidnenko</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Curr Opin Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>8</volume>
            <issue>4</issue>
            <fpage>473</fpage>
            <lpage>479</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.mib.2005.06.006</pubid>
                  <pubid idtype="pmpid" link="fulltext">15979388</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B77">
            <title>
               <p>Comprehensive comparative-genomic analysis of Type 2 toxin-antitoxin systems and related mobile stress response systems in prokaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Biol Direct</source>
            <pubdate>2009</pubdate>
            <volume>4</volume>
            <issue>1</issue>
            <fpage>19</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1745-6150-4-19</pubid>
                  <pubid idtype="pmcid">2701414</pubid>
                  <pubid idtype="pmpid" link="fulltext">19493340</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>

