Biology Direct

official impact factor 3.74

Open Access Highly Access Research

Massive comparative genomic analysis reveals convergent evolution of specialized bacteria

Vicky Merhej1, Manuela Royer-Carenzi2, Pierre Pontarotti2 and Didier Raoult1*

Author Affiliations

1 Unit for Research on Emergent and Tropical Infectious Diseases (URMITE), CNRS-IRD UMR 6236 IFR48, Faculty of Medicine, University of the Mediterranean, Marseilles, France

2 Evolutionary biology and modeling, LATP UMR CNRS 6632 FR 3098 IFR 48, University of Provence, Marseilles, France

For all author emails, please log on.

Biology Direct 2009, 4:13 doi:10.1186/1745-6150-4-13

Published: 10 April 2009

Additional files

Additional file 1:

Genome information data.

Format: PDF Size: 317KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 2:

Trends between genome size and gene count in different bacterial phyla. Columns correspond to the genome size (left axis), and the points correspond to the gene count (right axis). Red and blue colours correspond to host dependent (HD) and free-living (FL) bacteria, respectively. Other phyla: Aquifex, Thermotoga, Chlorobium, Dehalococcoides, Deinococci, Thermus, Fusobacteria, and planctomyces. Taxa are listed in Additional file 1.

Format: PDF Size: 18KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 3:

Schematic representation of strategy used to identify essential and lost COGs. FL corresponds to free-living and OI corresponds to obligate intracellular bacteria.

Format: PDF Size: 6KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 4:

Comparison of the sets of lost COGs. Venn diagram shows the number of shared and group-specific COGs lost in obligate intracellular bacteria and conserved in their close free-living relatives in alpha- and gamma-proteobacteria. Mycobacterium leprae lost only 63% of these COGs, however its genome content is rapidly degrading according to the high number of pseudogene [27].

Format: PDF Size: 3KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 5:

Distribution of the common loss proportion, simulated from 2000 re-samplings. The number of COGs lost by each phylum is fixed and equals the observed loss numbers. Red dashed lines represent the 2.5% and 97.5% quantiles and blue line indicates the observed common loss proportion. 100 COGs were lost in concert among the obligate intracellular bacteria over the 969 COGs that are lost by at least one of the studied bacteria, which give a proportion of 0.1032 (100/969). The number of common COGs lost was significantly more important than expected if the loss were random (Randomization test, n = 2000, p < 10-6).

Format: PDF Size: 15KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 6:

Probability of losing at least 100 COGs in common Curve describing the probability of losing at least 100 COGs in common with respect to the theoretical probability of random loss (p0). The red point corresponds to the probability-threshold under which the hypothesis of independent loss between phyla is rejected (with α = 5%).

Format: PDF Size: 11KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 7:

Mean number of genes per Mb encoding for intracellular trafficking. *p-values < 0.05 are shown in bold to indicate significant differences between mutualists and parasites (Wilcoxon rank sum test).

Format: DOC Size: 45KB Download file

This file can be viewed with: Microsoft Word Viewer

Open Data

Additional file 8:

COGs involved in transcription.

Format: DOC Size: 260KB Download file

This file can be viewed with: Microsoft Word Viewer

Open Data

Additional file 9:

COGs involved in intracellular trafficking and secretion.

Format: DOC Size: 139KB Download file

This file can be viewed with: Microsoft Word Viewer

Open Data

Additional file 10:

Distribution of COGs lost by small free-living bacteria among the set of 100 COGs lost in common by obligate intracellular bacteria and among all other COGs.

Format: PDF Size: 4KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 11:

Probability of losing at least 100 COGs in common among the 969 COGs lost at least by one obligate intracellular bacterium. Curve describing the probability of losing at least 100 COGs in common with respect to the theoretical probability of random loss (p0). The red point corresponds to the probability-threshold under which the hypothesis of independent loss between phyla is rejected (with α = 5%).

Format: PDF Size: 11KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data