Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Aug;10(8):2020-32.
doi: 10.1038/ismej.2015.249. Epub 2016 Feb 9.

High-resolution phylogenetic microbial community profiling

Affiliations

High-resolution phylogenetic microbial community profiling

Esther Singer et al. ISME J. 2016 Aug.

Abstract

Over the past decade, high-throughput short-read 16S rRNA gene amplicon sequencing has eclipsed clone-dependent long-read Sanger sequencing for microbial community profiling. The transition to new technologies has provided more quantitative information at the expense of taxonomic resolution with implications for inferring metabolic traits in various ecosystems. We applied single-molecule real-time sequencing for microbial community profiling, generating full-length 16S rRNA gene sequences at high throughput, which we propose to name PhyloTags. We benchmarked and validated this approach using a defined microbial community. When further applied to samples from the water column of meromictic Sakinaw Lake, we show that while community structures at the phylum level are comparable between PhyloTags and Illumina V4 16S rRNA gene sequences (iTags), variance increases with community complexity at greater water depths. PhyloTags moreover allowed less ambiguous classification. Last, a platform-independent comparison of PhyloTags and in silico generated partial 16S rRNA gene sequences demonstrated significant differences in community structure and phylogenetic resolution across multiple taxonomic levels, including a severe underestimation in the abundance of specific microbial genera involved in nitrogen and methane cycling across the Lake's water column. Thus, PhyloTags provide a reliable adjunct or alternative to cost-effective iTags, enabling more accurate phylogenetic resolution of microbial communities and predictions on their metabolic potential.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Workflow of the PhyloTag sequence generation and cluster analysis pipeline with results from one of the five mock community replicate data sets (more details in Supplementary Figure 1). A simulated FL 16S rRNA gene read data set generated from the 23 genomes of the selected bacterial species was used to optimize the clustering steps in the pipeline (see Materials and methods section). Detailed processing steps for iTag and shotgun sequences are illustrated in Supplementary Figure 1.
Figure 2
Figure 2
Analysis of the mock community profiles. (a) Abundance profiles of the mock community as represented by PhyloTags (pooled from all five replicates), PacBio shotgun sequences and V4 iTags. Nocardiopsis dassonvillei, which was added at very low relative abundance, was exclusively detected in the PacBio shotgun data set. Additional contaminant OTUs were found only in the V4 iTags (Supplementary Table 3). (b) Spearman's rank correlation coefficients and corresponding P-values were calculated to evaluate the strength of relationships between various sequence data sets. (c) Principal coordinate analysis (PCoA) of microbial community structures at various Sakinaw Lake depths according to PhyloTags and iTags. Mean PCoA distances between iTag and PhyloTag pairs of the same depth are stated within parentheses in the legend. The inset shows depths 50 –120 m reanalyzed for higher resolution.
Figure 3
Figure 3
Conceptual representation of the 16S rRNA gene sequence with conserved (green) and hypervariable (blue) regions. Pink strips represent the abundance of mutations in respective variable regions. Target selection in amplicon sequencing determines community fingerprints. (a) The 16S rRNA gene variability is not homogeneous across taxonomic groups and subregions of the FL sequence. For example, Salmonella spp. are 97.4% identical across the FL 16S rRNA gene sequence, but are 100% identical across the V4 region. (b) In other instances, exclusively considering the hypervariable V4 region may lead to an overestimation of community diversity because mutations may accumulate here more than across the entire 16S rRNA gene.
Figure 4
Figure 4
Community composition analysis of Sakinaw Lake depth profile at phylum and genus level represented by PhyloTag and in silico generated V4 sequences. (a) Percentage of FL PhyloTag sequences by phylum with ambiguous classifications according to their in silico generated V4 region and vice versa. Phyla with ambiguous sequences (V4: ⩾5.0% of their total; FL: ⩾1.0% of their total) are reported in this figure. Relative sequence abundance of phyla in the total community based on the number of sequences is stated above bars. (b) Community composition analysis of Sakinaw lake depth profile at genus level and arranged by phylum (with >1% relative abundance). Color pairs denote samples of the same depth represented by FL and V4 sequences. Bubble sizes indicate read abundance of individual genera. Several OTUs showing largest discrepancy between V4 and FL abundances are highlighted by boxes (solid gray: more FL>V4; dotted black: V4>FL). Numbered boxes around bordered bubbles represent genera Methylocaldum (1), uncultivated genus within the Nitrospiraceae (2), Bacillus (3) and Methylotenera (4). Biological importance of these selected genera is discussed in the text. Examples of other genera with >1000 more FL than V4 sequences and >200 more V4 than FL sequences are depicted by bordered bubbles and boxes. Ecological significance of these genera in Sakinaw Lake was difficult to predict, for example, owing to the lack of reference genomes.

Similar articles

Cited by

References

    1. Babauta JT, Atci E, Ha PT, Lindemann SR, Ewing T, Call DR et al. (2014). Localized electron transfer rates and microelectrode-based enrichment of microbial communities within a phototrophic microbial mat. Front Microbiol 5: 1–2. - PMC - PubMed
    1. Bolhuis H, Stal LJ. (2011). Analysis of bacterial and archaeal diversity in coastal microbial mats using massive parallel 16S rRNA gene tag sequencing. ISME J 5: 1701–1712. - PMC - PubMed
    1. Bowman JP, Sly LI, Nichols DS, Hayward AS. (1993). Revised taxonomy of the methanotrophs: description of Methylobacter gen. nov., emendation of Methylococcus, validation of Methylosinus and Methylocystis species, and a proposal that the family Methtylococcaceae includes only the group I methanotrophs. Int J Syst Bacteriol 43: 735–753.
    1. BBMap. Available at: http://bbmap.sourceforge.net/ (accessed October 2015).
    1. Caporaso JG, Lauber CL, Walters WA, Berg-Lyons D, Huntley J, Fierer N et al. (2012). Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms. ISME J 6: 1621–1624. - PMC - PubMed

Substances