166 http://www.weizhongli-lab.org/rssgen/?cat=166 Analyses of the Stability and Core Taxonomic Memberships of the Human Microbiome. http://www.weizhongli-lab.org/cms/index.php?id=629&no_cache=1&tx_ttnews[tt_news]=3169/ Li K, Bihan M, et al.|-|-| PloS One. 2013 May 01; 8: e63139.|-|-| Analyses of the taxonomic diversity associated with the human microbiome continue to be an area of great importance. The study of the nature and extent of the commonly shared taxa ("core"), versus those less prevalent, establishes a baseline for comparing healthy and diseased groups by quantifying the variation among people, across body habitats and over time. The National Institutes of Health (NIH) sponsored Human Microbiome Project (HMP) has provided an unprecedented opportunity to examine and better define what constitutes the taxonomic core within and across body habitats and individuals through pyrosequencing-based profiling of 16S rRNA gene sequences from oral, skin, distal gut (stool), and vaginal body habitats from over 200 healthy individuals. A two-parameter model is introduced to quantitatively identify the core taxonomic members of each body habitat's microbiota across the healthy cohort. Using only cutoffs for taxonomic ubiquity and abundance, core taxonomic members were identified for each of the 18 body habitats and also for the 4 higher-level body regions. Although many microbes were shared at low abundance, they exhibited a relatively continuous spread in both their abundance and ubiquity, as opposed to a more discretized separation. The numbers of core taxa members in the body regions are comparatively small and stable, reflecting the relatively high, but conserved, interpersonal variability within the cohort. Core sizes increased across the body regions in the order of: vagina, skin, stool, and oral cavity. A number of "minor" oral taxonomic core were also identified by their majority presence across the cohort, but with relatively low and stable abundances. A method for quantifying the difference between two cohorts was introduced and applied to samples collected on a second visit, revealing that over time, the oral, skin, and stool body regions tended to be more transient in their taxonomic structure than the vaginal body region. Wed, 01 May 2013 00:00:00 -0400 Analyses of the Microbial Diversity Across the Human Microbiome. http://www.weizhongli-lab.org/cms/index.php?id=629&no_cache=1&tx_ttnews[tt_news]=3205/ Li K, Bihan M, et al.|-|-| PloS One. 2012 Jul 01; 7: e32118.|-|-| Analysis of human body microbial diversity is fundamental to understanding community structure, biology and ecology. The National Institutes of Health Human Microbiome Project (HMP) has provided an unprecedented opportunity to examine microbial diversity within and across body habitats and individuals through pyrosequencing-based profiling of 16 S rRNA gene sequences (16 S) from habits of the oral, skin, distal gut, and vaginal body regions from over 200 healthy individuals enabling the application of statistical techniques. In this study, two approaches were applied to elucidate the nature and extent of human microbiome diversity. First, bootstrap and parametric curve fitting techniques were evaluated to estimate the maximum number of unique taxa, S(max), and taxa discovery rate for habitats across individuals. Next, our results demonstrated that the variation of diversity within low abundant taxa across habitats and individuals was not sufficiently quantified with standard ecological diversity indices. This impact from low abundant taxa motivated us to introduce a novel rank-based diversity measure, the Tail statistic, ("?"), based on the standard deviation of the rank abundance curve if made symmetric by reflection around the most abundant taxon. Due to ?'s greater sensitivity to low abundant taxa, its application to diversity estimation of taxonomic units using taxonomic dependent and independent methods revealed a greater range of values recovered between individuals versus body habitats, and different patterns of diversity within habitats. The greatest range of ? values within and across individuals was found in stool, which also exhibited the most undiscovered taxa. Oral and skin habitats revealed variable diversity patterns, while vaginal habitats were consistently the least diverse. Collectively, these results demonstrate the importance, and motivate the introduction, of several visualization and analysis methods tuned specifically for next-generation sequence data, further revealing that low abundant taxa serve as an important reservoir of genetic diversity in the human microbiome. Sun, 01 Jul 2012 00:00:00 -0400 A Framework for Human Microbiome Research. http://www.weizhongli-lab.org/cms/index.php?id=629&no_cache=1&tx_ttnews[tt_news]=3655/ Meth� BA, Nelson KE, et al.|-|-| Nature. 2012 Jun 14; 486: 215-21.|-|-| A variety of microbial communities and their genes (the microbiome) exist throughout the human body, with fundamental roles in human health and disease. The National Institutes of Health (NIH)-funded Human Microbiome Project Consortium has established a population-scale framework to develop metagenomic protocols, resulting in a broad range of quality-controlled resources and data including standardized methods for creating, processing and interpreting distinct types of high-throughput metagenomic data available to the scientific community. Here we present resources from a population of 242 healthy adults sampled at 15 or 18 body sites up to three times, which have generated 5,177 microbial taxonomic profiles from 16S ribosomal RNA genes and over 3.5 terabases of metagenomic sequence so far. In parallel, approximately 800 reference strains isolated from the human body have been sequenced. Collectively, these data represent the largest resource describing the abundance and variety of the human microbiome, while providing a framework for current and future studies. Thu, 14 Jun 2012 00:00:00 -0400 Structure, Function and Diversity of the Healthy Human Microbiome. http://www.weizhongli-lab.org/cms/index.php?id=629&no_cache=1&tx_ttnews[tt_news]=3656/ Huttenhower C, Gevers D, et al.|-|-| Nature. 2012 Jun 14; 486: 207-14.|-|-| Studies of the human microbiome have revealed that even healthy individuals differ remarkably in the microbes that occupy habitats such as the gut, skin and vagina. Much of this diversity remains unexplained, although diet, environment, host genetics and early microbial exposure have all been implicated. Accordingly, to characterize the ecology of human-associated microbial communities, the Human Microbiome Project has analysed the largest cohort and set of distinct, clinically relevant body habitats so far. We found the diversity and abundance of each habitat's signature microbes to vary widely even among healthy subjects, with strong niche specialization both within and among individuals. The project encountered an estimated 81-99% of the genera, enzyme families and community configurations occupied by the healthy Western microbiome. Metagenomic carriage of metabolic pathways was stable among individuals despite variation in community structure, and ethnic/racial background proved to be one of the strongest associations of both pathways and microbes with clinical metadata. These results thus delineate the range of structural and functional configurations normal in the microbial communities of a healthy population, enabling future characterization of the epidemiology, ecology and translational applications of the human microbiome. Thu, 14 Jun 2012 00:00:00 -0400 Evaluation of 16S RDNA-based Community Profiling for Human Microbiome Research. http://www.weizhongli-lab.org/cms/index.php?id=629&no_cache=1&tx_ttnews[tt_news]=3657/ Ward DV, Gevers D, et al.|-|-| PloS One. 2012 Apr 01; 7: e39315.|-|-| The Human Microbiome Project will establish a reference data set for analysis of the microbiome of healthy adults by surveying multiple body sites from 300 people and generating data from over 12,000 samples. To characterize these samples, the participating sequencing centers evaluated and adopted 16S rDNA community profiling protocols for ABI 3730 and 454 FLX Titanium sequencing. In the course of establishing protocols, we examined the performance and error characteristics of each technology, and the relationship of sequence error to the utility of 16S rDNA regions for classification- and OTU-based analysis of community structure. The data production protocols used for this work are those used by the participating centers to produce 16S rDNA sequence for the Human Microbiome Project. Thus, these results can be informative for interpreting the large body of clinical 16S rDNA data produced for this project. Sun, 01 Apr 2012 00:00:00 -0400 Draft Genome Sequence of Bacteroides vulgatus PC510, a Strain Isolated from Human Feces. http://www.weizhongli-lab.org/cms/index.php?id=629&no_cache=1&tx_ttnews[tt_news]=2793/ Cu�v P�, Klaassens ES, et al.|-|-| Journal of bacteriology. 2011 Aug 01; 193: 4025-6.|-|-| Although Bacteroides vulgatus is one of the most prevalent microorganisms in the human gastrointestinal tract, little is known about the genetic potential of this species. Here, we describe the annotated draft genome sequence of B. vulgatus PC510 isolated from human feces. Mon, 01 Aug 2011 00:00:00 -0400 Minimum Information About a Marker Gene Sequence (MIMARKS) and Minimum Information About Any (x) Sequence (MIxS) Specifications. http://www.weizhongli-lab.org/cms/index.php?id=629&no_cache=1&tx_ttnews[tt_news]=3236/ Yilmaz P, Kottmann R, et al.|-|-| Nature Biotechnology. 2011 May 01; 29: 415-20.|-|-| Here we present a standard developed by the Genomic Standards Consortium (GSC) for reporting marker gene sequences--the minimum information about a marker gene sequence (MIMARKS). We also introduce a system for describing the environment from which a biological sample originates. The 'environmental packages' apply to any genome sequence of known origin and can be used in combination with MIMARKS and other GSC checklists. Finally, to establish a unified standard for describing sequence data and to provide a single point of entry for the scientific community to access and learn about GSC checklists, we present the minimum information about any (x) sequence (MIxS). Adoption of MIxS will enhance our ability to analyze natural genetic diversity documented by massive DNA sequencing efforts from myriad ecosystems in our ever-changing biosphere. Sun, 01 May 2011 00:00:00 -0400 Using DGGE Profiling to Develop a Novel Culture Medium Suitable for Oral Microbial Communities http://www.weizhongli-lab.org/cms/index.php?id=629&no_cache=1&tx_ttnews[tt_news]=2418/ Tian, Y., He, X., et al.|-|-| Mol Oral Microbiol. 2010 Oct 01; 25(5): 357-67.|-|-| More than 700 bacterial species have been detected in the human oral cavity. They form highly organized microbial communities and are responsible for many oral infectious diseases, such as dental caries and periodontal disease. The prevention and treatment of these diseases require a comprehensive knowledge of oral microbial communities, which largely relies on culture-dependent methods to provide detailed phenotypic and physiological analysis of these communities. However, most of the currently available laboratory media can only selectively support the growth of a limited number of bacterial species within these communities, and fail to sustain the original oral microbial diversity. In this study, using denaturing gradient gel electrophoresis (DGGE) as an index to systematically survey and analyse the selectivity of commonly used laboratory media, we developed a new medium (SHI medium) by combining the ingredients of several selected media that can support different subpopulations within the original oral microbial community derived from pooled saliva. DGGE and 454 pyrosequencing analysis showed that SHI medium was capable of supporting a more diversified community with a microbial profile closer to that of the original oral microbiota. Furthermore, 454 pyrosequencing revealed that SHI medium supported the growth of many oral species that have not before been cultured. Crystal violet assay and the confocal laser scanning microscope analysis indicated that, compared with other media, SHI medium is able to support a more complex saliva-derived biofilm with higher biomass yield and more diverse species. This DGGE-guided method could also be used to develop novel media for other complex microbial communities. Fri, 01 Oct 2010 00:00:00 -0400 ANDES: Statistical Tools for the ANalyses of DEep Sequencing http://www.weizhongli-lab.org/cms/index.php?id=629&no_cache=1&tx_ttnews[tt_news]=2395/ Li, K., Venter, E., et al.|-|-| BMC Res Notes. 2010 Jul 15; 3(1): 199.|-|-| ABSTRACT: BACKGROUND: The advancements in DNA sequencing technologies have allowed researchers to progress from the analyses of a single organism towards the deep sequencing of a sample of organisms. With sufficient sequencing depth, it is now possible to detect subtle variations between members of the same species, or between mixed species with shared biomarkers, such as the 16S rRNA gene. However, traditional sequencing analyses of samples from largely homogeneous populations are often still based on multiple sequence alignments (MSA), where each sequence is placed along a separate row and similarities between aligned bases can be followed down each column. While this visual format is intuitive for a small set of aligned sequences, the representation quickly becomes cumbersome as sequencing depths cover loci hundreds or thousands of reads deep. FINDINGS: We have developed ANDES, a software library and a suite of applications, written in Perl and R, for the statistical ANalyses of DEep Sequencing. The fundamental data structure underlying ANDES is the position profile, which contains the nucleotide distributions for each genomic position resultant from a multiple sequence alignment (MSA). Tools include the root mean square deviation (RMSD) plot, which allows for the visual comparison of multiple samples on a position-by-position basis, and the computation of base conversion frequencies (transition/transversion rates), variation (Shannon entropy), inter-sample clustering and visualization (dendrogram and multidimensional scaling (MDS) plot), threshold-driven consensus sequence generation and polymorphism detection, and the estimation of empirically determined sequencing quality values. CONCLUSIONS: As new sequencing technologies evolve, deep sequencing will become increasingly cost-efficient and the inter and intra-sample comparisons of largely homogeneous sequences will become more common. We have provided a software package and demonstrated its application on various empirically-derived datasets. Investigators may download the software from Sourceforge at https://sourceforge.net/projects/andestools. This publication is listed for reference purposes only. It may be included to present a more complete view of a JCVI employee's body of work, or as a reference to a JCVI sponsored project. Thu, 15 Jul 2010 00:00:00 -0400 Diversity of 16S RRNA Genes Within Individual Prokaryotic Genomes http://www.weizhongli-lab.org/cms/index.php?id=629&no_cache=1&tx_ttnews[tt_news]=2353/ Pei, A. Y., Oberdorf, W. E., et al.|-|-| Appl Environ Microbiol. 2010 Jun 01; 76(12): 3886-3897.|-|-| Analysis of intragenomic variation of 16S rRNA genes is a unique approach to examining the concept of ribosomal constraints on rRNA genes; the degree of variation is an important parameter to consider for estimation of the diversity of a complex microbiome in the recently initiated Human Microbiome Project (http://nihroadmap.nih.gov/hmp). The current GenBank database has a collection of 883 prokaryotic genomes representing 568 unique species, of which 425 species contained 2 to 15 copies of 16S rRNA genes per genome (2.22 +/- 0.81). Sequence diversity among the 16S rRNA genes in a genome was found in 235 species (from 0.06% to 20.38%; 0.55% +/- 1.46%). Compared with the 16S rRNA-based threshold for operational definition of species (1 to 1.3% diversity), the diversity was borderline (between 1% and 1.3%) in 10 species and >1.3% in 14 species. The diversified 16S rRNA genes in Haloarcula marismortui (diversity, 5.63%) and Thermoanaerobacter tengcongensis (6.70%) were highly conserved at the 2 degrees structure level, while the diversified gene in B. afzelii (20.38%) appears to be a pseudogene. The diversified genes in the remaining 21 species were also conserved, except for a truncated 16S rRNA gene in "Candidatus Protochlamydia amoebophila." Thus, this survey of intragenomic diversity of 16S rRNA genes provides strong evidence supporting the theory of ribosomal constraint. Taxonomic classification using the 16S rRNA-based operational threshold could misclassify a number of species into more than one species, leading to an overestimation of the diversity of a complex microbiome. This phenomenon is especially seen in 7 bacterial species associated with the human microbiome or diseases. Tue, 01 Jun 2010 00:00:00 -0400 Bacterial Diversity In the Oral Cavity of 10 Healthy Individuals http://www.weizhongli-lab.org/cms/index.php?id=629&no_cache=1&tx_ttnews[tt_news]=2257/ Bik, E. M., Long, C. D., et al.|-|-| Isme J. 2010 Mar 25; 4(8): 962-74.|-|-| The composition of the oral microbiota from 10 individuals with healthy oral tissues was determined using culture-independent techniques. From each individual, 26 specimens, each from different oral sites at a single point in time, were collected and pooled. An 11th pool was constructed using portions of the subgingival specimens from all 10 individuals. The 16S ribosomal RNA gene was amplified using broad-range bacterial primers, and clone libraries from the individual and subgingival pools were constructed. From a total of 11 368 high-quality, nonchimeric, near full-length sequences, 247 species-level phylotypes (using a 99% sequence identity threshold) and 9 bacterial phyla were identified. At least 15 bacterial genera were conserved among all 10 individuals, with significant interindividual differences at the species and strain level. Comparisons of these oral bacterial sequences with near full-length sequences found previously in the large intestines and feces of other healthy individuals suggest that the mouth and intestinal tract harbor distinct sets of bacteria. Co-occurrence analysis showed significant segregation of taxa when community membership was examined at the level of genus, but not at the level of species, suggesting that ecologically significant, competitive interactions are more apparent at a broader taxonomic level than species. This study is one of the more comprehensive, high-resolution analyses of bacterial diversity within the healthy human mouth to date, and highlights the value of tools from macroecology for enhancing our understanding of bacterial ecology in human health.The ISME Journal advance online publication, 25 March 2010; doi:10.1038/ismej.2010.30. Thu, 25 Mar 2010 00:00:00 -0400 Robust Computational Analysis of RRNA Hypervariable Tag Datasets http://www.weizhongli-lab.org/cms/index.php?id=629&no_cache=1&tx_ttnews[tt_news]=2485/ Sipos, M., Jeraldo, P., et al.|-|-| PLoS One. 2010 Jan 01; 5(12): e15220.|-|-| Next-generation DNA sequencing is increasingly being utilized to probe microbial communities, such as gastrointestinal microbiomes, where it is important to be able to quantify measures of abundance and diversity. The fragmented nature of the 16S rRNA datasets obtained, coupled with their unprecedented size, has led to the recognition that the results of such analyses are potentially contaminated by a variety of artifacts, both experimental and computational. Here we quantify how multiple alignment and clustering errors contribute to overestimates of abundance and diversity, reflected by incorrect OTU assignment, corrupted phylogenies, inaccurate species diversity estimators, and rank abundance distribution functions. We show that straightforward procedural optimizations, combining preexisting tools, are effective in handling large () 16S rRNA datasets, and we describe metrics to measure the effectiveness and quality of the estimators obtained. We introduce two metrics to ascertain the quality of clustering of pyrosequenced rRNA data, and show that complete linkage clustering greatly outperforms other widely used methods. Fri, 01 Jan 2010 00:00:00 -0500 Genomics. Genome Project Standards In a New Era of Sequencing http://www.weizhongli-lab.org/cms/index.php?id=629&no_cache=1&tx_ttnews[tt_news]=2135/ Chain, P. S., Grafham, D. V., et al.|-|-| Science. 2009 Oct 09; 326(5950): 236-7.|-|-| Fri, 09 Oct 2009 00:00:00 -0400 Metagenomic Analysis of the Human Distal Gut Microbiome http://www.weizhongli-lab.org/cms/index.php?id=629&no_cache=1&tx_ttnews[tt_news]=505/ Gill, S. R., Pop, M., et al.|-|-| Science. 2006 Jun 02; 312(5778): 1355-9.|-|-| The human intestinal microbiota is composed of 10(13) to 10(14) microorganisms whose collective genome ("microbiome") contains at least 100 times as many genes as our own genome. We analyzed approximately 78 million base pairs of unique DNA sequence and 2062 polymerase chain reaction-amplified 16S ribosomal DNA sequences obtained from the fecal DNAs of two healthy adults. Using metabolic function analyses of identified genes, we compared our human genome with the average content of previously sequenced microbial genomes. Our microbiome has significantly enriched metabolism of glycans, amino acids, and xenobiotics; methanogenesis; and 2-methyl-d-erythritol 4-phosphate pathway-mediated biosynthesis of vitamins and isoprenoids. Thus, humans are superorganisms whose metabolism represents an amalgamation of microbial and human attributes. Fri, 02 Jun 2006 00:00:00 -0400 Molecular Analysis of the Bacterial Microbiota In the Human Stomach http://www.weizhongli-lab.org/cms/index.php?id=629&no_cache=1&tx_ttnews[tt_news]=481/ Bik, E. M., Eckburg, P. B., et al.|-|-| Proc Natl Acad Sci U S A. 2006 Jan 04; 103(3): 732-7.|-|-| The microbiota of the human stomach and the influence of Helicobacter pylori colonization on its composition remain largely unknown. We characterized bacterial diversity within the human gastric mucosa by using a small subunit 16S rDNA clone library approach and analyzed 1,833 sequences generated by broad-range bacterial PCR from 23 gastric endoscopic biopsy samples. A diverse community of 128 phylotypes was identified, featuring diversity at this site greater than previously described. The majority of sequences were assigned to the Proteobacteria, Firmicutes, Actinobacteria, Bacteroidetes, and Fusobacteria phyla. Ten percent of the phylotypes were previously uncharacterized, including a Deinococcus-related organism, relatives of which have been found in extreme environments but not reported before in humans. The gastric clone libraries from 19 subjects contained H. pylori rDNA; however, only 12 of these subjects tested positive for H. pylori by conventional laboratory methods. Statistical analysis revealed a large degree of intersubject variability of the gastric ecosystem. The presence of H. pylori did not affect the composition of the gastric community. This gastric bacterial rDNA data set was significantly different from sequence collections of the human mouth and esophagus described in other studies, indicating that the human stomach may be home to a distinct microbial ecosystem. The gastric microbiota may play important, as-yet-undiscovered roles in human health and disease. Wed, 04 Jan 2006 00:00:00 -0500 Diversity of the Human Intestinal Microbial Flora http://www.weizhongli-lab.org/cms/index.php?id=629&no_cache=1&tx_ttnews[tt_news]=616/ Eckburg, P. B., Bik, E. M., et al.|-|-| Science. 2005 Apr 14; 308: 1635-1638.|-|-| The human endogenous intestinal microflora is an essential "organ" in providing nourishment, regulating epithelial development, and instructing innate immunity; yet surprisingly, basic features remain poorly described. We examined 13,355 prokaryotic ribosomal RNA gene sequences from multiple colonic mucosal sites and feces of healthy subjects in order to improve our understanding of gut microbial diversity. A majority of the bacterial sequences corresponded to uncultivated species and novel microorganisms. We discovered significant intersubject variability and differences between stool and mucosa community composition. Characterization of this immensely diverse ecosystem is the first step in elucidating its role in health and disease. Thu, 14 Apr 2005 00:00:00 -0400