Online citations, reference lists, and bibliographies.
← Back to Search

Novel Phylogenetic Studies Of Genomic Sequence Fragments Derived From Uncultured Microbe Mixtures In Environmental And Clinical Samples.

T. Abe, H. Sugawara, M. Kinouchi, S. Kanaya, T. Ikemura
Published 2005 · Biology, Medicine

Save to my Library
Download PDF
Analyze on Scholarcy
Share
A self-organizing map (SOM) was developed as a novel bioinformatics strategy for phylogenetic classification of sequence fragments obtained from pooled genome samples of uncultured microbes in environmental and clinical samples. This phylogenetic classification was possible without either orthologous sequence sets or sequence alignments. We first constructed SOMs for tetranucleotide frequencies in 210,000 5 kb sequence fragments obtained from 1502 prokaryotes for which at least 10 kb of genomic sequence has been deposited in public DNA databases. The sequences could be classified primarily according to phylogenetic groups without information regarding the species. We used the SOM method to classify sequence fragments derived from environmental samples of the Sargasso Sea and of an acidophilic biofilm growing in acid mine drainage. Phylogenetic diversity of the environmental sequences was effectively visualized on a single map. Sequences that were derived from a single genome but cloned independently could be reassociated in silico. G + C% has been used for a long period as a fundamental parameter for phylogenetic classification of microbes, but the G + C% is apparently too simple a parameter to differentiate a wide variety of known species. Oligonucleotide frequency can be used to distinguish the species because oligonucleotide frequencies vary significantly among their genomes.
This paper references
10.1038/nbt1048
Substrate-induced gene-expression screening of environmental metagenome libraries for isolation of catabolic genes
T. Uchiyama (2005)
10.1111/J.1462-2920.2004.00624.X
Application of tetranucleotide frequencies for the assignment of genomic fragments.
H. Teeling (2004)
Community structure and metabolism through reconstruction of microbial genomes
G. W. Tyson (2004)
10.1016/S0958-1669(02)00345-2
Screening for novel enzymes for biocatalytic processes: accessing the metagenome as a resource of novel functional sequence space.
P. Lorenz (2002)
10.1128/AEM.66.6.2541-2547.2000
Cloning the Soil Metagenome: a Strategy for Accessing the Genetic and Functional Diversity of Uncultured Microorganisms
M. R. Rondon (2000)
10.1101/GR.163101
Genome-scale compositional comparisons in eukaryotes.
A. Gentles (2001)
10.1016/S1369-5274(02)00353-3
Microbial population genomics and ecology.
E. DeLong (2002)
10.1101/GR.634603
Informatics for unveiling hidden genome signatures.
T. Abe (2003)
10.1126/SCIENCE.1093857
Environmental Genome Shotgun Sequencing of the Sargasso Sea
J. Venter (2004)
10.1016/S0958-1669(03)00067-3
Biotechnological prospects from metagenomics.
P. Schloss (2003)
10.1016/S0378-1119(01)00673-4
Analysis of codon usage diversity of bacterial genes with a self-organizing map (SOM): characterization of horizontally transferred genes with emphasis on the E. coli O157 genome.
S. Kanaya (2001)
10.1093/NAR/12.3.1749
Doublet frequencies in evolutionary distinct groups.
R. Nussinov (1984)
10.1109/5.58325
The self-organizing map
T. Kohonen (1990)
10.1128/MMBR.59.1.143-169.1995
Phylogenetic identification and in situ detection of individual microbial cells without cultivation.
R. Amann (1995)
10.1146/ANNUREV.GENET.32.1.185
Comparative DNA analysis across diverse genomes.
S. Karlin (1998)
Direct cloning of genes encoding novel xylanases from human gut, Can
H. Hayashi (2005)
10.1139/W04-136
Direct cloning of genes encoding novel xylanases from the human gut.
H. Hayashi (2005)
10.1038/nature02340
Community structure and metabolism through reconstruction of microbial genomes from the environment
G. Tyson (2004)
The complete genome sequence of the gram-positive bacterium
F. Kunst (1997)
10.1016/0167-7799(96)10025-1
Identifying microbial diversity in the natural environment: a molecular phylogenetic approach.
P. Hugenholtz (1996)
10.1109/5.537105
Engineering applications of the self-organizing map
T. Kohonen (1996)
10.1038/36786
The complete genome sequence of the Gram-positive bacterium Bacillus subtilis
F. Kunst (1997)



This paper is referenced by
10.1038/ismej.2014.83
A novel approach, based on BLSOMs (Batch Learning Self-Organizing Maps), to the microbiome analysis of ticks
R. Nakao (2014)
10.1038/nrmicro2167
Unravelling ancient microbial history with community proteogenomics and lipid geochemistry
J. Brocks (2009)
10.1093/nar/gkn692
tRNADB-CE: tRNA gene database curated manually by experts
T. Abe (2009)
10.1007/s00248-014-0440-3
Metabolic Characteristics of a Glycogen-Accumulating Organism in Defluviicoccus Cluster II Revealed by Comparative Genomics
Z. Wang (2014)
10.1155/2014/985706
Visualization of Genome Signatures of Eukaryote Genomes by Batch-Learning Self-Organizing Map with a Special Emphasis on Drosophila Genomes
T. Abe (2014)
10.1093/nar/gkn496
A comparison of random sequence reads versus 16S rDNA sequences for estimating the biodiversity of a metagenomic library
C. Manichanh (2008)
10.1292/jvms.18-0483
Viral population analysis of the taiga tick, Ixodes persulcatus, by using Batch Learning Self-Organizing Maps and BLAST search
Y. Qiu (2019)
10.1109/BIBE.2007.4375743
GPX: A Tool for the Exploration and Visualization of Genome Evolution
N. Nahar (2007)
DNA Sequence Classification: It’s Easier Than You Think: An open-source k-mer based machine learning tool for fast and accurate classification of a variety of genomic datasets
Stephen Solis-Reyes (2018)
The Proximon: Representation, Evaluation, and Applications of Metagenomic Functional Interactions
G. Vey (2017)
10.1002/9781118010518.CH40
Phylogenetic Binning of Metagenome Sequence Samples
A. McHardy (2011)
10.1007/978-3-642-21566-7_20
A Novel Bioinformatics Strategy to Predict Directional Changes of Influenza A Virus Genome Sequences
Y. Iwasaki (2011)
10.1186/1471-2105-10-430
WebCARMA: a web application for the functional and taxonomic classification of unassembled metagenomic reads
W. Gerlach (2009)
10.1186/1471-2105-11-544
Clustering metagenomic sequences with interpolated Markov models
David R. Kelley (2010)
Taxonomic classification of metagenomic sequences
W. Gerlach (2012)
10.1101/018465
Metagenome-assembled genomes uncover a global brackish microbiome
L. W. Hugerth (2015)
10.1007/978-3-642-02397-2_1
Batch-Learning Self-Organizing Map for Predicting Functions of Poorly-Characterized Proteins Massively Accumulated
T. Abe (2009)
A Large-scale Batch-learning Self-organizing Map for Function Prediction of Poorly-characterized Proteins Progressively Accumulating in Sequence Databases : Annual Report of the Earth Simulator Center April 2007-March 2008
T. Abe (2007)
10.1007/s00018-015-2004-1
From cultured to uncultured genome sequences: metagenomics and modeling microbial ecosystems
D. Garza (2015)
10.1155/2008/472719
Unsupervised Learning in Detection of Gene Transfer
L. Hamel (2008)
10.1093/nar/gkr225
Taxonomic classification of metagenomic shotgun sequences with CARMA3
W. Gerlach (2011)
10.2174/138920209789208255
Signal Processing for Metagenomics: Extracting Information from the Soup
G. Rosen (2009)
10.1007/978-3-642-02478-8_104
A Large-Scale Genomics Studies Conducted with Batch-Learning SOM Utilizing High-Performance Supercomputers
T. Abe (2009)
10.1186/1471-2164-11-461
MLTreeMap - accurate Maximum Likelihood placement of environmental DNA sequences into taxonomic and functional reference phylogenies
M. Stark (2010)
Computational Methods for Metagenomic Analysis
Itai Sharon (2010)
10.1016/j.coms.2012.01.001
Microorganisms of the nose and paranasal sinuses.
R. Haug (2012)
10.1100/tsw.2011.70
Genomic Signatures in Microbes—Properties and Applications
J. Bohlin (2011)
10.1038/nmeth976
Accurate phylogenetic classification of variable-length DNA fragments
A. McHardy (2007)
10.1155/2016/3164624
Horizontally Transferred Genetic Elements in the Tsetse Fly Genome: An Alignment-Free Clustering Approach Using Batch Learning Self-Organising Map (BLSOM)
R. Nakao (2016)
10.1038/ismej.2009.158
AMD biofilms: using model communities to study microbial evolution and ecological complexity in nature
V. Denef (2010)
10.1093/dnares/dsp018
A Novel Bioinformatics Strategy for Function Prediction of Poorly-Characterized Protein Genes Obtained from Metagenome Analyses
T. Abe (2009)
10.1093/dnares/dsu012
Evolutionary Changes in Vertebrate Genome Signatures with Special Focus on Coelacanth
Y. Iwasaki (2014)
See more
Semantic Scholar Logo Some data provided by SemanticScholar