Online citations, reference lists, and bibliographies.
← Back to Search

Sequencing Medicago Truncatula Expressed Sequenced Tags Using 454 Life Sciences Technology

F. Cheung, B. Haas, S. M. Goldberg, G. May, Yongli Xiao, C. Town
Published 2006 · Biology, Medicine

Save to my Library
Download PDF
Analyze on Scholarcy Visualize in Litmaps
Share
Reduce the time it takes to create your bibliography by a factor of 10 by using the world’s favourite reference manager
Time to take this seriously.
Get Citationsy
BackgroundIn this study, we addressed whether a single 454 Life Science GS20 sequencing run provides new gene discovery from a normalized cDNA library, and whether the short reads produced via this technology are of value in gene structure annotation.ResultsA single 454 GS20 sequencing run on adapter-ligated cDNA, from a normalized cDNA library, generated 292,465 reads that were reduced to 252,384 reads with an average read length of 92 nucleotides after cleaning. After clustering and assembly, a total of 184,599 unique sequences were generated containing over 400 SSRs. The 454 sequences generated hits to more genes than a comparable amount of sequence from MtGI. Although short, the 454 reads are of sufficient length to map to a unique genome location as effectively as longer ESTs produced by conventional sequencing. Functional interpretation of the sequences was carried out by Gene Ontology assignments from matches to Arabidopsis and was shown to cover a broad range of GO categories. 53,796 assemblies and singletons (29%) had no match in the existing MtGI. Within the previously unobserved Medicago transcripts, thousands had matches in a comprehensive protein database and one or more of the TIGR Plant Gene Indices. Approximately 20% of these novel sequences could be found in the Medicago genome sequence. A total of 70,026 reads generated by the 454 technology were mapped to 785 Medicago finished BACs using PASA and over 1,000 gene models required modification. In parallel to 454 sequencing, 4,445 5'-prime reads were generated by conventional sequencing using the same library and from the assembled sequences it was shown to contain about 52% full length cDNAs encoding proteins from 50 to over 500 amino acids in length.ConclusionDue to the large number of reads afforded by the 454 DNA sequencing technology, it is effective in revealing the expression of transcripts from a broad range of GO categories and contains many rare transcripts in normalized cDNA libraries, although only a limited portion of their sequence is uncovered. As with longer ESTs, 454 reads can be mapped uniquely onto genomic sequence to provide support for, and modifications of, gene predictions.
This paper references
10.1101/GR.9.10.950
Large-scale statistical analyses of rice ESTs reveal correlated patterns of gene expression.
R. Ewing (1999)
10.1101/GR.10.7.950
Human and Mouse Gene Structure: Comparative Analysis and Application to Exon Prediction
S. Batzoglou (2000)
10.2144/01304PF02
Reverse transcriptase template switching: a SMART approach for full-length cDNA library construction.
Y. Zhu (2001)
In silico analysis on frequency and distribution of microsatellites in ESTs of some cereal species.
R. Varshney (2002)
10.1016/S1369-5266(02)00240-6
Applications of single nucleotide polymorphisms in crop genetics.
A. Rafalski (2002)
Genomic and expression analysis of the 12p11-p12 amplicon using EST arrays identifies two novel amplified and overexpressed genes.
V. Bourdon (2002)
10.1007/s00122-002-1031-0
Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.)
T. Thiel (2002)
10.1101/GR.547002
A novel method for SNP detection using a new duplex-specific nuclease from crab hepatopancreas.
D. Shagin (2002)
10.1073/pnas.0932694100
Comparative genomics of Physcomitrella patens gametophytic transcriptome and Arabidopsis thaliana: Implication for land plant evolution
T. Nishiyama (2003)
10.1046/J.1365-313X.2003.01687.X
Correlated clustering and virtual display of gene expression patterns in the wheat life cycle by large-scale statistical analyses of expressed sequence tags.
Y. Ogihara (2003)
10.1093/NAR/GKG770
Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies.
B. Haas (2003)
10.1093/bioinformatics/btg034
TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets
G. Pertea (2003)
10.1007/s10142-004-0107-0
Molecular markers from the transcribed/expressed region of the genome in higher plants
P. Gupta (2004)
10.1093/nar/gkh099
The TIGR Plant Repeat Databases: a collective resource for the identification of repetitive sequences in plants
Shu Ouyang (2004)
10.1139/G05-029
Use of tall fescue EST-SSR markers in phylogenetic analysis of cool-season forage grasses.
M. Mian (2005)
10.1104/pp.104.059063
The Institute for Genomic Research Osa1 Rice Genome Annotation Database1
Q. Yuan (2005)
10.1093/nar/gki064
The TIGR Gene Indices: clustering and assembling EST and known genes and integration with eukaryotic genomes
Yuandan Lee (2005)
10.1104/pp.104.057034
Sequencing the Genespaces of Medicago truncatula and Lotus japonicus1
N. Young (2005)
10.1038/nature03959
Genome sequencing in microfabricated high-density picolitre reactors
M. Margulies (2005)
The International Medicago Genome Annotation Group [http://www.medicago.org/genome
The Arabidoposis Information Resource
The Arabidoposis Information Resource
S Ouyang



This paper is referenced by
10.1080/13102818.2021.1920462
Common bean as a potential crop for future food security: an overview of past, current and future contributions in genomics, transcriptomics, transgenics and proteomics
M. Nadeem (2021)
10.3390/cancers13061372
Non-Coding RNAs in Cancer Diagnosis and Therapy: Focus on Lung Cancer
Patricia Le (2021)
10.1007/s11356-020-10774-4
Transcriptome analysis of Plantago major as a phytoremediator to identify some genes related to cypermethrin detoxification
Ahmed A. A. Aioub (2020)
10.3389/fpls.2020.00592
Biotechnological Perspectives of Omics and Genetic Engineering Methods in Alfalfa
Miroslava Hrbáčková (2020)
10.1093/nar/gkaa1072
Optimized design of antisense oligomers for targeted rRNA depletion
Wesley A. Phelps (2020)
10.1101/2020.06.24.169102
Optimized design of antisense oligomers for targeted rRNA depletion
Wesley A. Phelps (2020)
10.1007/978-981-15-4752-2_18
Oxidative Stress and Antioxidant Defence in Fabaceae Plants Under Abiotic Stresses
Carla Gualtieri (2020)
10.1002/9781119409144.ch03
Medicago truncatula as an ecological, evolutionary, and forage legume model: new directions forward
E. J. Wettberg (2019)
10.30848/PJB2019-4(14)
Generation and analysis of expressed sequence tags from roots cDNA library of cotton (Gossypium arboreum)
A. Jamal (2019)
10.1038/s41598-019-48072-w
Elucidation of Galactomannan Biosynthesis Pathway Genes through Transcriptome Sequencing of Seeds Collected at Different Developmental Stages of Commercially Important Indian Varieties of Cluster Bean (Cyamopsis tetragonoloba L.)
A. Chaudhury (2019)
10.1002/9781119409144.ch119
Genomic and genetic markers in Medicago truncatula : introduction
F. J. Bruijn (2019)
10.2217/pgs-2018-0036
Computational functional genomics-based approaches in analgesic drug discovery and repurposing.
C. Lippmann (2018)
10.1007/978-981-10-5514-0_12
Legume, Microbiome, and Regulatory Functions of miRNAs in Systematic Regulation of Symbiosis
S. S. Hussain (2018)
10.1016/j.ecoenv.2018.05.066
Transcriptome analysis providing novel insights for Cd-resistant tall fescue responses to Cd stress.
Huihui Zhu (2018)
10.1111/pce.13123
Legume genetic resources and transcriptome dynamics under abiotic stress conditions.
M. Abdelrahman (2018)
10.1007/978-3-030-00614-3_9
The Draft Genome of the MD-2 Pineapple
Raimi M. Redwan (2018)
10.1007/978-981-10-6647-4_3
Insights of Novel Breeding Strategies in Sustainable Crop Production
K. Yasmin (2017)
10.3390/ijms18061155
De Novo Transcriptome Sequencing and the Hypothetical Cold Response Mode of Saussurea involucrata in Extreme Cold Environments
J. Li (2017)
10.1007/978-3-319-32059-5_17
A Role of Bioinformatics in Agriculture
Zohra Aslam (2017)
10.1186/s40064-016-3089-1
De novo transcriptome sequencing and comparative analysis to discover genes related to floral development in Cymbidium faberi Rolfe
Yuying Sun (2016)
Integration of RNA and protein expression profiles to study human cells
F. Danielsson (2016)
10.1007/s10535-016-0681-8
De novo transcriptome analysis of Fraxinus velutina using Illumina platform and development of EST-SSR markers
L.-P. Yan (2016)
10.3389/fpls.2016.00455
Emerging Genomic Tools for Legume Breeding: Current Status and Future Prospects
M. Pandey (2016)
10.1016/B978-0-444-63505-1.00005-1
Aspergillus Secretome: An Overview
B. Ganesan (2016)
10.5010/JPB.2016.43.3.311
Transcriptome analysis, microsatellite marker information, and orthologous analysis of Capsicum annuum varieties
Yul-Kyun Ahn (2016)
10.1371/journal.pone.0157908
Dissecting the Root Nodule Transcriptome of Chickpea (Cicer arietinum L.)
Chandra Kant (2016)
STUDY OF METHODS USED IN NGS ( NEXT GENERATION SEQUENCING ) IN GENE EXPRESSION
Yusuf Talib (2016)
Genome evolution, speciation and molecular population genetics of giant weta
Victoria G. Twort (2016)
10.1104/pp.15.01081
Discovering New Biology through Sequencing of RNA1
A. Weber (2015)
Update on Sequencing of RNA Discovering New Biology through Sequencing of RNA 1
A. Weber (2015)
10.3934/BIOENG.2015.4.487
Perspectives on the use of transcriptomics to advance biofuels
Siseon Lee (2015)
10.1080/07352689.2014.897909
Translational Genomics in Agriculture: Some Examples in Grain Legumes
R. Varshney (2015)
See more
Semantic Scholar Logo Some data provided by SemanticScholar