Summary
Project Participants:
David Lightfoot, Ph.D., Southern
Illinois University, Carbondale,
Khalid Meksem,
Ph.D., Southern Illinois University, Carbondale,
Christopher
Town, Ph. D., The Institute for Genomic Research,
Hongbin Zhang, Ph.D.Texas
A&M University
Introduction:
Functional genomics requires
multi-parallel approaches to assess gene function that make use of the
information and reagents provided by structural genomics. Genomics has enhanced
understanding of the nature of complex trait loci through physical maps and
through expression profiles that underlie phenotypes. Integrating tools that
allow gene targeting, (STS, ESTs, genome sequence tags and DNA markers) with
physical maps has lead to the routine isolation of DNA fragments that contain candidate
genes underlying complex trait loci.
Proposed
is the development of a high-density gene map from a robust physical map of the
soybean genome. The integrated map
will allow us to test the hypothesis
that ‘in the plant post-genome sequence era, crop whole genome sequencing is
unnecessary for functional genomes’. Testing the hypothesis will produce a
map for functional genomics that will accelerate the rate of economically
important gene discovery; provide a framework for integrating functional gene
maps of legumes; and provide a tool for comparative structural and functional
genomics among crops.
Objectives and Goals: We will:
1. Develop at
high-resolution a 40-60 thousand gene map that can be used for functional
genomics. Comparison of the gene map with the predicted gene content of 9 Mbp
of genome sequence will test the hypothesis that whole crop genome shotgun
sequencing is redundant.
2. Improve methods
for assembling the fingerprinted BACs into complete and robust physical map
contigs. Comparisons of physical map construction methods will address the
validity of gene locations inferred from physical maps.
3. Sequence 8-9 Mbp
of soybean genomic DNA from a gene rich region on chromosome G. The contig
generation tools and gene maps can be critically evaluated compared to benchmark
sequence data for overlapping BACs and their predicted gene content.
4.
Test the utility of the contigs, virtual sequence and DNA sequence
for identifying homeologous genomic regions. The predictive value of synteny
associations will be tested.
5.
Provide researchers with electronic access to fingerprints, maps,
genome sequences and clones. Allow interactive analysis of regions likely to
contain genes and QTLs of agricultural importance and determine the robustness
of the virtual sequence represented by the gene map.
Methods:
The
primary method for gene map development and physical map gap filling will be
STS integration with BAC fingerprint data. Paralogs will be identified from
ESTs to increase the map’s gene density. DNA sequencing will be used to generate
8-9 Mbp of sequence from two gene rich regions and one soybean chromosome.
Paralogs and sequence predicted genes will be annotated by identification in
the transcriptome, hypo-methylated genomic DNA or DNA derived from
hyper-acetylated nucleosomes. Bio-informatics will be used to compare gene
order across genera and to make gene function predictions.