Drosophila Genomic Sequence Annotation Using the BLOCKS+ Database
AUTHOR(S)
Henikoff, Jorja G.
SOURCE
Cold Spring Harbor Laboratory Press
ABSTRACT
A simple and general homology-based method for gene finding was applied to the 2.9-Mb Drosophila melanogaster Adh region, the target sequence of the Genome Annotation Assessment Project (GASP). Each strand of the entire sequence was used as a query against the BLOCKS+ database of conserved regions of proteins. This led to functional assignments for more than one-third of the genes and two-thirds of the transposons. Considering the enormous size of the query, the fact that only two false-positive matches were reported emphasizes the high selectivity of protein family-based methods for gene finding. We used the search results to improve BLOCKS+ by identifying compositionally biased blocks. Our results confirm that protein family databases can be used effectively in automated sequence annotation efforts.
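Querying a protein-block database with each strand of a DNA sequence implies comparing conceptual translations of both strands against the blocks. The sketch below illustrates only that preparatory step, six-frame translation with the standard genetic code. It is not the authors' software: the function names `reverse_complement`, `translate`, and `six_frame_translations` are hypothetical, and the block search itself is not shown.

```python
# Illustrative sketch only: six-frame translation of a DNA query.
# Assumptions: standard genetic code, uppercase or lowercase ACGT input,
# and any codon containing an ambiguous base is reported as "X".

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Build the codon table in the conventional TCAG ordering.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C", "N": "N"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement (the opposite strand, read 5' to 3')."""
    return "".join(COMPLEMENT.get(base, "N") for base in reversed(seq.upper()))


def translate(seq: str) -> str:
    """Translate complete codons from the first base; trailing bases are ignored."""
    seq = seq.upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X")
        for i in range(0, len(seq) - 2, 3)
    )


def six_frame_translations(seq: str) -> dict:
    """Map frame labels (+1..+3, -1..-3) to conceptual translations."""
    frames = {}
    for sign, strand in (("+", seq), ("-", reverse_complement(seq))):
        for offset in range(3):
            frames[f"{sign}{offset + 1}"] = translate(strand[offset:])
    return frames


if __name__ == "__main__":
    for label, protein in six_frame_translations("ATGGCCTAA").items():
        print(label, protein)
```

For a multi-megabase query such as the 2.9-Mb region described above, a practical pipeline would likely process the sequence in windows rather than translating it in one pass; that detail is left out of this sketch.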
ARTICLE ACCESS
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=310867
Related Documents
- Assessing the impact of comparative genomic sequence data on the functional annotation of the Drosophila genome
- SELEX_DB: an activated database on selected randomized DNA/RNA sequences addressed to genomic sequence annotation
- The Institute for Genomic Research Osa1 Rice Genome Annotation Database
- Using GeneWise in the Drosophila Annotation Experiment
- MitoProteome: mitochondrial protein sequence database and annotation system