Gene Identification and the Multiple Spliced Alignment Problem

AUTHOR(S)
SOURCE

IBICT - Instituto Brasileiro de Informação em Ciência e Tecnologia

PUBLICATION DATE

14/12/2010

ABSTRACT

Gene prediction in DNA sequences of eukaryotic organisms is still an open problem in Bioinformatics. Approaches based on sequence comparison are commonly used to address it. Gene recognition tools use several different combinations of sequences; in this work we propose comparing many cDNA sequences with a DNA sequence. We address this proposal by formulating and studying a combinatorial optimization problem called the Multiple Spliced Alignment Problem. We describe this problem, show that it is NP-complete under the Levenshtein distance, and propose four heuristics to solve it. Based on these heuristics, we developed four gene recognition tools that compare a DNA sequence with many cDNAs. These tools were evaluated on instances built from real human genome data, and their results were better than those of other gene recognition tools available in the literature.

SUBJECT(S)

gene recognition tools; sequence comparison; multiple spliced alignment problem; computer science
