Finding the most significant common sequence and structure motifs in a set of RNA sequences.
AUTOR(ES)
Gorodkin, J
RESUMO
We present a computational scheme to locally align a collection of RNA sequences using sequence and structure constraints. In addition, the method searches for the resulting alignments with the most significant common motifs, among all possible collections. The first part utilizes a simplified version of the Sankoff algorithm for simultaneous folding and alignment of RNA sequences, but maintains tractability by constructing multi-sequence alignments from pairwise comparisons. The algorithm finds the multiple alignments using a greedy approach and has similarities to both CLUSTAL and CONSENSUS, but the core algorithm assures that the pairwise alignments are optimized for both sequence and structure conservation. The choice of scoring system and the method of progressively constructing the final solution are important considerations that are discussed. Example solutions, and comparisons with other approaches, are provided. The solutions include finding consensus structures identical to published ones.
ACESSO AO ARTIGO
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=146942Documentos Relacionados
- The distribution of RNA motifs in natural sequences.
- RNAProfile: an algorithm for finding conserved secondary structure motifs in unaligned RNA sequences
- A computer method for finding common base paired helices in aligned sequences: application to the analysis of random sequences.
- Finding errors in DNA sequences.
- Discovering common stem–loop motifs in unaligned RNA sequences