On the statistical assessment of similarities in DNA sequences.
AUTOR(ES)
Reich, J G
RESUMO
The statistical behavior of the similarity score for unrelated DNA sequences calculated as letter-by-letter comparison or from various forms of optimal alignment was studied. It was found that natural DNA-sequences from a data base and true random sequences show the same statistical behavior in terms of such scores. This makes it possible to adopt a simple criterion for the rejection of fortuitous similarity. It is based on the mean and standard deviation of chance scores whose expected values, depending on chain length, gap penalty and probability of letter coincidence, may be calculated from formulae given in the paper.
ACESSO AO ARTIGO
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=318937Documentos Relacionados
- Statistical analyses of counts and distributions of restriction sites in DNA sequences.
- Statistical analysis of nucleotide sequences.
- 'ZSTATS'--a statistical analysis for potential Z-DNA sequences.
- On the statistical significance of nucleic acid similarities.
- Conservation of sequences in related genomes of Apodemus: constraints on the maintenance of satellite DNA sequences.