A set of viral DNA decamers enriched in transcription control signals.
AUTOR(ES)
Volinia, S
RESUMO
We studied the frequency distribution of oligonucleotides 10 bp long in a sample of 620 Kb of viral genomes, containing 102 sequences from GenBank, with the aim of detecting transcription control signals. Two thousand three hundred decamers had a frequency 10 times higher than the mean and were subjected to further statistical analysis. For each of the 2300 decamers (parents), we counted the individual frequencies of the 30 decamers differing from the parent by one base mutation (progeny) and then calculated two variance/mean chi squares for the progeny, with and without the parent. We then studied the distribution of the ratio between the two chi squares. Out of 2300 decamers, 10 times more frequent than average, 479 decamers had a chi square ratio of 1.9 or larger. In this final set, which corresponds to less than 0.05% of all possible decamers, 58 decamers were found to contain viral and eukaryotic transcription control elements, like NF-kB, Sp1 and others. Furthermore, this set contains an excess of signals of length 5, 6, 7, 8, 9 and 10, when compared to 150 random sets, bootstrapped from the same viral genomes.
ACESSO AO ARTIGO
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=328405Documentos Relacionados
- Enrichment of oligonucleotide sets with transcription control signals. II: Mammalian DNA.
- DNA sequences downstream of the adenovirus type 2 fiber polyadenylation site contain transcription termination signals.
- Splice site choice in a complex transcription unit containing multiple inefficient polyadenylation signals.
- Control of retroviral RNA splicing through maintenance of suboptimal processing signals.
- Polyadenylation and transcription termination in gene constructs containing multiple tandem polyadenylation signals.