freqAnalysis: program for the statistical analysis of nucleotide motifs |
Statistics: > Nucleotides > Codons > Amino acids > Amino acid dimers > Phase I heptamers > Phase III heptamers |
Analysis of S. cerevisiae The program was applied to protein-encoding nucleotide sequences from S. cerevisiae, downloaded as the file orf_coding.fasta from the Saccharomyces Genome Database ( ftp://genome-ftp.stanford.edu/pub/yeast/yeast_ORFs/orf_coding.fasta.Z .) To restrict the analysis to sequences likely to express native chromosomal yeast proteins, we removed from the dataset all mitochondrial sequences, insertion sequences, and ORFs containing internal in-frame stop signals, a total of 146 sequences. The algorithm was applied to the remaining 6161 sequences (8,553,465 nucleotides), consisting of both putative and experimentally identified orfs, with input values of j (codon length) = 3, k (oligo length) = 7, and p (phase) = 1 and 3 corresponding respectively to motifs known to induce programmed +1 and -1 frameshifts. |
© 2001 Giddings Group - All rights reserved. |