<> "The repository administrator has not yet configured an RDF license."^^ . <> . . "Seven clusters in genomic triplet distributions"^^ . " Motivation: In several recent papers new algorithms were proposed for detecting coding regions without requiring learning dataset of already known genes. In this paper we studied cluster structure of several genomes in the space of codon usage. This allowed to interpret some of the results obtained in other studies and propose a simpler method, which is, nevertheless, fully\nfunctional. \n Results: Several complete genomic sequences were analyzed, using visualization of tables of triplet counts in a sliding window. The distribution of 64-dimensional vectors of triplet frequencies displays a well-detectable cluster structure. The structure was found to consist of seven clusters, corresponding to protein-coding information in three possible phases in one of the two complementary strands and in the non-coding regions. Awareness of the existence of this structure allows development of methods for the segmentation of sequences into regions with the same coding phase and non-coding regions.\n This method may be completely unsupervised or use some external information. Since the method does not need extraction of ORFs, it can be applied even for unassembled genomes. Accuracy calculated on the base-pair level (both sensitivity and specificity) exceeds 90%. This is not worse as compared to such methods as HMM, however, has the advantage to be much simpler and clear.\n"^^ . "2002" . . . . . . . . . . . . . "Alexander N."^^ . "Gorban"^^ . "Alexander N. Gorban"^^ . . "Tatyana G."^^ . "Popova"^^ . "Tatyana G. Popova"^^ . . "Andrei Yu"^^ . "Zinovyev"^^ . "Andrei Yu Zinovyev"^^ . . . . . . "Seven clusters in genomic triplet distributions (PDF)"^^ . . . . . . "Seven.pdf"^^ . . . "Seven clusters in genomic triplet distributions (Image (PNG))"^^ . . . . . . "preview.png"^^ . . . "Seven clusters in genomic triplet distributions (Indexer Terms)"^^ . . . . . . "indexcodes.txt"^^ . . "HTML Summary of #3077 \n\nSeven clusters in genomic triplet distributions\n\n" . "text/html" . . . "Theoretical Biology" . .