BIROn - Birkbeck Institutional Research Online

    Most transcription factor binding sites are in a few mosaic classes of the human genome

    Evans, Kenneth J. (2010) Most transcription factor binding sites are in a few mosaic classes of the human genome. BMC Genomics 11 (1), ISSN 1471-2164.

    [img]
    Preview
    Text
    3125.pdf - Published Version
    Available under License Creative Commons Attribution.

    Download (1MB) | Preview

    Abstract

    Background: Many algorithms for finding transcription factor binding sites have concentrated on the characterisation of the binding site itself: and these algorithms lead to a large number of false positive sites. The DNA sequence which does not bind has been modeled only to the extent necessary to complement this formulation. Results We find that the human genome may be described by 19 pairs of mosaic classes, each defined by its base frequencies, (or more precisely by the frequencies of doublets), so that typically a run of 10 to 100 bases belongs to the same class. Most experimentally verified binding sites are in the same four pairs of classes. In our sample of seventeen transcription factors — taken from different families of transcription factors — the average proportion of sites in this subset of classes was 75%, with values for individual factors ranging from 48% to 98%. By contrast these same classes contain only 26% of the bases of the genome and only 31% of occurrences of the motifs of these factors — that is places where one might expect the factors to bind. These results are not a consequence of the class composition in promoter regions. Conclusions:This method of analysis will help to find transcription factor binding sites and assist with the problem of false positives. These results also imply a profound difference between the mosaic classes.

    Metadata

    Item Type: Article
    Keyword(s) / Subject(s): Regulatory regions, Eukaryotic genome, gene-regulation, sequence, DNA, discovery, drosophila, elements, SEQ, ornaization
    School: Birkbeck Schools and Departments > School of Science > Biological Sciences
    Depositing User: Administrator
    Date Deposited: 23 Feb 2011 13:27
    Last Modified: 17 Apr 2013 12:33
    URI: http://eprints.bbk.ac.uk/id/eprint/3125

    Statistics

    Downloads
    Activity Overview
    88Downloads
    94Hits

    Additional statistics are available via IRStats2.

    Archive Staff Only (login required)

    Edit/View Item Edit/View Item