TY - JOUR
T1 - Explicit DNase sequence bias modeling enables high-resolution transcription factor footprint detection
AU - Yardimci, Galip Gürkan
AU - Frank, Christopher L.
AU - Crawford, Gregory E.
AU - Ohler, Uwe
N1 - Funding Information:
We thank Raluca Gordan for helpful discussions and Josh Schipper for sharing universal protein-binding microarray experiments for E2F1 factor. Human Frontier Science Program [RGY0093/2012 to U.O.]; National Institutes of Health [U54-HG004563 to G.E.C.] Funding for open access charge: Human Frontier Science Program [RGY0093/2012 to U.O.].
Publisher Copyright:
© The Author(s) 2014.
PY - 2014/10/29
Y1 - 2014/10/29
N2 - DNaseI footprinting is an established assay for identifying transcription factor (TF)-DNA interactions with single base pair resolution. High-throughput DNaseseq assays have recently been used to detect in vivo DNase footprints across the genome. Multiple computational approaches have been developed to identify DNase-seq footprints as predictors of TF binding. However, recent studies have pointed to a substantial cleavage bias of DNase and its negative impact on predictive performance of footprinting. To assess the potential for using DNase-seq to identify individual binding sites, we performed DNase-seq on deproteinized genomic DNA and determined sequence cleavage bias. This allowed us to build bias corrected and TF-specific footprint models. The predictive performance of these models demonstrated that predicted footprints corresponded to high-confidence TF-DNA interactions. DNase-seq footprints were absent under a fraction of ChIP-seq peaks, which we show to be indicative of weaker binding, indirect TF-DNA interactions or possible ChIP artifacts. The modeling approach was also able to detect variation in the consensus motifs that TFs bind to. Finally, cell type specific footprints were detected within DNase hypersensitive sites that are present in multiple cell types, further supporting that footprints can identify changes in TF binding that are not detectable using other strategies.
AB - DNaseI footprinting is an established assay for identifying transcription factor (TF)-DNA interactions with single base pair resolution. High-throughput DNaseseq assays have recently been used to detect in vivo DNase footprints across the genome. Multiple computational approaches have been developed to identify DNase-seq footprints as predictors of TF binding. However, recent studies have pointed to a substantial cleavage bias of DNase and its negative impact on predictive performance of footprinting. To assess the potential for using DNase-seq to identify individual binding sites, we performed DNase-seq on deproteinized genomic DNA and determined sequence cleavage bias. This allowed us to build bias corrected and TF-specific footprint models. The predictive performance of these models demonstrated that predicted footprints corresponded to high-confidence TF-DNA interactions. DNase-seq footprints were absent under a fraction of ChIP-seq peaks, which we show to be indicative of weaker binding, indirect TF-DNA interactions or possible ChIP artifacts. The modeling approach was also able to detect variation in the consensus motifs that TFs bind to. Finally, cell type specific footprints were detected within DNase hypersensitive sites that are present in multiple cell types, further supporting that footprints can identify changes in TF binding that are not detectable using other strategies.
UR - http://www.scopus.com/inward/record.url?scp=84925283402&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84925283402&partnerID=8YFLogxK
U2 - 10.1093/nar/gku810
DO - 10.1093/nar/gku810
M3 - Article
C2 - 25294828
AN - SCOPUS:84925283402
SN - 0305-1048
VL - 42
SP - 11865
EP - 11878
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - 19
ER -