TY - JOUR
T1 - Experimentally determined weight matrix definitions of the initiator and TBP binding site elements of promoters
AU - Kraus, Richard J.
AU - Murray, Elizabeth E.
AU - Wiley, Steven R.
AU - Zink, Nancy M.
AU - Loritz, Karla
AU - Gelembiuk, Gregory W.
AU - Mertz, Janet E.
N1 - Funding Information:
We thank Peggy Farnham for HeLa S-3 cells and Dick Burgess, Peggy Farnham and members of their laboratories and our laboratory for helpful discussions. We especially thank Paul Lambert, Gary Stormo, Grace Wahba and Nancy Thompson for helpful comments on the manuscript. This research was supported by US Public Health Service Research grants CA-07175, CA-09075, CA-09135 and CA-22443 from the National Cancer Institute.
PY - 1996
Y1 - 1996
N2 - The basal elements of class II promoters are: (i) a -30 region, recognized by TATA binding protein (TBP); (ii) an initiator (Inr) surrounding the start site for transcription; (iii) frequently a downstream (+10 to +35) element. To determine the sequences that specify an Inr, we performed a saturation mutagenesis of the Inr of the SV40 major late promoter (SV40-MLP). The transcriptional activity of each mutant was determined both in vivo and in vitro. An excellent correlation between transcriptional activity and closeness of fit to the optimal Inr sequence, 5'-CAG/TT-3', was found to exist both in vivo and in vitro. Employing a neural network technique we generated from these data a weight matrix definition of an Inr that can be used to predict the activity of a given sequence as an Inr. Using saturation mutagenesis data of TBP binding sites we likewise generated a weight matrix definition of the -30 region element. We conclude the following: (i) Inrs are defined by the nucleotides immediately surrounding the transcriptional start site; (ii) most, if not all, Inrs are recognized by the same general transcription factor(s). We propose that the mechanism of transcription initiation is fundamentally conserved, with the formation of pre-initiation complexes involving the concurrent binding of general transcription factors to the -30, Inr and, possibly, downstream elements of class II promoters.
AB - The basal elements of class II promoters are: (i) a -30 region, recognized by TATA binding protein (TBP); (ii) an initiator (Inr) surrounding the start site for transcription; (iii) frequently a downstream (+10 to +35) element. To determine the sequences that specify an Inr, we performed a saturation mutagenesis of the Inr of the SV40 major late promoter (SV40-MLP). The transcriptional activity of each mutant was determined both in vivo and in vitro. An excellent correlation between transcriptional activity and closeness of fit to the optimal Inr sequence, 5'-CAG/TT-3', was found to exist both in vivo and in vitro. Employing a neural network technique we generated from these data a weight matrix definition of an Inr that can be used to predict the activity of a given sequence as an Inr. Using saturation mutagenesis data of TBP binding sites we likewise generated a weight matrix definition of the -30 region element. We conclude the following: (i) Inrs are defined by the nucleotides immediately surrounding the transcriptional start site; (ii) most, if not all, Inrs are recognized by the same general transcription factor(s). We propose that the mechanism of transcription initiation is fundamentally conserved, with the formation of pre-initiation complexes involving the concurrent binding of general transcription factors to the -30, Inr and, possibly, downstream elements of class II promoters.
UR - http://www.scopus.com/inward/record.url?scp=0029874536&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0029874536&partnerID=8YFLogxK
U2 - 10.1093/nar/24.8.1531
DO - 10.1093/nar/24.8.1531
M3 - Article
C2 - 8628688
AN - SCOPUS:0029874536
SN - 0305-1048
VL - 24
SP - 1531
EP - 1539
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - 8
ER -