L1: RNA Synthesis and Processing Outline L1: RNA Synthesis and Processing • Transcription in Prokaryotes • Eukaryotic RNA Polymerases and General Transcription Factors • Regulation of Transcription in Eukaryotes • RNA Processing and Turnover Vad är en gen? What was a gene yesterday? gen promotorer signalpeptid transkriptionsstart RBS transkriptionstermineringssignal TAA TAG TGA ATG GTG TTG strukturell gen transkriptionsenhet What is a gene today? “A DNA segment that contributes to phenotype/function. In the absence of demonstrated function a gene may be characterized by sequencee, transcription or homology” – Wain et al, Genomics, 2002 “A locatable region of genomic sequence, which is associated with regulatory regions, transcribed regions and/or other functional sequence regions” – Pearson, Nature, 2006 “Alternatively spliced transcripts all belong to the same gene, even if the proteins that are produced are different” – the Gene Sweepstake team, 2003 The encode project-home work! A gene is a genomic sequence (DNA or RNA) directly encoding functional product molecules, either RNA or protein In the case that there are several functional products sharing overlapping regions, one takes the union of all overlapping genomic sequences coding for them This union must be coherent – i.e., done separately for final protein and RNA products – but does not require that all products necessarily share a common subsequence or just The gene is a union of genomic sequences encoding a coherent set of potentially overlapping functional products Nature 2007 Jun 14;447(7146):799-816 1 E. coli RNA polymerase Transkription i prokaryoter 1. Initiering: RNA-polymeras binder promotor, strukturella förändringar hos promotorpolymeraskomplexet (”bubbla”), initial transkription 2. Elongering: konformationsförändringar (håller hårdare om templatet), uppvindning av DNAt, RNA-syntes, ”proof-reading” 3. Terminering: Rho-oberoende terminatorer (inneboende terminatorer = hårnålsstrukturer), Rho-beroende terminatorer DNA footprinting (Part 1) Sequences of E. coli promoters 16-19 bp RNA polymeras: α2ββ´ω+ σ (ca 465 kDa) - sigmafaktorn påverkar DNA-bindande egenskaperna hos polymeraset - bindningsstyrkan till olika promotorer kan skilja 106 x - transkription initieras utan primer 5-9 bp • Startsignaler för RNA-syntes • Stark promotor: hög affinitet för RNA-polymeras • Konstitutionell promotor: alltid påslagen • Reglerbar promotor: går att slå på och stänga av DNA footprinting (Part 2) Sigmafaktorer (σ) i E. coli Gene Factor Use -35 sep -10 rpoD σ70 general TTGACA 16-19 bp TATAAT rpoH σ32 heat shock CCCTTGAA 13-15 bp CCCGATNT rpoN σ54 N-metabolism CTGGNA 6 bp TTGCA fliA σ28 (σF ) motility/chemotaxis CTAAA 15 bp GCCGATAA rpoE σE heat shock rpoS σS stress response 2 Transcription by E. coli RNA polymerase (Part 1) Transcription by E. coli RNA polymerase (Part 2) • • Structure of bacterial RNA polymerase (after ca 10 bp) ~ 12-14 bp Transcription termination in Prokaryotes RNA synthesis continues until the polymerase encounters a termination signal. The most common signal is a symmetrical inverted repeat of a GC-rich sequence followed by seven A residues. Alternatively, transcription of some genes is terminated by a specific termination protein (Rho), which binds extended segments of single-stranded RNA. Transcription termination Transcription termination (E. coli) Promoter RNA polymerase DNA RNA ”Hairpin” termination signal 5´ 5´ Other mechanism: Termination via Rho protein (hexamer) Pause-site Rho 3 Transkriptionsreglering Genreglering efter transkriptionsinitiering • Prematur transkriptionsterminering s.k. attenuering; ex. trp • Skiljer på positiv och negativ reglering • Positiv reglering: transkriptionsfaktor (aktivator) måste binda in till promotorn för att transkription skall ske - hjälper RNA-polymeras att binda till DNAt (rekrytering) - allosterisk verkan av steg efter polymerasinbindning • Antiterminering = en typ av positiv transkriptionsreglering vid vilken transkripionsfaktorer (proteiner) binder till RNApolymeras och modifierar det så att det kan läsa igenom speciella termineringssites - används av fager och i vissa operon • Negativ reglering:repressorprotein binder till operator och hindrar transkription - hindrar RNA-polymeras att binda till promotorn - alternativt håller kvar RNA-polymeraset Metabolism of lactose Negative control of the lac operon Positive control of the lac operon by glucose Reglering av laktos-operonet (E. coli) Catabolite Activator Protein (CAP) Promotor -35 Cykliskt Promotor= AMP ”Landningsplats” för (cAMP) RNA-polymeras CAP -10 Lac I S.D. 5´ Lac-repressor (Lac I) Laktos eller IPTG (syntetisk analog) DNA -10 Lac Z Lac I mRNA Lac I mRNA-start RNA-pol. -35 3´ cAMP/CAPbindningsställe Operatorsekvens S.D. 5´ Lac Z β-galaktosidas: Spjälkar laktos till galaktos och glukos Lac Y S.D. Lac Y Laktos-permeas: Reglerar införsel av laktos Lac A S.D. mRNA Lac A 3´ Thio-galaktosid-acetylas: Bryter ned ej klyvbara laktos-analoger Frånvaro av laktos: • Lac-repressorn binder till operator-sekvensen • Transkriptionen blockeras; inget β-galaktosidas-enzym (eller något av de andra enzymerna) produceras Närvaro av laktos (eller IPTG): • Lac-repressorn kan inte binda till operator-sekvensen • Transkriptionsker; β-galaktosidas-enzym (och alla de andra enzymerna) produceras Närvaro av laktos (eller IPTG) samt låga halter av glukos: • Halten av cAMP stiger • cAMP-CAP-komplex bildas som kan binda uppströms om promotorn • cAMP-CAP-komplex främjar transkriptionen (vägleder RNA-polymeraset) • Mer β-galaktosidas-enzym produceras 4 Eukaryotic RNA Polymerases and General Transcription Factors Structure of yeast RNA polymerase II Eukaryotic cells have three nuclear RNA polymerases that transcribe different classes of genes. They are complex enzymes, consisting of 12 to 17 different subunits each. They all have 9 conserved subunits, 5 of which are related to subunits of bacterial RNA polymerase. Eukaryotic RNA Polymerases and General Transcription Factors Formation of a polymerase II preinitiation complex in vitro (Part 1) RNA polymerase II is responsible for synthesis of mRNA and it has been the focus of most transcription studies. Unlike prokaryotic RNA polymerase, it requires initiation factors that (in contrast to bacterial σ factors) are not associated with the polymerase. General transcription factors are proteins involved in transcription from all polymerase II promoters. About 10% of the genes in the human genome encode transcription factors, emphasizing the importance of these proteins. Promoters contain several different regulatory sequence elements. Promoters of different genes contain different combinations of promoter elements, which appear to function together to bind general transcription factors. Formation of a polymerase II preinitiation complex in vitro (Part 2) Model of the polymerase II preinitiation complex 5 RNA polymerase II/Mediator complexes and transcription initiation The ribosomal RNA gene is transcribed by RNA polymerase I Initiation of rDNA transcription Transcription of RNA polymerase III genes Identification of eukaryotic regulatory sequences A eukaryotic promoter 6 The SV40 enhancer Action of enhancers DNA looping The immunoglobulin enhancer Electrophoretic-mobility shift assay Chromatin immunoprecipitation (Part 1) 7 Chromatin immunoprecipitation (Part 2) Structure of transcriptional activators Examples of DNA-binding domains Action of transcriptional activators Action of eukaryotic repressors Regulation of transcriptional elongation (Part 1) 8 Regulation of transcriptional elongation (Part 2) Regulation of transcriptional elongation (Part 3) Regulation of Transcription in Eukaryotes Regulation of Transcription in Eukaryotes The packaging of eukaryotic DNA in chromatin has important consequences for transcription, so chromatin structure is a critical aspect of gene expression. Chromatin can be modified by: Modifications of chromatin structure play key roles in the control of transcription in eukaryotic cells. • Modifications of histones • Interactions with HMG (high mobility group) proteins • Rearrangements of nucleosomes Actively transcribed genes are in relatively decondensed chromatin, which can be seen in polytene chromosomes of Drosophila. Histone acetylation (Part 1) Patterns of histone modification • Histone modification: • The amino-terminal tail domains of core histones are rich in lysine and can be modified by acetylation. Transcriptional activators and repressors are associated with histone acetyltransferases (HAT) and deacetylases (HDAC), respectively. • 9 Chromatin remodeling factors Chromatin remodeling factors are protein complexes that alter contacts between DNA and histones. They can reposition nucleosomes, change the conformation of nucleosomes, or eject nucleosomes from the DNA. Regulation of Transcription in Eukaryotes To facilitate elongation, elongation factors become associated with the phosphorylated C-terminal domain of RNA polymerase II. They include histone modifying enzymes and chromatin remodeling factors that transiently displace nucleosomes during transcription. Regulation of Transcription in Eukaryotes: DNA methylation • • DNA methylation is another general mechanism that controls transcription in eukaryotes. Methyl groups are added at the 5-carbon position of cytosines (C) that precede guanines (G) (CpG dinucleotides). RNA Processing and Turnover Most newly-synthesized RNAs must be modified, except bacterial RNAs which are used immediately for protein synthesis while still being transcribed. rRNAs and tRNAs must be processed in both prokaryotic and eukaryotic cells. Regulation of processing steps provides another level of control of gene expression. Processing of ribosomal RNAs Processing of transfer RNAs (Part 1) 10 Processing of transfer RNAs (Part 2) Processing of eukaryotic messenger RNAs (Part 1) Processing of eukaryotic messenger RNAs (Part 2) Formation of the 3´ ends of eukaryotic mRNAs Splicing of pre-mRNA RNA Processing and Turnover: Splicing Three sequence elements of pre-mRNAs are important: Sequences at the 5′ splice site, at the 3′ splice site, and within the intron at the branch point. Pre-mRNAs contain similar consensus sequences at each of these positions. Splicing takes place in large complexes, called spliceosomes, which have five types of small nuclear RNAs (snRNAs)—U1, U2, U4, U5, and U6. They are complexed with six to ten protein molecules to form small nuclear ribonucleoprotein particles (snRNPs). 11 Alternative splicing in Drosophila sex determination Alternative splicing of Dscam 12x48x33x2=38016 combinations RNA Processing and Turnover: RNA editing Editing of apolipoprotein B mRNA RNA editing is processing (other than splicing) that alters the protein-coding sequences of some mRNAs. It involves single base modification reactions such as deamination of cytosine to uridine and adenosine to inosine. Editing of the mRNA for apolipoprotein B, which transports lipids in the blood, results in two different proteins: Apo-B100 is synthesized in the liver by translation of the unedited mRNA. Apo-B48 is synthesized in the intestine from edited mRNA in which a C has been changed to a U by deamination. RNA Processing and Turnover: RNA degradation Aberrant mRNAs can also be degraded. Nonsense-mediated mRNA decay eliminates mRNAs that lack complete open-reading frames. When ribosomes encounter premature termination codons, translation stops and the defective mRNA is degraded. Ultimately, RNAs are degraded in the cytoplasm. Intracellular levels of any RNA are determined by a balance between synthesis and degradation. Rate of degradation can thus control gene expression. RNA Processing and Turnover rRNAs and tRNAs are very stable, in both prokaryotes and eukaryotes. Bacterial mRNAs are rapidly degraded, most have half-lives of 2 to 3 minutes. In eukaryotic cells, mRNA half-lives vary; less than 30 minutes to 20 hours in mammalian cells. Short-lived mRNAs code for regulatory proteins, levels of which can vary rapidly in response to environmental stimuli. mRNAs encoding structural proteins or central metabolic enzymes have long half-lives. 12 RNA Processing and Turnover mRNA degradation Degradation of eukaryote mRNAs is initiated by shortening of the poly-A tails. Rapidly degraded mRNAs often contain specific AUrich sequences near the 3′ ends which are binding sites for proteins that can either stabilize them or target them for degradation. These RNA-binding proteins are regulated by extracellular signals, such as growth factors and hormones. Degradation of some mRNAs is regulated by both siRNAs and miRNA. 13