Characterization of U2 snRNA associated proteins: Functional Analysis of C. merolae Hsh49 Protein by Mona Aminorroayaee B.Sc., Tabriz University of Medical Science, 2009 Thesis Submitted in Partial Fulfillment of The Requirements for the Degree of Master of Science In Mathematical, Computer, and Physical Sciences (Biochemistry) The University of Northern British Columbia January 2018 ©Mona Aminorroayaee, 2018 i Abstract Pre-mRNA splicing is an important process involving the removal of non-coding regions (introns) from messenger RNA. Disruption of this mechanism can cause diseases such as cancer. A particle known as the spliceosome is responsible for recognition and removal of introns. This complex machinery consists of five small nuclear RNAs, U1, U2, U4, U5, and U6, and many proteins. Each snRNA associates with proteins to form small nuclear ribonucleo-proteins or snRNPs. Due to the considerable number of proteins involved in this process in humans, the study of splicing has been a struggle. Cyanidioschyzon merolae (C. merolae) is a unicellular acidophilic red alga that we hope will be a good model for studying splicing factors due to its greatly reduced complexity (only ~70 splicing proteins, compared to ~300 in humans). Thus, we are investigating the interactions of snRNAs with their specific proteins by expressing the entire snRNP particles recombinantly and investigating their function. I sought to use this system to investigate the interactions between the U2 snRNA and its proteins. U2 snRNA associates with two multi-protein complexes, SF3a and SF3b. The SF3b complex is required for the recognition of the intron’s branch point and precise excision of introns from pre-mRNA. I attempted to express all of these proteins simultaneously as well as individually, with my efforts ultimately focused on Hsh49. This is one of five subunits of the SF3b protein complex in C. merolae. I was not able to detect any interaction between U2 snRNA and the Hsh49 protein by Electrophoretic mobility shift assay (EMSA), Fluorescence polarization (FP), or immunoprecipitation analysis. Nevertheless, Circular Dichroism (CD) showed that Hsh49 folded properly. ii TABLE OF CONTENTS ABSTRACT ....................................................................................................................................................... II TABLE OF CONTENTS ..................................................................................................................................... III LIST OF FIGURES ............................................................................................................................................. V LIST OF TABLES ............................................................................................................................................ VIII CHAPTER 1 INTRODUCTION ............................................................................................................................ 1 1.1 PRE-MRNA SPLICING............................................................................................................................................. 2 1.2 SPLICEOSOME ASSEMBLY ........................................................................................................................................ 3 1.3 SPLICING AND DISEASE ........................................................................................................................................... 4 1.4 ROLE OF U2 SNRNP IN YEAST AND HUMANS.............................................................................................................. 5 1.4.1 Structure of U2 snRNA ................................................................................................................................... 6 1.5 U2 SNRNP IN C. MEROLAE ..................................................................................................................................... 8 1.5.1 U2 snRNA proteins ........................................................................................................................................ 9 1.6 PREVIOUS EXPRESSION OF U2-ASSOCIATED PROTEINS ................................................................................................. 15 1.7 OVERALL RESEARCH OBJECTIVE .............................................................................................................................. 16 CHAPTER 2 CO-EXPRESSION OF ALL U2 SNRNPS AND SF3A PROTEINS .......................................................... 18 2.1 MATERIALS AND METHODS ................................................................................................................................... 20 2.1.1 Preparation of C. merolae genomic DNA .................................................................................................... 20 2.1.2 Construction of Co-expression Vectors containing U2 snRNP genes and SF3a genes ................................. 20 2.1.3 Co-expression and Purification of the U2 snRNPs: ...................................................................................... 27 2.2 RESULTS AND DISCUSSION..................................................................................................................................... 29 CHAPTER 3 INDIVIDUAL EXPRESSION AND PURIFICATION OF PRP9, PRP11, PRP21 AND RDS3 PROTEINS ..... 38 3.1 MATERIAL AND METHODS..................................................................................................................................... 39 3.1.1 Preparation of C. merolae genomic DNA .................................................................................................... 39 3.1.2 Construction of expression Vectors containing Prp9, Prp11, Prp21, Rds3 and Hsh49 genes ...................... 39 3.1.3 Expression and purification of Prp9, Prp11, Prp21 and Rds3 proteins ........................................................ 41 3.2 RESULTS AND DISCUSSION..................................................................................................................................... 46 CHAPTER 4 FUNCTIONAL ANALYSIS OF C. MEROLAE HSH49 PROTEIN ........................................................... 61 4.1.1 Fluorescence Anisotropy Assay ................................................................................................................... 63 4.2 MATERIALS AND METHODS ................................................................................................................................... 63 4.2.1 Circular Dichroism Spectropolarimetry........................................................................................................ 63 4.2.2 RNA-Protein Binding Assays ........................................................................................................................ 65 4.2.2.1 Fluorescence Anisotropy Assay ...................................................................................................................................65 4.2.2.2 Fluorescence-based electrophoretic mobility shift assay (F-EMSA) ............................................................................69 4.2.2.3 Electrophoretic mobility shift assay (EMSA) ................................................................................................................70 4.2.2.4 Generation of [32p]-labelled U2 snRNA by IVT............................................................................................................70 4.2.3 Co-immunoprecipitation of Hsh49 .............................................................................................................. 73 4.2.3.1 Dot blot assay ..............................................................................................................................................................73 4.2.3.2 Western blot assay ......................................................................................................................................................73 4.2.3.3 Preparation of C. merolae whole-cell extract ..............................................................................................................74 4.2.3.4 Generating anti-CmHsh49 antibody ............................................................................................................................74 4.2.3.5 Hsh49 co-immunoprecipitation ...................................................................................................................................74 4.2.3.6 Northern blot analysis .................................................................................................................................................74 iii 4.2.4 Alignment of Hsh49 amino acid and U2 snRNA sequences ......................................................................... 75 4.3 RESULTS ............................................................................................................................................................ 76 4.3.1 Expression and Purification of C. merolae Hsh49 ........................................................................................ 76 4.3.2 Sequence alignment of C. merolae U2 snRNA and Hsh49 with S. cerevisiae and Homo sapiens U2 snRNA and Hsh49 ............................................................................................................................................................ 80 4.3.3 Circular Dichroism Spectropolarimetry........................................................................................................ 86 4.3.4 Fluorescence Anisotropy (FA) Assay ............................................................................................................ 87 4.3.5 Fluorescence-based electrophoretic mobility shift (F-EMSA) Assay ............................................................ 92 4.3.6 EMSA for detecting the Hsh49 protein-U2 snRNA interaction .................................................................... 93 4.3.7 Co-Immunoprecipitation ............................................................................................................................. 98 4.3.7.1 Hsh49 Dot Blot Assay ...................................................................................................................................................98 4.3.7.2 Western Blot Assay ......................................................................................................................................................99 4.3.7.3 Hsh49 Co-Immunoprecipitation ................................................................................................................................100 4.3.7.4 Northern Blot Analysis ...............................................................................................................................................101 4.4 DISCUSSION ..................................................................................................................................................... 102 CHAPTER 5 GENERAL DISCUSSION ...............................................................................................................108 5.1 FUTURE DIRECTIONS .......................................................................................................................................... 111 LITERATURE CITED .......................................................................................................................................112 iv List of Figures Figure 1................................................................................................................................................... 2 Figure 2. The pathway of spliceosome assembly during pre-mRNA splicing......................................... 4 Figure 3. Secondary structures of free U2 snRNA and base pairing interaction of U2 snRNA to the branch point and a comparison of yeast and human sequences. ......................................................... 6 Figure 4. Secondary structure of the U2 and U6 snRNA complex in humans and yeast. ...................... 8 Figure 5. Model for Rds3 function. ...................................................................................................... 10 Figure 6. The structure of the SF3a core. ............................................................................................. 13 Figure 7. Interaction network between U2 snRNA proteins. ............................................................... 15 Figure 9. Construction of the U2 snRNP co-expression vectors. Each gene of the U2 snRNP was first introduced into a single plasmid and then all genes were combined serially into plasmids until they were present in one final plasmid. Plasmids were digested with SwaI (S) or PacI (P), followed by heat inactivation, T4 DNA polymerase treatment and annealing................................................................ 26 Figure 10. Construction of the SF3a co-expression vectors. Individual SF3a genes were first introduced into a single plasmid and then all genes were combined serially into plasmids until they were present in one final plasmid. Plasmids were digested with SwaI (S) or PacI (P), followed by heat inactivation, T4 DNA polymerase treatment and annealing................................................................ 27 Figure 11. Determining the presence of Prp11, Prp21, Prp9, Prp5 and Prp43 genes in pQlink by PCR. .............................................................................................................................................................. 30 Figure 12. Determining the presence of U2 snRNA, Prp11 and Rse1 genes in pQlink by PCR. ........... 30 Figure 13. Determining the presence of Hsh155 and Cus1 genes in pQlink by PCR. ........................... 31 Figure 14. Determining the presence of Rds3, Prp11, Rse1 and Hsh49 genes in pQlink by PCR. ........ 32 Figure 15. Determining the presence of each SF3a gene in pQlink. .................................................... 33 Figure 16. Small scale co-expression and purification of Sms-U2 snRNA and Sms-U2 snRNP constructs. ............................................................................................................................................ 34 Figure 17. Large-scale co-expression and purification of Sms, Sms-U2 snRNA and Sms-U2 snRNP constructs. ............................................................................................................................................ 35 Figure 18. Co-expression of the SF3a from pQlink. .............................................................................. 36 Figure 19. Determining the presence of each SF3a gene in PMCSG23................................................ 47 Figure 20. SF3a protein expression from pQlink. ................................................................................. 48 Figure 21. SF3a protein expression from PMCSG23 expression vector. .............................................. 49 Figure 22. Expression of Rds3 from pQlink. ......................................................................................... 50 Figure 23. Expression of Rds3 from PMCSG23 expression vector. ...................................................... 51 Figure 24. Solubility test of the Rds3. .................................................................................................. 51 Figure 25. Purification of Rds3. ............................................................................................................ 53 Figure 26. TEV protease cleavage and dialysis of the Rds3. ................................................................ 54 Figure 27. Purification of Rds3 after TEV protease cleavage and dialysis............................................ 55 Figure 28. Expression of the Hsh49 protein from PMCSG23 expression vector. ................................. 57 Figure 29. Purification of the Hsh49 was analyzed on 15% high TEMED SDS gel. ............................... 59 Figure 30. HumanU2 snRNA-protein crosslinks (Dybkov et al. 2006). ................................................. 62 Figure 31. The basic principle of fluorescence polarization (Pagano et al. 2011). .............................. 63 Figure 32. Expression of the Hsh49 protein from PMCSG23 expression vector. ................................. 77 Figure 33. Purification of the Hsh49 was analyzed on 15% high TEMED SDS gel. ............................... 79 v Figure 34. Sequence alignment of C. merolae (C.m) U2 snRNA with S. cerevisiae (S.c) U2 snRNA. .... 81 Figure 35. Sequence alignment of C. merolae (C.m) U2 snRNA with Homo sapiens (H.S) U2 snRNA. 82 Figure 36. Sequence alignment of C. merolae (C.m) Hsh49 protein with S. cerevisiae (S.C) Hsh49 protein. ................................................................................................................................................. 83 Figure 37. Sequence alignment of C. merolae (C.m) Hsh49 protein with Homo sapiens (H.S) Hsh49 protein. ................................................................................................................................................. 84 Figure 38. A Multiple sequence alignment of C. merolae (C.m) Hsh49 with S. cerevisiae (S.C) and Homo sapiens Hsh49 (H.S) protein. ..................................................................................................... 85 Figure 39. A Multiple sequence alignment of C. merolae (C.m) Hsh49 with S. cerevisiae (S.C) and Homo sapiens Hsh49 (H.S) protein. ..................................................................................................... 85 Figure 40. A Multiple sequence alignment of C. merolae (C.m) Hsh49 with S. cerevisiae (S.C) and Homo sapiens Hsh49 (H.S) protein. ..................................................................................................... 85 Figure 41. CD spectra of the Hsh49 secondary structure. ................................................................... 86 Figure 42. Melting temperature (Tm) of the recombinant Hsh49 protein. ......................................... 87 Figure 43. The result of the fluorescence anisotropy experiment of the U4 snRNA-Snu13 interaction. The concentration of the Snu13 was between 0 to 400 nM. .............................................................. 88 Figure 44. The results of the fluorescence anisotropy of the recombinant Hsh49 and fluorescein labelled of the U2 snRNA. .................................................................................................................... 89 Figure 45. The results of the fluorescence anisotropy experiments of the U2 snRNA-Hsh49 interaction. ........................................................................................................................................... 90 Figure 46. A comparison of the conformational changes of the fluorescein U2 snRNA oligonucleotide in three different types of binding buffers by using the fluorescence anisotropy assay. ⁎⁎⁎ ₌ p< 0.001..................................................................................................................................................... 92 Figure 47. Fluorescence-based electrophoretic mobility shift assay (F-EMSA) with recombinant Hsh49 and fluorescent labelled U2 snRNA. ......................................................................................... 93 Figure 48. Electrophoretic mobility shift assay with recombinant Hsh49 and radioactive [32p]labelled U2 snRNA. ............................................................................................................................... 94 Figure 49. Electrophoretic mobility shift assay with recombinant Hsh49 and radioactive [32p]labelled U2 snRNA. ............................................................................................................................... 95 Figure 50. Electrophoretic mobility shift assay with recombinant Hsh49 and radioactive [32p]labelled U2 snRNA. ............................................................................................................................... 96 Figure 51. Electrophoretic mobility shift assay with cleaved and uncleaved recombinant Hsh49 and radioactive [32p]-labelled U2 snRNA. .................................................................................................. 97 Figure 52. Electrophoretic mobility shift assay of uncleaved recombinant Hsh49 and His-MBP tags (negative control) with radioactive [32p]-labelled U2 snRNA. ............................................................ 98 Figure 53. Dot blot test. Determining the sensitivity of the anti-Hsh49 antiserum generated against recombinant C. merolae Hsh49 protein. .............................................................................................. 99 Figure 54. Specificity of the anti-Hsh49 antiserum via Western blot analysis using different concentrations of the recombinant C. merolae Hsh49. ..................................................................... 100 Figure 55. Western blot of Hsh49 C. merolae whole cell extract. ..................................................... 101 Figure 56. Northern blot analysis of the immunoprecipitated RNA probed for U2 snRNA. .............. 102 Figure 57. The crystal structure of the first RRM of Homo sapiens Hsh49 (PDB file 5GVQ; Kuwasako et al. 2017). ............................................................................................................................................ 104 vi Figure 58. The secondary structure of the first RRM of Homo sapiens Hsh49 (PDB file 5GVQ; Kuwasako et al. 2017). ....................................................................................................................... 104 Figure 59. The crystal structure of the second RRM of Homo sapiens Hsh49 (PDB file 1X5T; Sato et al., to be published). .......................................................................................................................... 105 Figure 60. The secondary structure of the second RRM of Homo sapiens Hsh49 (PDB file 1X5T; Sato et al., to be published). ...................................................................................................................... 105 Figure 61. The Crystal structure of the S. cerevisiae Hsh49 protein (PDB file 5LSB; vanRoon et al. 2017). ................................................................................................................................................. 106 Figure 62. The secondary structure of the S. cerevisiae Hsh49 (PDB file 5LSB; vanRoon et al. 2017). ............................................................................................................................................................ 106 vii List of Tables Table 1. Comparison of all U2 snRNA proteins in humans, yeast, and C. merolae a ........................... 14 Table 2. The U2 snRNA and all U2 snRNP genes with their associated accession numbers, masses, and lengths (Stark et al. 2015). ............................................................................................................ 19 Table 3. DNA oligonucleotides which were used to amplify and sequence the desired gene. The first DNA oligonucleotide for each gene is the forward and the second DNA oligonucleotide in the reverse primer. . ................................................................................................................................................ 22 Table 4. U2snRNP and SF3a plasmids which were used to make the related co-expression vector... 24 Table 5. DNA oligonucleotides which were used to amplify and sequence the desired gene. . ......... 41 Table 7. MDG non-inducing media. ..................................................................................................... 43 Table 8. ZYM-5052 Auto-inducing media. ............................................................................................ 44 Table 9. Measurement parameters for the CD spectra of Hsh49. ....................................................... 64 Table 10. The fluorescence anisotropy reagent (Buffer 1) used for the detection of the U2 snRNAHsh49 interaction. (Hsh49 with benzonase and without benzonase). ................................................ 66 Table 11. The fluorescence anisotropy reagent (Buffer 2) used for the detection of the U2 snRNAHsh49 interaction. ................................................................................................................................ 66 Table 12. The fluorescence anisotropy reagent (Buffer 3) used for the detection of the U2 snRNAHsh49 interaction. ................................................................................................................................ 67 Table 13. The fluorescence anisotropy reagent (Buffer 4) used for the detection of the U2 snRNAHsh49 interaction. ................................................................................................................................ 67 Table 14. The fluorescence anisotropy reagent (Buffer A) used for detection of conformational changes of the U2 snRNA. .................................................................................................................... 68 Table 15. The fluorescence anisotropy reagent (Buffer B) used for detection of conformational changes of the U2 snRNA. .................................................................................................................... 68 Table 16. The fluorescence anisotropy reagent (Buffer C) used for detection of conformational changes of the U2 snRNA. .................................................................................................................... 68 Table 17. The F-EMSA reagent used for the detection of the U2 snRNA-Hsh49 interaction. ............. 69 Table 18. The labelling reaction ........................................................................................................... 70 Table 19. EMSA reagents (Binding Buffer Reagent 1) used for the detection of the U2 snRNA-Hsh49 interaction. ........................................................................................................................................... 71 Table 20. EMSA reagents (Binding Buffer Reagent 2) used for the detection of the U2 snRNA-Hsh49 interaction. ........................................................................................................................................... 72 viii Acknowledgments I would like to thank first and foremost my supervisor, Dr. Stephen Rader, for giving me the opportunity to conduct my research in his lab. Stephen provided a friendly training environment that motivates me to be curious in each step of performing my research and have a deep understating of the Pre-mRNA splicing process. I would also like to thank Stephen for the time that he patiently spent to introduce and explain complicated concepts in this area. I really enjoyed my studies during these years. I would like to acknowledge the members of my supervisory committee, Dr. Kerry Reimer, Dr. Brent Murray, and Dr. Keith Egger for their valuable questions and informative comments on this thesis. I am also so thankful to past and current members of the Rader lab: Dr. Liz Dunn, Ambreen Siraj, Fatimat Shidi, Viktor Slat, Maya de Vos, Radu Pasca, Raliat Abioye, Kevin Huolt, and Samantha Smith for their critique questions and insightful conversation that helped me to develop my knowledge and I would like to thank Corbin Black for his assistance to find the best possible sequence alignment data. I would also like to acknowledge Sebastian J. Mackedenski for his help in circular dichroism spectropolarimetry data collection. I would particularly like to thank Dr. Martha Stark for introducing practical techniques in the Lab and for all her helpful and informative advices through my project. I am also grateful to Brooke Boswell and Luke Spooner for proofing and editing of this thesis. ix I would like to thank UNBC for granting me the financial support that I was given to pursue my studies. My special thanks go to my husband, Rahim Pasha Khajei, for his ongoing support and wise counsel over the years. I am so thankful to my parents, Yasamin Bozarjomehr and Javad Aminorroayaee for their love and guidance, and my brothers Emia and Aria for making me happy through passing though times. Thank you very much, everyone! Mona Aminorroayaee March 22, 2018 x Chapter 1 Introduction Pre messenger RNA splicing is an important process involving the removal of noncoding regions (introns) from messenger RNA, which is carried out by a highly complex and dynamic machine called the spliceosome. This complex machinery consists of five small nuclear RNAs (snRNAs), U1, U2, U4, U5, and U6, and many proteins. Each snRNA associates with proteins to form small nuclear ribonucleoproteins, (snRNPs). Due to the considerable number of proteins involved in this process in humans, the study of splicing has been a struggle. Cyanidioschyzon merolae (C. merolae) is a unicellular acidophilic red alga that we hope will be a good model for studying splicing factors due to its greatly reduced complexity (only ~70 splicing proteins, compared to ~300 in humans; Will et al. 2011). Thus, we are investigating the interactions of snRNAs with their specific proteins by expressing entire snRNP particles recombinantly in bacteria. During spliceosome assembly, U1 snRNA recognizes and base-pairs with the 5’ splice site of the pre-mRNA. However, Stark et al. (2015) could not find any candidates for the U1-associated proteins or U1 snRNA, hinting that C. merolae does not have a U1 snRNP. Biochemical studies of the U2 snRNA would help to improve understanding of the initiation of the spliceosome assembly in the absence of the U1 snRNA, and provide information about the interaction of the U2 snRNA with its specific proteins. 1 3UHP51$6SOLFLQJ 7KHƍVSOLFHVLWH ƍ66 WKHƍVSOLFHVLWH ƍ66 DQGWKHEUDQFKSRLQWVLWH %36 DUH UHDFWLYHVHFWLRQVRQWKHSUHP51$WUDQVFULSWDQGWKHVHFRQWDLQVPDOOFRQVHUYHGVHTXHQFHV &KLRXHWDO 3UHP51$VSOLFLQJLQYROYHVUHPRYDORIQRQFRGLQJVHTXHQFHVNQRZQDV LQWURQVDQGVXEVHTXHQWMRLQLQJRIDGMDFHQWSURWHLQFRGLQJVHTXHQFHVFDOOHGH[RQV )LJXUH  ,QRWKHUZRUGVVSOLFLQJKDSSHQVRQSUHFXUVRUPHVVHQJHU51$ZKLFKLVWUDQVFULEHGIURP WHPSODWH'1$LQWKHQXFOHXVWRJHQHUDWHPDWXUHPHVVHQJHU51$ P51$ 0DWXUHP51$ FDQWKHQEHWUDQVODWHGLQWRSURWHLQDIWHUH[SRUWWRWKHF\WRSODVP 3UHP51$VSOLFLQJFRPSULVHVWZRWUDQVHVWHULILFDWLRQUHDFWLRQV,QWKHILUVWUHDFWLRQ WKHƍK\GUR[\OJURXSRIWKHDGHQRVLQHZLWKLQWKHEUDQFKSRLQWRIWKHLQWURQUHDFWVZLWKWKH SKRVSKRGLHVWHUERQGRIWKHƍVSOLFHVLWH )LJXUH WKLVUHDFWLRQOHDGVWRDIUHHƍH[RQDQG ODULDWLQWURQ,QWKHVHFRQGUHDFWLRQWKHSKRVSKRGLHVWHUERQGDWWKHƍVSOLFHVLWHUHDFWVZLWK WKHƍK\GUR[\OORFDWHGDWWKHƍH[RQ7KURXJKWKLVSURFHVVWKHH[RQVDUHMRLQHGWRHDFK RWKHUDQGWKHLQWURQLVHOLPLQDWHG :DKOHWDO)LJXUH   )LJXUH3UHP51$VSOLFLQJLQYROYLQJUHPRYDORIQRQFRGLQJLQWURQVDQGVXEVHTXHQWMRLQLQJRIDGMDFHQW SURWHLQFRGLQJH[RQV([RQVDQGLQWURQVDUHLQGLFDWHGZLWKER[DQGWKLFNOLQHUHVSHFWLYHO\/HWWHU $  UHSUHVHQWVEUDQFKSRLQW$GHQRVLQH 2 1.2 Spliceosome Assembly The spliceosome is a large RNA-protein complex located in the nucleus, and it identifies and interacts with the three reactive regions of substrate transcripts (5′SS, 3′SS, BPS) in order to perform pre-mRNA splicing (Chiou et al. 2014). The spliceosome consists of five small nuclear ribonucleoproteins (snRNPs) including U1, U2, U4, U6, and U5, which are the major components of the spliceosome. Each snRNP contains a snRNA and associated proteins (Wahl et al. 2009). The human spliceosome includes five small nuclear RNAs and more than 300 proteins, while the spliceosome in Saccharomyces cerevisiae (S. cerevisiae) contains five small nuclear RNAs and almost 100 different proteins (Jurica et al. 2003; Wahl et al. 2009, Fabrizio et al. 2009). The assembly of the spliceosome starts with recognition and base pairing of U1 and U2 snRNPs to the 5′ splice site and branch site of the pre-mRNA respectively. The U2 snRNA binds to the branch site, from which the catalytic adenosine at the branch point binding sequence bulges out (Figure 2, Commitment Complex). Single molecule analysis posits that either U1 or U2 can bind first to the pre-mRNA during the early steps of spliceosome assembly (Hoskins et al, 2016). The U6 snRNA associates with the U4 snRNA to form the U4/U6 di-snRNA; then, the binding of U5 snRNA to U4/U6 di-snRNP results in creation of the U4/U6.U5 tri-snRNP (Will et al. 2011). After association of tri-snRNP to form the pre-spliceosome (Figure 2, left side), the interaction between U1 snRNA and the 5′ splice site is disrupted, as well as that between the U4 and U6 snRNAs. Next, U6 snRNA associates with U2 snRNA and the 5′ splice site through base pairing interactions (Figure 2, Active Spliceosome). The order and arrangement of the spliceosome is key to the identification and excision of introns from the rest of the pre-mRNA during splicing (Wahl et 3 DO 7KHVHUHDUUDQJHPHQWVDQGFKDQJHVDUHQHFHVVDU\IRUWKHVSOLFHRVRPHWRSURFHHGWR WKHWUDQVHVWHULILFDWLRQUHDFWLRQ %URZ $WWKHHQGRIWKHVSOLFLQJSDWKZD\WKHVSOLFHG LQWURQLVUHOHDVHGDQGWKH88DQG8VQ513VZKLFKZHUHFRQQHFWHGWRWKHODULDWLQWURQ GLVDVVHPEOHDQGDUHUHF\FOHGIRUWKHQH[WURXQGRIVSOLFLQJ &KDQJHWDO)LJXUHWRS ULJKW   )LJXUH7KHSDWKZD\RIVSOLFHRVRPHDVVHPEO\GXULQJSUHP51$VSOLFLQJ 6SOLFLQJDQG'LVHDVH 6LQFHSUHP51$VSOLFLQJLVDYLWDOSURFHVVLQJHQHH[SUHVVLRQLQRUGHUWRFUHDWH PDWXUHPHVVHQJHU51$ P51$ DQGWKHQIXQFWLRQDOSURWHLQVPLVUHJXODWLRQLQVSOLFLQJ SURFHVVHVFDXVHVDZLGHYDULHW\RIKXPDQGLVHDVHV$Q\DGGLWLRQRUUHPRYDORIQXFOHRWLGHVWR P51$DVDUHVXOWRIVSOLFLQJHUURUVFDQOHDGWRGLVUXSWLRQRIWKHRSHQUHDGLQJIUDPHRI P51$DQGFRQVHTXHQWWUXQFDWLRQRIWKHHQFRGHGSURWHLQ )DXVWLQRDQG&RRSHU  4 Research indicates that approximately half of all genetic diseases are due to disruptions in splicing (Lopez-Bigas et al. 2005). Examples of human genetic diseases caused by mutations in splicing include: Retinitis pigmentosa, which leads to blindness; spinal muscular atrophy, which causes mortality in childhood; myotonic dystrophy, the most common muscular disorder in adults; Frasier syndrome; Parkinson’s disease; and Atypical cystic fibrosis (Faustino and Cooper 2003). In addition, a wide range of cancers has been linked to errors in splicing. Therefore, it seems that a better understanding of the splicing reaction is an essential step to providing more research opportunities for treatment of human genetic diseases. 1.4 Role of U2 snRNP in Yeast and Humans Many of the interactions in the spliceosome seem to function in ensuring that activation occurs at the right time and place. As mentioned above, U2 snRNA binds to the branch point, thereby inducing the branch site A to bulge out in its active conformation. At the same time, U2 components such as SF3b appear to sequester the bulged A to prevent it from reacting prematurely (Lardelli et al. 2010). In addition, U6 snRNA associates with U2 snRNA and the 5′ splice site through base pairing interactions (Wahl et al. 2009). The association of the U2 snRNP with the pre-mRNA branch site is one of the primary stages in the pre-mRNA splicing process and plays a fundamental role during splicing assembly since they directly participate in chemical catalysis (Nilsen 1998). Moreover, the U2 snRNP undergoes several rearrangements along with the recognition of the branch point adenosine and formation of base pair interaction between theU2 snRNA and “consensus sequence within the intron” (Sashital et al. 2007). 5 6WUXFWXUHRI8VQ51$ 7KHƍHQGRI8VQ51$ZKLFKLVKLJKO\FRQVHUYHGFRQWDLQVVHYHUDOVWHP±ORRSV DQGVLQJOHVWUDQGHGUHJLRQVZKLFKLQWHUDFWZLWKWKH8VQ51$WKHLQWURQDQGWKHVQ513 SURWHLQV )LJXUHDQG)LJXUH 'RQPH]HWDO  LQYHVWLJDWHWKDWVWHP,LQWKHƍ WHUPLQXVRI8VQ51$KDVDIXQFWLRQLQHDUO\VWHSVRIVSOLFHRVRPHDVVHPEO\6WUXFWXUDO VWXGLHVRIWKH88FRPSOH[VKRZWKHSUHVHQFHRIWKH8VWHP,DVDVWDEOHVWUXFWXUHLQWKLV FRPSOH[ 6DVKLWDOHWDO  D E   )LJXUH6HFRQGDU\VWUXFWXUHVRIIUHH8VQ51$DQGEDVHSDLULQJLQWHUDFWLRQRI8VQ51$WRWKH EUDQFKSRLQWDQGDFRPSDULVRQRI\HDVWDQGKXPDQVHTXHQFHV D 7KHƍHQGRI8VQ51$FRPSULVHVWKH EUDQFKSRLQWUHFRJQLWLRQVHTXHQFH EOXH DQGWKUHHVWHP±ORRSVLQFOXGLQJVWHP, UHG VWHP,,DDQGVWHP,,E 6DVKLWDOHWDO  E %DVHSDLULQJLQWHUDFWLRQEHWZHHQEUDQFKSRLQWUHFRJQLWLRQRIWKH8VQ51$DQG EUDQFKSRLQW $ RIWKHSUHP51$<¶VGHPRQVWUDWHWKHSRO\S\ULPLGLQHWUDFW 7KHGHJUHHRIWKHSRVWWUDQVFULSWLRQDOPRGLILFDWLRQRI8VQ51$VLQ\HDVWDQG KXPDQVLVGLIIHUHQW7KHIRXU¶2PHWK\OJXDQRVLQHVDQGWZRSVHXGRXUDFLOVDUHH[DPSOHVRI 6 modifications that occur in Stem I of mammalian U2 snRNA (Massenet et al.1998), while there is no post-transcriptional modification in yeast Stem I. NMR structures of the U2 stem I from S. cerevisiae and the fully modified U2 stem I from humans reveal that there is a great similarity in structure. Despite the fact that they include divergent tandem wobble pairs and modified nucleotides found in human U2 stem I, this indicates that the overall folds of the stem loops of U2 snRNA are similar in yeast and humans (Sashital et al. 2007, Figure 3a). However, the U2 snRNA stem I is more stable in humans than yeast stem I (Sashital et al. 2007, Figure 3a). Most binding of the U2 snRNP to the branch site in humans occurs through the polypyrimidine tract (Figure 3b) in an ATP-dependent manner. It seems this base pairing interaction increases the stability of the binding between U2 snRNA and the polypyrimidine tract, and helps to specify the exact location of branch formation. With this in mind, it is clear that without this interaction, the first step of the splicing pathway and spliceosome assembly would be impossible (Kramer et al. 1988; Wu et al. 1989). However, in yeast, introns do not contain a polypyrimidine tract (Rymond and Rosbash 1985; Rymond et al. 1987). Overall, from yeast to mammals, U2 snRNA has been conserved evolutionarily (Ares et al. 1986; Igel and Ares. 1988). The base paring interaction between the U2 snRNA and the branch point with the intron and the pre-mRNA splicing mechanism is similar in both yeast and mammals (Parker et al. 1987; Chabot et al. 1985; Ruskin et al. 1988, Zhuang et al. 1989). 7 Figure 4. Secondary structure of the U2 and U6 snRNA complex in humans and yeast. The secondary structure of the U2 and U6 snRNA in humans was presented by Sun and Manley (Sun and Manley. 1995) The NMR secondary structure of the U2 and U6 snRNA in yeast was proposed by Sashital (Sashital et al. 2004) 1.5 U2 snRNP in C. merolae Cyanidioschyzon merolae (C. merolae) is a unicellular, acidophilic red alga. The small size of the genome and reduced number of introns in C. merolae make the idea of studying splicing in this system an interesting subject. C. merolae contains the smallest genome size (16.5 Mb) among eukaryotes (Misumi et al. 2005; Suzuki et al. 1992). The number of genes in the C. merolae genome is the same as S. cerevisiae (yeast), but 26 genes contain introns (0.5% of the genome) in C. merolae (Matsuzaki et al. 2004) compared to 287 in S. cerevesiae (Juneau et al. 2007). It is worth mentioning that C. merolae grows in acidic hot springs with a pH of ⁓1.5 and temperature of ⁓ 45 C (Matsuzaki et al. 2004), raising the possibility that its snRNAs are intrinsically more stable. It is also the first alga in which the 8 genome was sequenced and the first whose sequence was 100% completed (Nozaki et al. 2007). By considering such features, some interesting questions emerge – such as whether or not proteins involved in the splicing process can remain stable in high temperatures and/or how the process of splicing occurs in this organism. A comprehensive bioinformatics study carried out by Stark et al. (2015) has determined that the C. merolae spliceosome contains four snRNAs including U2, U4, U5, and U6. This research also did not find any candidates for the U1-associated proteins or U1 snRNA, suggesting that C. merolae does not have a U1 snRNP. By considering this fact, it would be interesting to know how splicing starts in the absence of the U1 snRNA and if it is possible that the U2 snRNA starts the splicing process in C. merolae. These questions motivated me to study the U2 snRNA in C. merolae. In this regard, it is imperative to know the U2 snRNA’s components and understand the role and function of each particle during the splicing process. 1.5.1 U2 snRNA proteins Here I will cover all of the functions of the U2 snRNA proteins which have been recognized either in yeast or humans. The U2 snRNA associates with ten specific proteins and seven Sm proteins in C. merolae, making the U2 snRNP the most complex particle in splicing (Table 1). The following are the proteins U2 snRNA associates with:  SF3a and SF3b 9 7KHVHDUHWZRPXOWLSURWHLQFRPSOH[HV6)DFRQWDLQVWKUHHVXEXQLWVLQFOXGLQJ3US 3USDQG3USDQG6)EFRQVLVWVRIILYHVXEXQLWV+VK+VK&XV5VHDQG5GV SURWHLQVDOORIZKLFKDUHSUHVHQWLQ&PHURODH+RZHYHU6)ELQ6FHUHYLVLDHKDVRQH DGGLWLRQDOSURWHLQ3@ODEHOOHG8VQ51$ZDVWLWUDWHGZLWKWKH+VK 7KHILUVW(06$ZDVFRPSOHWHGE\XVLQJELQGLQJEXIIHUUHDJHQWV 7DEOH DQG WKH(06$ELQGLQJUHDFWLRQVPL[WXUHZDVLQFXEDWHGDWURRPWHPSHUDWXUHIRUPLQXWHV &RPSDULQJWKHUHVXOWLQJIUHH8VQ51$ODQH /DQH WRWKHODQHVFRQWDLQLQJ+VKDQG8 93 VQ51$ /DQHV௅ QREDQGVZHUHREVHUYHGWKDWLQGLFDWHGWKH+VK8VQ51$IRUPHGD FRPSOH[ )LJXUH   )LJXUH(OHFWURSKRUHWLFPRELOLW\VKLIWDVVD\ZLWKUHFRPELQDQW+VKDQGUDGLRDFWLYH>S@ODEHOOHG8 VQ51$+VKFRQFHQWUDWLRQVDUHLQGLFDWHGRQWRSDQGIUHH8VQ51$LVVKRZQLQ/DQH7KH851$ROLJR FRQFHQWUDWLRQZDVQ0DQGLQFXEDWLRQZDVGRQHDW&IRUPLQXWHV7KHJHOZDVUXQIRUPLQXWHVDW 9ROWDJH7KHELQGLQJEXIIHUUHDJHQWVIRUWKLV(06$DUHOLVWHGLQ7DEOH ,UHSHDWHGWKH(06$ZLWKDGLIIHUHQWELQGLQJEXIIHU%LQGLQJ%XIIHU5HDJHQWV 7DEOH DQGLQFXEDWLRQWHPSHUDWXUH&IRUPLQXWHV ,JHOHWDO 7KHELQGLQJ EXIIHUUHDJHQWVZDVXVHGIRUWKHUHPDLQLQJ(06$SURFHGXUHV(YHQE\FKDQJLQJWKHVH FRQGLWLRQVWKHUHZDVQRVKLIWLQGLFDWLQJWKH+VK8VQ51$LQWHUDFWLRQ )LJXUH  94  )LJXUH(OHFWURSKRUHWLFPRELOLW\VKLIWDVVD\ZLWKUHFRPELQDQW+VKDQGUDGLRDFWLYH>S@ODEHOOHG8 VQ51$+VKFRQFHQWUDWLRQVDUHLQGLFDWHGRQWRSDQGIUHH8VQ51$LVVKRZQLQ/DQH7KH851$ROLJR FRQFHQWUDWLRQZDVQ0DQG,QFXEDWLRQZDVGRQHDW&IRUPLQXWHV7KHJHOZDVUXQIRUPLQXWHVDW YROWDJH7KLV(06$ZDVGRQHE\XVLQJWKHELQGLQJEXIIHUUHDJHQWVLQ7DEOH 3ULRUWRPL[LQJWKHUHDJHQWVDQ51$SUREHZDVKHDWHGDW&IRUWKUHHPLQXWHV DQGDOORZHGWRFRRORQLFHIRUWZRPLQXWHVWRHQVXUHSURSHUIROGLQJRIWKHVQ51$$Q LQFXEDWLRQWHPSHUDWXUHRI&ZDVXVHGIRU+VKZLWKQREHQ]RQDVHDQG&DQG& ZHUHXVHGIRUWKH+VKZKLFKZDVZLWKEHQ]RQDVH,FKRVHD&LQFXEDWLRQWHPSHUDWXUHWR EHFORVHWRWKHWHPSHUDWXUHWKDWWKH&PHURODHJURZVLQWKHLUQDWXUDOVHWWLQJV & LQWKH KRSHWKDWWKLVWHPSHUDWXUHZRXOGKHOSWKH51$WRXQIROGDQGIDFLOLWDWHWKHSURWHLQ51$ LQWHUDFWLRQ 0DWVX]DNLHWDO 1REDQGUHSUHVHQWLQJWKH8VQ51$+VKFRPSOH[ IRUPDWLRQZDVGHWHFWHGZKHQFRPSDULQJIUHH8VQ51$ZLWKWKHRWKHUODQHV )LJXUH  95  )LJXUH(OHFWURSKRUHWLFPRELOLW\VKLIWDVVD\ZLWKUHFRPELQDQW+VKDQGUDGLRDFWLYH>S@ODEHOOHG8 VQ51$+VKFRQFHQWUDWLRQVDUHLQGLFDWHGRQWRSDQGIUHH8VQ51$LVVKRZQLQODQH/DQHDQG/DQH /DQHVDQGFRQWDLQWKH+VKZLWKEHQ]RQDVH :% DQGLQFXEDWHGZLWK8VQ51$DW&IRUPLQXWHV /DQHVDQGFRQWDLQWKH+VKZLWKQREHQ]RQDVH 1R% DQGLQFXEDWHGZLWK8VQ51$DW&IRU PLQXWHV/DQHVDQGFRQWDLQWKH+VKZLWKEHQ]RQDVHDQGLQFXEDWHGZLWK8VQ51$DW&IRU PLQXWHV7KHFRQFHQWUDWLRQRIWKH8ROLJRZDVQ0DQGWKHROLJRZDVKHDWHGDW&IRUPLQXWHVDQG FRROHGRQLFHIRUPLQXWHVEHIRUHLQFXEDWLRQ7KHJHOZDVUXQIRUPLQXWHVDWYROWDJH ,QRUGHUWRGHWHUPLQHZKHWKHUFOHDYLQJ+LVDQG0%3WDJVIURP+VKFDXVHGWKLV SURWHLQWRPLVIROG,SHUIRUPHG(06$WRFRPSDUHWKHUHVXOWVZLWKWKHFOHDYHGDQGXQFOHDYHG +VK )LJXUH :KHQFRPSDULQJXQFOHDYHG /DQHV௅ DQGFOHDYHG /DQHV௅  +VKLQ)LJXUHDVORZHUPLJHUDWLQJEDQGZDVIRUPHGLQXQFOHDYHG+VKODQHV7KLV EDQGFRUUHVSRQGVWRDSURWHLQ51$FRPSOH[IRUPDWLRQDQGWKHLQWHQVLW\RIWKHEDQG LQFUHDVHGZLWKWKHSURWHLQFRQFHQWUDWLRQ 96  )LJXUH(OHFWURSKRUHWLFPRELOLW\VKLIWDVVD\ZLWKFOHDYHGDQGXQFOHDYHGUHFRPELQDQW+VKDQG UDGLRDFWLYH>S@ODEHOOHG8VQ51$&OHDYHGDQGXQFOHDYHG+VKFRQFHQWUDWLRQVDUHLQGLFDWHGRQWRSDQG IUHH8VQ51$LVVKRZQLQ/DQHDQG/DQH7KHFRQFHQWUDWLRQRIWKH8VQ51$ROLJRZDVQ07KH8 VQ51$ROLJRZDVKHDWHGDW&IRUPLQXWHVDQGFRROHGRQLFHIRUPLQXWHVEHIRUHLQFXEDWLRQ LQFXEDWLRQ ZDVDW&IRUPLQXWHV 7KHJHOUDQIRUPLQXWHVLQYROWDJH 7RLQVXUHLIWKH8VQ51$LQWHUDFWVZLWKWKHXQFOHDYHG+VKSURWHLQ,UDQ DQRWKHU(06$WRWHVWWKHLQWHUDFWLRQRIWKH8VQ51$ZLWKWKHXQFOHDYHG+VKDQGXVHG WKH+LV0%3WDJVDVDQHJDWLYHFRQWURO6XUSULVLQJO\EDQGVDSSHDUHGLQ+LV0%3WDJVODQHV QHJDWLYHFRQWURO/DQHV௅ ZKLFKZHUHVLPLODUWRWKHEDQGVWKDWZHUHREVHUYHGLQWKH XQFOHDYHG+VK/DQHV௅RI)LJXUH7KLVREVHUYDWLRQVXJJHVWVWKDW8VQ51$GRHVQRW DVVRFLDWHZLWKWKHXQFOHDYHG+VKEXWLWIRUPVDFRPSOH[ZLWKWKH+LV0%3WDJV 97  )LJXUH(OHFWURSKRUHWLFPRELOLW\VKLIWDVVD\RIXQFOHDYHGUHFRPELQDQW+VKDQG+LV0%3WDJV QHJDWLYHFRQWURO ZLWKUDGLRDFWLYH>S@ODEHOOHG8VQ51$8QFOHDYHGUHFRPELQDQW+VKDQG+LV0%3 WDJVFRQFHQWUDWLRQVDUHLQGLFDWHGRQWRSDQGIUHH8VQ51$LVVKRZQLQ/DQHDQG/DQH7KHFRQFHQWUDWLRQ RIWKH8VQ51$ROLJRZDVQ07KH8VQ51$ROLJRZDVKHDWHGDW&IRUPLQXWHVDQGFRROHGRQLFH IRUPLQXWHVEHIRUHLQFXEDWLRQ LQFXEDWLRQZDVDW&IRUPLQXWHV 7KHJHOUDQIRUPLQXWHVDW YROWV &R,PPXQRSUHFLSLWDWLRQ +VK'RW%ORW$VVD\ %HIRUHGRLQJFRLPPXQRSUHFLSLWDWLRQDGRWEORWDVVD\ZDVSHUIRUPHGWRGHWHUPLQH WKHVHQVLWLYLW\DQGHIIHFWLYHQHVVRIWKHSRO\FORQDO+VKDQWLERG\ZKLFKZDVUDLVHGDJDLQVW SXULILHGUHFRPELQDQW+VKSURWHLQ )LJXUH %DVHGRQWKHGRWEORWUHVXOWWKH+VK DQWLERG\FRXOGGHWHFWWKHSXULILHG+VKE\FRPSDULQJDQWL+VKDQWLVHUXPZLWKQRQ LPPXQHVHUXP SUHEOHHG   98  )LJXUH'RWEORWWHVW'HWHUPLQLQJWKHVHQVLWLYLW\RIWKHDQWL+VKDQWLVHUXPJHQHUDWHGDJDLQVW UHFRPELQDQW&PHURODH+VKSURWHLQ :HVWHUQ%ORW$VVD\ $OVRD+VK:HVWHUQEORWWHVWZDVSHUIRUPHG )LJXUH WRVWXG\WKHVSHFLILFLW\ RIWKHDQWL+VKDQWLVHUXP'LIIHUHQWFRQFHQWUDWLRQVRIUHFRPELQDQW+VKSURWHLQZHUH SUREHGZLWKWKHDQWL+VKDQWLVHUXP SULPDU\DQWLERG\ DQGJRDWDQWLUDW+53DQWLERG\ VHFRQGDU\DQWLERG\ :HVWHUQEORWDQDO\VLVVKRZHGWKDWUHFRPELQDQW+VKSURWHLQZLWKWKH FRQFHQWUDWLRQVRI—JDQGQJFRXOGEHGHWHFWHGE\WKHDQWL+VKDQWLVHUXP$EDQGRI WKHH[SHFWHGVL]HIRU+VK ⁓N'D ZDVREVHUYHGDVVKRZQLQ)LJXUH$OVRVRPH QRQVSHFLILFEDQGVDSSHDUHGLQWKH:HVWHUQEORWDQDO\VLVZKLFKFRXOGEHGXHWRWKHKLJK DPRXQWRIWRWDOSURWHLQORDGHGRQWKHJHODQGWKHORZHUEDQGLVUHODWHGWRWKHSURWHLQ GHJUDGDWLRQ 99  )LJXUH6SHFLILFLW\RIWKHDQWL+VKDQWLVHUXPYLD:HVWHUQEORWDQDO\VLVXVLQJGLIIHUHQW FRQFHQWUDWLRQVRIWKHUHFRPELQDQW&PHURODH+VK +VK&R,PPXQRSUHFLSLWDWLRQ ,QHHGHGWRGHWHUPLQHLIWKHDQWL+VKDQWLVHUXP²JHQHUDWHGDJDLQVWWKH UHFRPELQDQW+VKSURWHLQ²FRXOGGHWHFWDQGLPPXQRSUHFLSLWDWH+VKSURWHLQIURP& PHURODHH[WUDFW,PPXQRSUHFLSLWDWLRQRIWKH+VKDOORZVIRUWKHUHPRYDODQGLGHQWLILFDWLRQ RIRWKHUSDUWLFOHVIURPWKH&PHURODHH[WUDFWWKDWDUHDVVRFLDWHGZLWK+VK%DVHGRQ :HVWHUQEORWDQDO\VLVLQ)LJXUHWKH+VKSURWHLQZDVLPPXQRSUHFLSLWDWHGZLWKDQWL +VKDQWLVHUXPDVVKRZQLQODQHEXWQRWZLWKQRQLPPXQHVHUXP ODQH DVZDV H[SHFWHG$VWKHEDQGLQ/DQHFRPLJUDWHGZLWKWKHUHFRPELQDQW+VK U+VK ERWK EDQGVLQGLFDWLQJDVL]HRI⁓N'D,FRQFOXGHWKDWWKHDQWL+VKDQWLVHUXPFRXOG LPPXQRSUHFLSLWDWHWKH+VKSURWHLQIURP&PHURODHH[WUDFW 100  )LJXUH:HVWHUQEORWRI+VK&PHURODHZKROHFHOOH[WUDFW/DQHQJRIWKHUHFRPELQDQWDQGSXULILHG &P+VKSURWHLQ U+VK /DQHLPPXQRSUHFLSLWDWHIURPDQWL+VKDQWLVHUXP Į+VK DQG/DQH LPPXQRSUHFLSLWDWHIURPQRQLPPXQHVHUXP 1,6  1RUWKHUQ%ORW$QDO\VLV 1RUWKHUQDQDO\VLVRIWKHLPPXQRSUHFLSLWDWHG51$VKRZHGWKDWQREDQGZDV REVHUYHGZKLFKPLJKWFRUUHVSRQGWRWKH8VQ51$%\FRPSDULQJWKHVXSHUQDWDQWDQG LPPXQRSUHFLSLWDWHGSHOOHWVQR8VQ51$ZDVSUHFLSLWDWHGE\WKHDQWL+VKDQWLVHUXP IURP&PHURODHH[WUDFW )LJXUH      101   )LJXUH1RUWKHUQEORWDQDO\VLVRIWKHLPPXQRSUHFLSLWDWHG51$SUREHGIRU8VQ51$/DQHWRWDO 51$IURP&PHURODHZKROHFHOOH[WUDFW/DQH௅VXSHUQDWDQW 6 DQGSHOOHW 3 IURPQRQLPPXQHVHUXP ZKLFKLVWKHFRQWURODQG/DQH௅VXSHUQDWDQW 6 DQGSHOOHW 3 IURPWKHDQWL+VKDQWLVHUXP 'LVFXVVLRQ 6XFFHVVIXOH[SUHVVLRQDQGSXULILFDWLRQRIWKH+VKSURWHLQPDGHLWSRVVLEOHWR SXUVXHIXUWKHULQYHVWLJDWLRQVDQGUHVHDUFKRQWKH+VKSURWHLQ7RVWXG\WKH+VKDQG8 VQ51$LQWHUDFWLRQILUVWFLUFXODUGLFKURLVP &' VSHFWURSRODULPHWU\ZDVSHUIRUPHG7KH&' VSHFWUDKHOSHGWRGHWHUPLQHLIWKH+VKLVIROGHGDQGWKHVHFRQGDU\VWUXFWXUHRIWKH UHFRPELQDQW+VKSURWHLQZDVDOVRLQYHVWLJDWHG,WLVZRUWKFRQVLGHULQJWKHSRVVLELOLW\RI 102 misfolding Hsh49 during the purification process. If Hsh49 misfolds or loses its secondary structure, this can affect the formation of a complex between the Hsh49-U2 snRNA. Based on the CD spectra result, recombinant Hsh49 contains 91% α-helices and 0% β-sheets. Also, CD spectra showed the thermal stability of this protein over a temperature range of 28.79 to 84.44 C. I compared the α-helix and β-sheet content of C. merolae Hsh49 with that of Homo sapien and S. cerevisiae Hsh49. The secondary structure of human Hsh49 consists of 19% αhelices and 23% β-sheets (Figure 57‒ Figure 60) while S. cerevisiae Hsh49 contains 23% αhelices and 26% β-sheets (Figure 61 and Figure 62). The content of α-helices and β-sheets in the secondary structure of Hsh49 in yeast and humans are close to each other. Based on the CD spectra results, the content of α-helices and β-sheets in C. merolae is very different from yeast and humans. The sequence alignment of the Hsh49 protein in C. merolae demonstrated that there is less similarity to the S. cerevisiae and Homo sapiens. This result suggests that the Hsh49 does not have the same function in C. merolae as it has in S. cerevisiae and Homo sapiens. Therefore, I decided to perform the functional analyses. As shown previously, the secondary structure of Hsh49 in C. merolae consists of 91% α-helices and 0% β-sheets (Figure 41). Hsh49 sequence alignment in C. merolae indicates it is 34% identical and 52% similar to the S. cerevisiae Hsh49. On the other hand, Hsh49 in C. merolae is 35% identical and 50% similar to the human Hsh49. Sequence alignment results indicate that the first sequences of C. merolae Hsh49 contain RRM1 and the second half matches contain RRM2. In addition, the results show that Homo sapiens Hsh49 RRM1 is more similar to C. merolae Hsh49 RRM1 than RRM2. 103 Therefore, based on the stability and folding of the recombinant Hsh49 protein that was obtained from CD spectra result, FA, EMSA and immunoprecipitation analysis were performed to characterize the Hsh49-U2 snRNA interaction. Figure 57. The crystal structure of the first RRM of Homo sapiens Hsh49 (PDB file 5GVQ; Kuwasako et al. 2017). Figure 58. The secondary structure of the first RRM of Homo sapiens Hsh49 (PDB file 5GVQ; Kuwasako et al. 2017). 104 Figure 59. The crystal structure of the second RRM of Homo sapiens Hsh49 (PDB file 1X5T; Sato et al., to be published). Figure 60. The secondary structure of the second RRM of Homo sapiens Hsh49 (PDB file 1X5T; Sato et al., to be published). 105 Figure 61. The Crystal structure of the S. cerevisiae Hsh49 protein (PDB file 5LSB; vanRoon et al. 2017). Figure 62. The secondary structure of the S. cerevisiae Hsh49 (PDB file 5LSB; vanRoon et al. 2017). F-EMSA indicated that the fluorescent U2 oligo was not degraded; therefore, fluorescence anisotropy was performed and the results did not show any interaction between the Hsh49 and the fluorescein labelled 35 nucleotide U2 snRNA oligo in C. merolae. It is possible that the Hsh49 interacts with different sites of the U2 snRNA which does not include 106 the 35 oligonucleotides that were used in the FA assays. Therefore, I decided to study the interaction of the full-length U2 snRNA with the recombinant Hsh49 protein by EMSA to determine if any interaction occurs. Based on previous studies that were done on Hsh49 originating from Homo sapiens and S. cerevisiae cells, it was shown that Hsh49 has an RNA binding activity and associates to the U2 snRNA. Igel et al. (1998) found that the S. cerevisiae GST-Hsh49 protein interacts with a 309-nt in vitro U2 snRNA transcript. This finding came from EMSA. Furthermore, Homo sapiens Hsh49 interacts with several regions of the full-length U2 snRNA which consists of the 5ʹ end of the U2 snRNA, and stem loops I and IIb (Dybkov et al. 2006). This is contrasted with my EMSA findings which showed that there was no interaction between the Hsh49-U2 snRNA in C. merolae. The FA and EMSA results confirm the Northern analysis which indicated that no interaction occurs between Hsh49-U2 snRNA. In all FEMSA and EMSA results, free U2 snRNA oligo showed two bands in all lanes and these bands could be related to the conformational changes of the U2 snRNA. Western blot analysis showed that the anti-Hsh49 antiserum which was generated against the recombinant Hsh49 could recognize and immunoprecipitate the Hsh49 from C. merolae extract. This result supports the successful recombination of the Hsh49 protein. It was demonstrated that Hsh49 in yeast interacts with the Cus1 but the presence of the Cus1 is not essential for the U2 snRNA-Hsh49 interaction in yeast (Igel et al. 1998). It is possible that the pre-mRNA splicing in C. merolae occurs differently between yeast and human cells. Perhaps, the presence of Cus1 or some other factors are necessary for this interaction in C. merolae. 107 Chapter 5 General Discussion C. merolae has a small genome, containing only 26 introns and a small number of core splicing proteins. In C. merolae, 43 splicing proteins are used compared to almost 90 in budding yeast and around 140 in Homo sapiens. Furthermore, C. merolae only contains four snRNAs including U2, U4, U5 and U6 snRNA, since no candidate for the U1- associated proteins or U1 snRNA were found (Stark et al. 2015). The reduced number of splicing proteins and introns enables C. merolae to be considered a simpler organism to study splicing. The U2 snRNP, with ten specific proteins and seven Sm proteins, is one of the most complex particle in C. merolae. In order to better understand the molecular function and structural role of the U2 snRNP and SF3a complex, the first step is to generate an in vitro reconstitution system. The prerequisite step towards this approach was the expression and purification of these small nuclear ribonucleoproteins. In Chapter 2, my major attempt was to co-express and purify seventeen recombinant proteins. Current studies suggest that the co-expression of proteins results in stability of each protein and prevents protein degradation. In addition, protein coexpression increases the solubility and yield of protein (Romier et al. 2006; Stefan et al. 2015). Previous studies showed that both Lsm and Sm complexes co-expressed and copurified successfully (Dunn, 2014, unpublished). However, co-expression of the U2 snRNP and SF3a protein complex was not successful. Not all of the U2 snRNA proteins could be expressed and SF3a appeared as an insoluble particle. 108 As a result of the unsuccessful co-expression of the U2 snRNPs and SF3a protein complex, I decided to express and purify some U2 snRNA proteins individually. Chapter 3 is concerned with the expression of the Rds3 and SF3a protein to study the function and structure of these proteins. Of the SF3a and Rds3 genes which were cloned individually into the pQlink vector, only Prp9 and Prp21 were expressed. None of the SF3a genes which were cloned in the PMCSG23 vector were expressed. The Rds3 protein was expressed and purified well after its gene was cloned in the PMCSG23 vector. However, I could not remove the His and MBP tags completely from this protein, and the tags could later interfere with the protein interactions or with obtaining the crystal structure of the desired protein. Chapter 4 describes the individual expression and purification of one of the U2 snRNA protein subunits, Hsh49. Since this protein could express and purify successfully, I performed some functional assays to determine the interaction between the Hsh49 protein and U2 snRNA. Before performing this step, I pursued CD spectra for two reasons: First to determine if the recombinant Hsh49 is folded, which would help me to do functional assays, and second, to assess the content of α-helices and β-sheets of the secondary structure of the Hsh49. The CD spectrum results indicated that the recombinant Hsh49 is folded and the secondary structure of the recombinant Hsh49 contains 91% α-helices, 0% β-sheets and 9% random coils. Unexpectedly, no interaction was observed between the portion of the U2 snRNA oligonucleotide (35 nucleotide) and the recombinant Hsh49 via fluorescence anisotropy assay. Electrophoretic mobility shift assay (EMSA) did not show any interaction between the recombinant Hsh49 and full length IVT U2 snRNA oligonucleotide. The EMSA results were 109 confirmed by the northern blot analysis of the immunoprecipitated RNA, for which there was no band observed indicating Hsh49-U2 snRNA interaction. The question that arises: is it possible that the Hsh49 was misidentified and it is not really the right protein? Sequence alignments of Hsh49 in C. merolae showed amino acids sequence identity with their S.cerevisiae and Homo sapiens homologs, with 12% and 9% respectively. Comprehensive bioinformatics survey of the C. merolae splicing machinery recognized four snRNAs (U2, U4, U5 and U6) and 43 core splicing proteins. In addition, BLAST searches using Reciprocal Best Hit methodology could identify all 10 U2 snRNA associated proteins including Hsh49 with E-values below the cutoff of 1E-13 (Stark et al. 2015). Importantly, 2′O-methyl- Mass spectrometry data confirmed the association of Hsh49 protein with U2 snRNA and some U2 snRNA proteins. This was performed using 2′Omethyl antisense oligonucleotide pull-downs against U2 snRNA of C. merolae extract. This experiment was followed by Northern analysis and mass spectrometry which showed that Hsh49 copurified with U2 snRNA (Reimer et al. 2017). These results provides confirmatory evidence that the Hsh49 is the right protein. The data gathered in this study showed that the recombinant Hsh49 did not interact with the U2 snRNA. It may be possible that the interaction between the Hsh49 and the U2 snRNA does not occur directly. More specifically, the presence of one or a group of the U2 snRNA proteins are necessary to mediate this interaction. Previous studies showed that the Cus1 protein directly interacts with the Hsh49 protein in yeast through the first RRM and this interaction is essential for the tethering of the U2 snRNA to the branch point of the premRNA (Igel et al. 1998; Champion-Arnaud et al. 1994). 110 Moreover, another subunit of the SF3b containing Hsh155 and Rse1 proteins interact with one another through the protein-protein interaction. These two proteins also associate with the Hsh49 and Cus1 protein complex. On the other hand, biochemical studies indicate that the U2 snRNA interacts with the Hsh155, Hsh49 and Cus1 in yeast (ChampionArnaud et al. 1994). 5.1 Future Directions In light of the fact that the interaction of Hsh49 to U2 snRNA might need other proteins of the SF3b complex, assembling U2 snRNP could help to detect the interaction of the Hsh49 protein with the U2 snRNA. One approach would be checking the interaction of the U2 snRNA-Hsh49 in the presence of the Cus1 protein. It seems that a deeper understanding of the U2 snRNA and Hsh49 protein is needed; therefore, other subunits of the SF3b complex should be studied. 111 Literature Cited Abovich, N., Lia, X. C., and Rosbash, M. (1994). The yeast MUD2 protein: an interaction with PRP11 defines a bridge between commitment complexes and U2 snRNP addition. Genes and Development. 8: 843-854. Abovich, N., and Rosbash, M. (1997). Cross-intron bridging interactions in the yeast commitment complex are conserved in mammals. Cell. 89: 403-412. Arenas, J. E., and Abelson, J. (1997). Prpp43: An RNA helicase-like factor involved in spliceosome disassembly. Proc Natl Acad Sci USA. 94 (22): 11798–11802. Ares, M., Jr. (1986). U2 RNA from yeast is unexpectedly large and contains homology to vertebrate U4, U5, and U6 small nuclear RNAs. Cell. 47: 49-59. Black, C. S., Garside, E., L, MacMillan, A., M, Rader, S., D. (2016). Conserved structure of Snu13 from the highly reduced spliceosome of Cyanidioschyzon merolae. Protein Science, 25: 911-916. Brosi, R., Groning, K., Behrens, S. E., Luhrmann, R., Kramer, A. (1993). Interaction of mammalian splicing factor SF3a with U2 snRNP and relation of its 60-kD subunit to yeast PRP9. Science, 262 (5130): 102-105. Brow, D. A. (2002). Allosteric cascade of spliceosome activation, Annu. Rev. Genet, 36: 333-360. Caspary, F., and Seraphin, B. (1998). The yeast U2A/U2B complex is required for prespliceosome formation. EMBO Journal, 17(21): 6348-6358. Chabot, B., Black, D. L., LeMaster, D. M., and Steitz, J. A. (1985). The 3ʹ splice site of premessenger RNA is recognized by a small nuclear ribonucleoprotein. Science, 230 (4732): 1344-1349. Champion-Arnaud, P., and Reed, R. (1994). The prespliceosome components SAP 49 and SAP 145 interact in a complex implicated in tethering U2 snRNP to the branch site. Genes and Development. 8: 1974-1983. 112 Chang, T. H., Tung, L., Yeh, F. L., Chen, J. H., Chang, S. L. (2013). Functions of the DExD/H-box proteins in nuclear pre-mRNA splicing. Elsevier, 1829(8): 764-774. Chang, J., Schwer, B., and Stewart Shuman, S. (2012). Structure function analysis and genetic interactions of the yeast branchpoint binding protein Msl5. Nucleic Acids Research, 40(10): 4539-4552. Chiou, N. T., and Lynch, K. W. (2014). Mechanism of spliceosomal assembly. In Hartel, K. J (Eds), Spliceosomal pre-mRNA splicing (pp. 35-43). Heidelberg New York: Springer. Combs, D. J., Nagel, R. J., Ares, Jr. M., Stevens, W. S. (2006). Prp43p Is a DEAH-Box Spliceosome Disassembly Factor Essential for Ribosome Biogenesis. Mol Cell Biol. 26(2): 523-534. Cretu, C., Schmitzová, J., Ponce- Salvetierra, A., Dybkov, O., De Laurentiis, E. I., Sharma, K., Will, C. L., Urlaub, H., Lührmann, R.,Vladimir Pena, V. (2016) Molecular Architecture of SF3b and structural consequences of its cancer-related mutations. Molecular and Cell Biology, 64(2): 307-319. Cvitkovic and Jurica, Nucleic Acids Research [Internet]. c2013. [cited 2015 June 10]. Available from: Spliceosomedb.ucsc.edu/proteins. Draper, D. E. (2004). A guide to ions and RNA structure. RNA Society, 10(3):335–343. Donmez, G., Hartmuth, K., Lührmann, R. (2004). Modified nucleotides at the 5' end of human U2 snRNA are required for spliceosomal E-complex formation. RNA Society, 10(12):1925-1933. Dunn EA. 2014. “Investigation of free U6 small nuclear ribonucleoprotein structure and function.” Ph.D. thesis, University of British Columbia, Vancouver. Dybkov, O., Will, C. L., Deckert, J. Behzadnia, N., Hartmuth, K., and Reinhard Luhrmann. (2006). U2 snRNA-protein contacts in purified human 17S U2 snRNPs and in spliceosomal A and B complexes. Molecular and Cellular Biology. 26(7): 2803-2816. Dyson, M. R. (2007). Expression strategy. In Dyson, M. R., and Durocher, Y (Eds.), Expression systems (pp. 1-11). Bioxham, Oxforshire: Scion. 113 http://www.ebi.ac.uk/Tools/psa/emboss_needle/ Fabrizio, P. Dannenberg, J., Dube, P., Kastner, B., Stark, H., Urlaub, H., Lührmann, R. (2009). The evolutionarily conserved core design of the catalytic activation step of the yeast spliceosome. Molecular and Cellular Biology, 36(4): 593-608. Faustino, N. A., and Cooper, T. A. (2003). Pre-mRNA splicing and human disease. Cold Spring Harbor Laboratory Press, Gene and Development, 17(4): 419-437. Golas, M. M., Sander, B., Will, C. L., Lührmann, R., Stark, H. (2003). Molecular architecture of the multiprotein splicing factor SF3b. Science. 300(5621): 980-984. Gozani, O., Feld, R., Reed, R. (1996). Evidence that sequence-independent binding of highly conserved U2 snRNP proteins upstream of the branch site is required for assembly of spliceosomal complex A. Genes and Development, 10: 233-243. Harlow, Ed., Lane, D. (1988). Antibodies: A laboratory manual. New York: Cold spring harbor laboratory, Chapter 5. Held, P., Jolley, M. E. (2008). Fluorescence polarization detection of Brucella abortus and Mycobacterium bovis specific antibodies using kits from Diachemix, BioTek Instruments, Inc. Hoskins, A. A., Rodgers, M. L., Friedman, L. J., Gelles, J., Moore, M. J. (2016). Single molecule analysis reveals reversible and irreversible steps during spliceosome activation. eLIFE Research article, 5: 1-22. Howe, C. (2007). Gene cloning and manipulation. New York: Cambridge university press. Chapter 8. Igel, H., Wells, S., Perriman, R., and Ares, M. (1998). Conservation of structure and subunit interactions in yeast homologues of splicing factor 3b (SF3b) subunit. RNA society, 4(1): 1-10. Igel, A. H. and M. Ares, Jr. (1988). Internal sequences that distinguish yeast from metazoan U2 snRNA are unnecessary for pre- mRNA splicing. Nature 334: 450-453. 114 Juneau, K., Curtis Palm, C., Miranda, M., and Davis, R. W. (2007). High-density yeast-tiling array reveals previously undiscovered introns and extensive regulation of meiotic splicing. National Academy of Sciences, 104(5): 1522-1527. Jurica, M. S., and Moore, M. J. (2003). Pre-mRNA splicing: awash in a sea of proteins. Molecular and Cellular Biology 12(1): 5-14. Kakehi, K., Oda, Y., and Kinoshita, M. (2001). Fluorescence Polarization: analysis of carbohydrate-protein interaction. Analytical Biochemistry, 297(2): 111-116. Kholod, N., and Mustelin, T. (2001). Novel vectors for co-expression of two proteins in E. coli. Biotechniques, 31(2): 322-323, 326-328. Kim, R. (2007). Protein expression in Escherichia coli. In Dyson, M. R., and Durocher, Y (Eds.), Expression systems (pp. 13-28). Bioxham, Oxforshire: Scion. Kistler, Al., Guthrie, C. (2001). Deletion of Mud2, the yeast homolog of U2AF65, can bypass the requirement for Sub2, an essential spliceosomal ATPase. Genes and Development.15: 42-49. Kramer, A. (1988). Presplicing complex formation requires two proteins and U2 snRNP. Genes and Development, 2(9): 1155-1167. Kramer, A., Mulhauser, F., Wersig, C., Groning, K., Bilbe, G. (1995). Mammalian splicing factor SF3a120 represents a new member of the SURP family of proteins and is homologous to the essential splicing factor PRP21p of Saccharomyces cerevisiae. RNA, 1(3): 260-272. Kuwasako, K. Nameki, N. Tsuda, K. M., Takahashi, M. Sato, A. Tochio, N. Inoue, M. Terada, T. Kigawa, T. Kobayashi, N. Shirouzu, M. Ito, T. Sakamoto, T. Wakamatsu, K. Güntert, P. Takahashi, S.Yokoyama, S., and Muto. Y. (2017). Solution structure of the first RNA recognition motif domain of human spliceosomal protein SF3b49 and its mode of interaction with a SF3b145 fragment. Protein science, 26(2): 280-291. Lardelli, R. M., Thompson, J. X., Yates, J. R., and Stevens, S. W., (2010). Release of SF3 from the intron branchpoint activates the first step of pre-mRNA splicing. RNA biology, 16(3):516-528. 115 Lea, W. A., and Simeonov, A. (2011). Fluorescence polarization assays in small molecule screening. Expert Opin Drug Disco. 6(1): 17-32. Legrain, P., and Chapon, C. (1993). Interaction between PRP11 and SPP91 yeast splicing factors and characterization of a PRP9-PRP11-SPP91 complex. Science, 262(5130): 108-110. Liang, W-W., and Cheng, S-C. (2015). A novel mechanism for Prp5 function in prespliceosome formation and proofreading the branch site sequence. Gene and Development, 29(1): 81-93. Lin, P. C., and Xu, R. M. (2012). Structure and assembly of the SF3a splicing factor complex of U2 snRNP. EMBO J, 31(6): 1579-1590. Lopez-Bigas, N., Audita, B., Ouzounisa, C., Parrab, G., Guigo, R. (2005). Are splicing mutations the most frequent cause of hereditary disease?. Federation of European Biochemical Societies, 579(9): 1900-1903. Louis-Jeune, C., Andrade-Navarro, M., Perez-Iratxeta, C. (2011). Prediction of protein secondary structure from circular dichroism using theoretically derived spectra. Proteins, Structure, Function, and Bioinformatics. 80(2): 347-381. Martin, A., Schneider, S., and Schwer, B. (2002). Prp43 Is an Essential RNA-dependent ATPase Required for Release of Lariat-Intron from the Spliceosome, J Biol Chem, 277(20): 17743-17750. Matsuzaki, M., Misumi, O., Shin-i, T., Maruyama, S., Takahara, M., Miyagishima, S. M., Mori, T., Keiji Nishida, K., Yagisawa, F., Nishida, K., Yoshida, Y., Nishimura, Y., Nakao, S., Kobayashi, T., Momoyama, Y., Higashiyama, T., Minoda, A., Sano, M., Nomoto, S., H., Oishi, K., Hayashi, H., Ohta, F., Nishizaka,S., Haga,S., Miura, S., Morishita, T., Kabeya, Y., Terasawa, K., Suzuki, Y., Ishii, Y., Asakawa, S., Takano, H., Ohta, N., Kuroiwa,H., Tanaka, K., Shimizu, N., Sugano, S., Sato, N., Nozaki, H., Ogasawara, N., Kohara, Y., and Kuroiwa, T. (2004). Genome sequence of the ultrasmall unicellular red alga Cyanidioschyzon merolae 10D, Nature, 428(6983): 653-657. Massenet, S., Mougin, A., Branlant, C. (1998). Post-transcriptional modifications in the U small nuclear RNAs. In Modifications and editing of RNA (eds. H. Grosjean and R. Benne), (pp. 201-228). ASM Press, Washington, DC. 116 Misra, V., K. Draper, D., E. (1998). On the role of magnesium ions in RNA stability. Nucleic Acid Sciences, 48(2): 113-135. Misumi, O., Matsuzaki, M., Nozaki, H., Miyagishima, S. Y., Mori, T., Nishida, K., Yagisawa, F.,Yoshida, Y., Kuroiwa, H., and Kuroiwa, T. (2005). Cyanidioschyzon merolae Genome. A tool for facilitating comparable studies on organelle biogenesis in photosynthetic eukaryotes. Plant Physiology, 137(2): pp. 567-585. Moll, J., R. Acharya, A. Gal, J. Mir, A., A. and Vinson, C. (2002). Magnesium is required for specific DNA binding of the CREB B-ZIP domain. Nucleic Acids Research. 30(5): 1240-1246. NCBI Gene [Internet]. [date unknown]. National Center for Biotechnology Information [NCBI]; [cited 2015 June 10]. Available from: http://www.ncbi.nlm.nih.gov/gene/ Nilsen, T.W. (1998). RNA-RNA interactions in nuclear pre-mRNA splicing. In RNA Structure and Function, M. Grundber-Manago and R.W. Simons (Eds.), Cold Spring Harbor Laboratory Press, 35: 279-307. Noble, J. S. C., Prives, C., and Manley, J. L. (1988). Alternative splicing of SV40 early premRNA is determined by branch site selection. Genes and Development, 2(11): 14601475. Nozaki, H., Takano, H., Misumi, O., Terasawa, K., Matsuzaki, M., Maruyama, S., Nishida, K., Yagisawa, F., Yoshida, Y., Fujiwara, T., Takio, S., Tamura, K., Sung Jin Chung, S. J., Nakamura, S., Kuroiwa, H., Tanaka, K., Sato, N., and Kuroiwa, T. (2007). A 100%-complete sequence reveals unusually simple genomic features in the hot-spring red alga Cyanidioschyzon merolae. 5(28): 1-8. http://nucleobytes.com/index.php/4peaks O’Day, C. L., Dalbadie, M. G., Abelson, J. (1996). The saccharomyces cerevisiae Prp5 protein has RNA-dependent ATPasae activity with specificity for U2 small nuclear RNA. Biol. Chem. 271: 33261-33267. Parker, R., Siliciano, P. G., and Guthrie, C. (1987). Recognition of the TACTAAC box during mRNA splicing in yeast involves base pairing to the U2-like snRNA. Cell, 49(2): 229-239. 117 Pagano, J., M. Clingman, C., C. and Ryder, S., P. (2011). Quantitative approaches to monitor protein–nucleic acid interactions using fluorescent probes. RNA society. 17(1): 1420. Patel, A. A., and Joan A. Steitz, J. A. (2003). Splicing double: insights from the second spliceosome. Mol. Cell, 4: 960-970. Pauling, M. H., McPheeters, D, S., Ares, Jr, M. (2000). Functional Cus1p is found with Hsh155p in a multiprotein splicing factor associated with U2 snRNA. Molecular and Cellular Biology, 20(6): 2176-2185. Pei, J., Kim, B. H., and Grishin, N. V. (2008). PROMALS3D: a tool for multiple sequence and structure alignment. Nucleic Acids Res. 36(7), 2295-2300. Perrin, F. (1926) Polarization de la Lumiere de Fluorescence. Vie Moyenne de Molecules dans L’etat Excite. J. Phys Radium. 7(12): 390-401. Rymond, B. C. and Rosbash, M. (1985). Cleavage of 5' splice site and lariat formation are independent of 3' splice site in yeast mRNA splicing. Nature 317: 735-737. Rymond, B. C., Torrey, D. D., Rosbash. M. (1987). A novel role for the 3' region of introns in pre-mRNA splicing of Saccharomyces cerevisiae. Genes Dev. 1: 238-246. Reimer, K. A., Stark, M. R., Aguilar, L-C., Stark, S. R., Burke, R. D., Moore, J., Fahlman, R. P., YIP, C. K., Kuroiwa, H., Oefinger, M., and Rader, S., D. (2017). The sole LSm complex in Cyanidioschyzon merolae associates with pre-mRNA splicing and mRNA degradation factors. RNA society, 23:1-16. Romier, C., Ben Jelloul, M., Albeck, S., Buchwald, G., Busso, D., Celie, P. H., Christodoulou, E., De Marco, V., van Gerwen, S., Knipscheer, P., Lebbink, J. H., Notenboom, V., Poterszman, A., Rochel, N., Cohen, S. X., Unger. T., Sussman, J. L., Moras, D., Sixma, T. K., Perrakis, A. (2006), Co-expression of protein complexes in prokaryotic and eukaryotic hosts: experimental procedures, database tracking and case studies. Biological crystallography, 62: 1232-1242. Ruskin, B., Zamore, P. D., and Green, M.R. (1988). A factor, U2AF is required for U2 snRNP binding and splicing complex assembly. Cell, (52): 207-219. 118 Rutz, B., and Seraphint, B. (1999).Transient interaction of BBP/ScSF1 and Mud2 with the splicing machinery affects the kinetics of spliceosome assembly. RNA society. 5: 918-831. Saccharomyces Genome Database [Internet]. [date unknown]. Stanford (CA): Stanford University; [cited 2015 June 10]. Available from: http://www.yeastgenome.org Sashital, D. G., Cornilescu, G., McManus, C. J., Brow, D. A., Butcher, S. E. (2004). U2–U6 RNA folding reveals a group II intron-like domain and a four-helix junction. Nat. Struct. Mol. Biol. 11: 1237–1242. Sashital, D. G., Venditti, V., Angers, C. G., Cornilescu, G., and Butcher, S. E. (2007). Structure and thermodynamics of a conserved U2 snRNA domain from yeast and human. Cold Spring Harbor Laboratory Press, 13(3): 328-338. Sato, A. et al. (no date) ‘Solution structure of the second RRM domain in splicing factor = 3B’, to be Published. doi: 10.2210/PDB1X5T/PDB. Scheich, C., Kummel, D., Soumailakakis, D., Heinemann, U., and Bussow, K. (2007). Nucleic Acids Research, 35(6): 1-7. Schmid-Burgk, J. L., Xie, Z., and Benenson, Y. (2014). Hierarchical ligation-independent assembly of PCR fragments. Springer science+business media, 1116: 49-58. Staknis, D., and Reed, R. (1994). Direct Interactions between Pre-mRNA and Six U2 Small Nuclear Ribonucleoproteins during Spliceosome Assembly. Molecular and Cellular Biology. 14(5): 2994-3005. Stark, H., Dube, P., Lührmann, R., Kastner, B. (2001). Arrangement of RNA and proteins in the spliceosomal U1 small nuclear ribonucleoprotein particle. Nature, 409 (6819): 539-42. Stark, M. R., Dunn, E. A.,Dunn, W. S. C., Grisdalec, C. J., Danielea, A. R., Halsteada, M. R. G., Fast, N. M., Rader, S. D. (2015). Dramatically reduced spliceosome in Cyanidioschyzon merolae. PNAS, 112(11): 1-10. Stefan, A., Ceccarelli, A., Conte, E., Monton, S. A., Hochkoeppler, A. (2015). The multifaceted benefits of protein co-expression in Escherichia coli. Journal of Visualized Experiment. 5(96): 1-9. 119 Sun, J. S. and Manley, J. L. (1995). A novel U2-U6 snRNA structure is necessary for mammalian mRNA splicing. Genes & Dev. 9: 843–854. Suzuki, K., Ohta, N. and Kuroiwa, T. (1992). lsolation of the cell-nuclear, mitochondrial, and chloroplast DNA from the ultrasmall eukaryote Cyanidioschyzon merolae. Protoplasma, 171(1-2): 80-4. Tan, S., Kern, R. C., and Selleck,W. (2005). The pST44 polycistronic expression system for producing protein complexes in Escherichia coli. Protein Expr. Purif. 40(2): 385-395. Tauchert, M. J., Fourmann, J-B., Christian, H., Luhrmann, R., Ficner, R. (2016) structural and functional analysis of the RNA helicase Prp43 from the thermophilic eukaryote chaetomium thermophilum. Structural Biology Communications. 72: 112-120. https://thebiogrid.org/ www.uniprot.org van Roon, A. M., Oubridge, C., Obayashi, E., Sposito, B., Newman, A. J., Seraphin, B. and Nagai, K. (2017) ‘Crystal structure of U2 snRNP SF3b components: Hsh49p in complex with Cus1p binding domain’, RNA, p. rna.059378.116. doi: 10.1261/rna.059378.116. Wang, Q., He, J., Lynn, B., and Rymond, B. C. (2005). Interactions of the yeast SF3b splicing factor. Molecular and Cellular Biology, 25(24): 10745-10754. Wang, Q., and Rymond, B. C. (2003). Rds3p is required for stable U2 snRNP recruitment to the splicing apparatus. Molecular and Cellular Biology. 23(20): 7339-7349. Wahl, M. C.,Will, C. L., and Luhrmann, R. (2009). The Spliceosome: Design Principles of a Dynamic RNP Machine, Cell, 136, 701-718. Wiest, D. K., O’Day, C. L., and Abelson, J. (1996).Vitro studies of the Prp9. Prp11. Prp21 complex indicate a pathway for U2 small nuclear ribonucleoprotein activation. Biological Chemistry, 271(52): 33268-3327. 120 Will, C. L., Urlaub, H., Achsel, T., Gentzel, M., Wilm, M., Luhrmann, R. (2002). Characterization of novel SF3b and 17S U2 snRNP proteins, including a human Prp5p homologue and an SF3b DEAD box protein. EMBO J, 21: 4978-4988. Will, C. L., and Lührmann, R. (2011). Spliceosome Structure and Function. Cold Spring Harbor Laboratory Press, 3(7): 1-23. Wu, S., C. Romfo, T. Nilsen, and M. Green. (1999). Functional recognition of the 3′ splice site AG by the splicing factor U2AF35. Nature, 402: 832-835 Wu, J., and Manley, J. L. (1989). Mammalian pre-mRNA branch site selection by U2 snRNP involves base pairing. Genes and Development. Cold Spring Harbor Laboratory Press, 3, 1553-1561. Wu, T., and Fu, X. D. (2015). Genomic functions of U2AF in constitutive and regulated splicing. RNA Biology, 12(5):479-785. Yan, D., Perriman, R., Igel, H., Howe, K. J., Nevile, M., and JR. M. A. (1998). CUS2, a yeast homolog of human Tat-SF1, rescues function of misfolded U2 through an unusual RNA recognition motif. Molecular and Cellular Biology, 18(9): 5000-5009. http://www.yeastgenome.org/locus/S000002201/interaction Zhuang, Y., and Weiner, A. M. (1989). A compensatory base change in human U2 snRNA can suppress a branch site mutation. Genes and Development. Cold Spring Harbor Laboratory Press, 3: 1545-1552. 121