Abstract: Protein phosphatase 1 occurs in all tissues and regulates many pathways, ranging from cell-cycle progression to carbohydrate metabolism. Many naturally occurring, molecular toxins modulate PP1 activity, though the exact mechanism of this differential regulation is not understood. A detailed elucidation of these interactions is crucial for understanding the cellular basis of phosphatase function and signaling pathways but, more importantly, they can serve as the basis for highly specific therapeutics, e.g. against cancer. We report the crystal structures of PP1 in complex with nodularin-R at 1.63 A and tautomycin at 1.70 A resolution. The PP1:nodularin-R complex was used to demonstrate the utility of our improved PP1 production technique, which produces highly active, soluble PP1. Tautomycin is one of the few toxins that reportedly preferentially binds PP1>PP2A. Therefore, the PP1:tautomycin structure is the first complex structure with a toxin with preferred PP1 specificity. Furthermore, since tautomycin is a linear non-peptide-based toxin, our reported structure will aid the design of lead compounds for novel PP1-specific pharmaceuticals.
Abstract: The production of crystals suitable for high-resolution structure determination is still one of the major bottlenecks in the structure determination process. This is especially true in structural genomics (SG) consortia, where the implementation of protein-specific purification and optimization strategies is not readily implemented into the structure determination workflow. This chapter describes four strategies that have been implemented by a number of SG groups to increase the number of protein targets that resulted in atomic resolution structures: (1) orthologue screening; (2) the use of 1D (1)H NMR spectroscopy to screen for the folded state of a protein prior to crystallization; (3) deletion constructs generation, in which regions of the target protein predicted to be disordered are omitted from the construct, to maximize the likelihood of crystal formation; and (4) crystallization optimum solubility screening to identify more suitable buffers for a given protein. The implementation of these strategies can lead to a substantial increase in the number of protein structures solved. Finally, because these strategies do not require the implementation of expensive robotics, they are highly applicable not only for the SG community but also for academic laboratories.
Abstract: BK virus (BKV) is a polyomavirus that establishes a lifelong persistence in most humans and is a major impediment to success of kidney grafts. The function of the innate immune system in BKV infection and pathology has not been investigated. Here we examine the role of antimicrobial defensins in BKV infection of Vero cells. Our data show that alpha-defensin human neutrophil protein 1 (HNP1) and human alpha-defensin 5 (HD5) inhibit BKV infection by targeting an early event in the viral lifecycle. HD5 treatment of BKV reduced viral attachment to cells, whereas cellular treatment with HD5 did not. Colocalization studies indicated that HD5 interacts directly with BKV. Ultrastructural analysis revealed HD5-induced aggregation of virions. HD5 also inhibited infection of cells by other related polyomaviruses. This is the first study to demonstrate polyomavirus sensitivity to defensins. We also show a novel mechanism whereby HD5 binds to BKV leading to aggregation of virion particles preventing normal virus binding to the cell surface and uptake into cells.
Abstract: Hematopoietic tyrosine phosphatase (HePTP) is one of three members of the kinase interaction motif (KIM) phosphatase family which also includes STEP and PCPTP1. The KIM-PTPs are characterized by a 15 residue sequence, the KIM, which confers specific high-affinity binding to their only known substrates, the MAP kinases Erk and p38, an interaction which is critical for their ability to regulate processes such as T cell differentiation (HePTP) and neuronal signaling (STEP). The KIM-PTPs are also characterized by a unique set of residues in their PTP substrate binding loops, where 4 of the 13 residues are differentially conserved among the KIM-PTPs as compared to more than 30 other class I PTPs. One of these residues, T106 in HePTP, is either an aspartate or asparagine in nearly every other PTP. Using multiple techniques, we investigate the role of these KIM-PTP specific residues in order to elucidate the molecular basis of substrate recognition by HePTP. First, we used NMR spectroscopy to show that Erk2-derived peptides interact specifically with HePTP at the active site. Next, to reveal the molecular details of this interaction, we solved the high-resolution three-dimensional structures of two distinct HePTP-Erk2 peptide complexes. Strikingly, we were only able to obtain crystals of these transient complexes using a KIM-PTP specific substrate-trapping mutant, in which the KIM-PTP specific residue T106 was mutated to an aspartic acid (T106D). The introduced aspartate side chain facilitates the coordination of the bound peptides, thereby stabilizing the active dephosphorylation complex. These structures establish the essential role of HePTP T106 in restricting HePTP specificity to only those substrates which are able to interact with KIM-PTPs via the KIM (e.g., Erk2, p38). Finally, we describe how this interaction of the KIM is sufficient for overcoming the otherwise weak interaction at the active site of KIM-PTPs.
Abstract: The Escherichia coli gene cluster ymgABC was identified in transcriptome studies to have a role in biofilm development and stability. In this study, we showed that YmgB represses biofilm formation in rich medium containing glucose, decreases cellular motility, and protects the cell from acid indicating that YmgB has a major role in acid-resistance in E. coli. Our data show that these phenotypes are potentially mediated through interactions with the important cell signal indole. In addition, gel mobility-shift assays suggest that YmgB may be a non-specific DNA-binding protein. Using nickel-enrichment DNA microarrays, we showed that YmgB binds, either directly or indirectly, via a probable ligand, genes important for biofilm formation. To advance our understanding of the function of YmgB, we used X-ray crystallography to solve the structure of the protein to 1.8 A resolution. YmgB is a biological dimer that is structurally homologous to the E. coli gene regulatory protein Hha, despite having only 5% sequence identity. This supports our DNA microarray data showing that YmgB is a gene regulatory protein. Therefore, this protein, which clearly has a critical role in acid-resistance in E. coli, has been renamed as AriR for regulator of acid resistance influenced by indole.
Abstract: Automation and miniaturization are key issues of high-throughput research projects in the post-genomic era. The implementation of robotics and parallelization has enabled researchers to process large numbers of protein targets for structural studies in a short time with reasonable cost efficiency. However, the cost of implementing the robotics and parallelization often prohibit their use in the traditional academic laboratory. Fortunately, multiple groups have made significant efforts to minimize the cost of heterologous protein expression for the production of protein samples in quantities suitable for high resolution structural studies. In this review, we describe recent efforts to continue to minimize the cost for the parallel processing of multiple protein targets and focus on those materials and strategies that are highly suitable for the traditional academic laboratory.
Abstract: Glutathione S-transferases (GSTs) comprise a diverse superfamily of enzymes found in organisms from all kingdoms of life. GSTs are involved in diverse processes, notably small-molecule biosynthesis or detoxification, and are frequently also used in protein engineering studies or as biotechnology tools. Here, we report the high-resolution X-ray structure of Atu5508 from the pathogenic soil bacterium Agrobacterium tumefaciens (atGST1). Through use of comparative sequence and structural analysis of the GST superfamily, we identified local sequence and structural signatures, which allowed us to distinguish between different GST classes. This approach enables GST classification based on structure, without requiring additional biochemical or immunological data. Consequently, analysis of the atGST1 crystal structure suggests a new GST class, distinct from previously characterized GSTs, which would make it an attractive target for further biochemical studies.
Abstract: Hematopoietic tyrosine phosphatase (HePTP) is a 38kDa class I non-receptor protein tyrosine phosphatase (PTP) that is strongly expressed in T cells. It is composed of a C-terminal classical PTP domain (residues 44-339) and a short N-terminal extension (residues 1-43) that functions to direct HePTP to its physiological substrates. Moreover, HePTP is a member of a recently identified family of PTPs that has a major role in regulating the activity and translocation of the MAP kinases Erk and p38. HePTP binds Erk and p38 via a short, highly conserved motif in its N terminus, termed the kinase interaction motif (KIM). Association of HePTP with Erk via the KIM results in an unusual, reciprocal interaction between the two proteins. First, Erk phosphorylates HePTP at residues Thr45 and Ser72. Second, HePTP dephosphorylates Erk at PTyr185. In order to gain further insight into the interaction of HePTP with Erk, we determined the structure of the PTP catalytic domain of HePTP, residues 44-339. The HePTP catalytic phosphatase domain displays the classical PTP1B fold and superimposes well with PTP-SL, the first KIM-containing phosphatase solved to high resolution. In contrast to the PTP-SL structure, however, HePTP crystallized with a well-ordered phosphate ion bound at the active site. This resulted in the closure of the catalytically important WPD loop, and thus, HePTP represents the first KIM-containing phosphatase solved in the closed conformation. Finally, using this structure of the HePTP catalytic domain, we show that both the phosphorylation of HePTP at Thr45 and Ser72 by Erk2 and the dephosphorylation of Erk2 at Tyr185 by HePTP require significant conformational changes in both proteins.
Abstract: Here, we report the three-dimensional structure of severe acute respiratory syndrome coronavirus (SARS-CoV) nsP7, a component of the SARS-CoV replicase polyprotein. The coronavirus replicase carries out regulatory tasks involved in the maintenance, transcription, and replication of the coronavirus genome. nsP7 was found to assume a compact architecture in solution, which is comprised primarily of helical secondary structures. Three helices (alpha2 to alpha4) form a flat up-down-up antiparallel alpha-helix sheet. The N-terminal segment of residues 1 to 22, containing two turns of alpha-helix and one turn of 3(10)-helix, is packed across the surface of alpha2 and alpha3 in the helix sheet, with the alpha-helical region oriented at a 60 degrees angle relative to alpha2 and alpha3. The surface charge distribution is pronouncedly asymmetrical, with the flat surface of the helical sheet showing a large negatively charged region adjacent to a large hydrophobic patch and the opposite side containing a positively charged groove that extends along the helix alpha1. Each of these three areas is thus implicated as a potential site for protein-protein interactions.
Abstract: In structural genomics centers, nuclear magnetic resonance (NMR) screening is in increasing use as a tool to identify folded proteins that are promising targets for three-dimensional structure determination by X-ray crystallography or NMR spectroscopy. The use of 1D 1H NMR spectra or 2D [1H,15N]-correlation spectroscopy (COSY) typically requires milligram quantities of unlabeled or isotope-labeled protein, respectively. Here, we outline ways towards miniaturization of a structural genomics pipeline with NMR screening for folded globular proteins, using a high-density micro-fermentation device and a microcoil NMR probe. The proteins are micro-expressed in unlabeled or isotope-labeled media, purified, and then subjected to 1D 1H NMR and/or 2D [1H,15N]-COSY screening. To demonstrate that the miniaturization is functioning effectively, we processed nine mouse homologue protein targets and compared the results with those from the "macro-scale" Joint Center of Structural Genomics (JCSG) high-throughput pipeline. The results from the two pipelines were comparable, illustrating that the data were not compromised in the miniaturized approach.
Abstract: In the Joint Center for Structural Genomics, one-dimensional (1D) 1H NMR spectroscopy is routinely used to characterize the folded state of protein targets and, thus, serves to guide subsequent crystallization efforts and to identify proteins for NMR structure determination. Here, we describe 1D 1H NMR screening of a group of 79 mouse homologue proteins, which correlates the NMR data with the outcome of subsequent crystallization experiments and crystallographic structure determination. Based on the 1D 1H NMR spectra, the proteins are classified into four groups, "A" to "D." A-type proteins are candidates for structure determination by NMR or crystallography; "B"-type are earmarked for crystallography; "C" indicates folded globular proteins with broadened line shapes; and "D" are nonglobular, "unfolded" polypeptides. The results obtained from coarse- and fine-screen crystallization trials imply that only A- and B-type proteins should be used for extensive crystallization trials in the future, with C and D proteins subjected only to coarse-screen crystallization trials. Of the presently studied 79 soluble protein targets, 63% yielded A- or B-quality 1D 1H NMR spectra. Although similar yields of crystallization hits were obtained for all four groups, A to D, crystals from A- and B-type proteins diffracted on average to significantly higher resolution than crystals produced from C- or D-type proteins. Furthermore, the output of refined crystal structures from this test set of proteins was 4-fold higher for A- and B-type than for C- and D-type proteins.
Abstract: Currently, 119 high resolution structures of Thermotoga maritima proteins have been determined by the Joint Center for Structural Genomics (JCSG, www.jcsg.org). Sixty-seven of these were solved using the first implementation of the multi-tiered crystallization strategy at the JCSG for the efficient crystallization of large numbers of protein targets. Previously, we reported the analysis of all proteins crystallized using this multi-tiered strategy [Lesley, S.A. et al. (2002) Proc. Natl. Acad. Sci. USA 99, 11664-11669; Page, R. et al. (2003) Acta Crystallogr. D Biol. Crystallogr. 59, 1028-1037]. Here, we extend the analysis and describe the crystallization characteristics of those proteins that produced diffraction quality crystals, ultimately resulting in high resolution structures. First, we found that over 77% (52) of the crystals used for structure determination were produced directly from high-throughput coarse screens, indicating that less than one quarter of the crystals (15) required fine screening. In addition, as observed for the proteome screen [Page, R. et al. (2003) Acta Crystallogr. D Biol. Crystallogr. 59, 1028-1037], the majority of conditions that produced crystals for natively expressed proteins, whose structures have been determined, were distinct from those of their more extensively purified and selenomethionine-labeled counterparts. Finally, 99% of the proteins whose structures were solved crystallized in conditions contained in the JCSG Minimal Core Screen [Page, R. et al. (2003) Acta Crystallogr. D Biol. Crystallogr. 59, 1028-1037; Page, R. and Stevens, R.C. (2004) Methods 34, 373-389], a set of 67 conditions previously identified as those most likely to produce crystals of a diverse set of proteins, confirming its success for rapid identification of proteins with a natural propensity to crystallize.
Abstract: An optimal solubility screen is described that uses the results of crystallization trials to identify buffers that improve protein solubility and, in turn, crystallization success. This screen is useful not only for standard crystallization experiments, but also can easily be implemented into any high-throughput structure-determination pipeline. As a proof of principle, the predicted novel-fold protein AF2059 from Archaeoglobus fulgidus, which was known to precipitate in most buffers and particularly during concentration experiments, was selected. Using the crystallization results of 192 independent crystallization trials, it was possible to identify a buffer containing 100 mM CHES pH 9.25 that significantly improves its solubility. After transferring AF2059 into this ;optimum-solubility' buffer, the protein was rescreened for crystal formation against these same 192 conditions. Instead of extensive precipitation, as observed initially, it was found that 24 separate conditions produced crystals and the exchange of AF2059 into CHES buffer significantly improved crystallization success. Fine-screen optimization of these conditions led to the production of a crystal suitable for high-resolution (2.2 A) structure determination.
Abstract: Cost and time reduction are two of the driving forces in the development of new strategies for protein crystallization and subsequent structure determination. Here, we report the analysis of the Thermotoga maritima proteome, in which we compare the proteins that were successfully expressed, purified and crystallized versus the rest of the proteome. This set of almost 500 proteins represents one of the largest, internally consistent, protein expression and crystallization datasets available. The analysis shows that individual parameters, such as isoelectric point, sequence length, average hydropathy, low complexity regions (SEG), and combinations of these biophysical properties for crystallized proteins define a distinct subset of the T. maritima proteome. The distribution profiles of the various biophysical properties in the expression/crystallization set are then used to extract rules to improve target selection and improve the efficiency and output of structural genomics, as well as general structural biology efforts.
Abstract: Recent efforts to collect and mine crystallization data from structural genomics (SG) consortia have led to the identification of minimal screens and novel screening strategies that can be used to streamline the crystallization process. Two groups, the Joint Center for Structural Genomics and the University of Toronto, carried out large-scale crystallization trials on different sets of bacterial targets (539, JCSG and 755, Toronto), using different sample processing and crystallization methods, and then analyzed their results to identify the smallest subset of conditions that would have crystallized the maximum number of protein targets. The JCSG Core Screen contains 67 conditions (from 480) while the Toronto Minimal Screen contains 6 (from 48). While the exact conditions included in the two screens do not overlap, the major precipitants of the conditions are similar and thus both screens can be used to determine if a protein has a natural propensity to crystallize. In addition, studies from other groups including the University of Queensland, the Mycobacterium tuberculosis SG group, the Southeast Collaboratory for SG, and the York Structural Biology Laboratory indicate that alternative crystallization strategies may be more successful at identifying initial crystallization conditions than typical sparse matrix screens. These minimal screens and alternative screening strategies are already being used to optimize the crystallization processes within large SG efforts. The differences between these results, however, demonstrate that additional studies which examine the influence of protein biophysical properties and sample preparation methods on crystal formation must also be carried out before more robust screens can be identified.
Abstract: As the field of structural genomics continues to grow and new technologies are developed, novel strategies are needed to efficiently crystallize large numbers of protein targets, thus increasing output, not just throughput [Chayen & Saridakis (2002). Acta Cryst. D58, 921-927]. One strategy, developed for the high-throughput structure determination of the Thermotoga maritima proteome, is to quickly determine which proteins have a propensity for crystal formation followed by focused SeMet-incorporated protein crystallization attempts. This experimental effort has resulted in over 320 000 individual crystallization experiments. As such, it has provided one of the most extensive systematic data sets of commonly used crystallization conditions against a wide range of proteins to date. Analysis of this data shows that many of the original screening conditions are redundant, as all of the T. maritima proteins that crystallize readily could be identified using just 23% of the original conditions. It also shows that proteins that contain selenomethionine and are more extensively purified often crystallize in distinctly different conditions from those of their native less pure counterparts. Most importantly, it shows that the two-tiered strategy employed here is extremely successful for predicting which proteins will readily crystallize, as greater than 99% of the proteins identified as having a propensity to crystallize under non-optimal native conditions did so again as selenomethionine derivatives during the focused crystallization trials. This crystallization strategy can be adopted for both large-scale genomics programs and individual protein studies with multiple constructs and has the potential to significantly accelerate future crystallographic efforts.
Abstract: Profilin and beta/gamma-actin from calf thymus were covalently linked using the zero-length cross-linker 1-ethyl-3-(3-dimethylaminopropyl)-carbodiimide in combination with N-hydroxysuccinimide, yielding a single product with an apparent molecular mass of 60 kDa. Sequence analysis and x-ray crystallographic investigations showed that the cross-linked residues were glutamic acid 82 of profilin and lysine 113 of actin. The cross-linked complex was shown to bind with high affinity to deoxyribonuclease I and poly(l-proline). It also bound and exchanged ATP with kinetics close to that of unmodified profilin-actin and inhibited the intrinsic ATPase activity of actin. This inhibition occurred even in conditions where actin normally forms filaments. By these criteria the cross-linked profilin-actin complex retains the characteristics of unmodified profilin-actin. However, the cross-linked complex did not form filaments nor copolymerized with unmodified actin, but did interfere with elongation of actin filaments in a concentration-dependent manner. These results support a polymerization mechanism where the profilin-actin heterodimer binds to the (+)-end of actin filaments, followed by dissociation of profilin, and ATP hydrolysis and P(i) release from the actin subunit as it assumes its stable conformation in the helical filament.
Abstract: Previous crystallographic investigations have shown that actin can undergo large conformational changes, even when complexed to the same actin binding protein. We have conducted a formal analysis of domain motions in actin, using the four available crystal structures, to classify the mechanism as either hinge or shear and to quantify the magnitude of these changes. We demonstrate that actin consists of two rigid cores, a semi-rigid domain and three conformationally variable extended loops. Confirming predictions about the nature of the domain rotation in actin based on its structural similarity to hexokinase, we show, using an algorithm previously used only to identify protein hinges, that residues at the interface between the two rigid cores undergo a shear between alternative conformations of actin. Rotations of less than 7 degrees in the torsion angles of five residues in the polypeptides that connect the rigid cores enable one actin conformation to be transformed into another. Because these torsion angle changes are small, the interface between the domains is maintained. In addition, we show that actin secondary structure elements, including those outside the rigid cores, are conformationally invariant among the four crystal structures, even when actin is complexed to different actin binding proteins. Finally, we demonstrate that the current F-actin models are inconsistent with the principles of actin conformational change identified here.