Abstract: Bacterial cell walls contain peptidoglycan, an essential polymer made by enzymes in the Mur pathway. These proteins are specific to bacteria, which make them targets for drug discovery. MurC, MurD, MurE and MurF catalyze the synthesis of the peptidoglycan precursor UDP-N-acetylmuramoyl-L-alanyl-γ-D-glutamyl-meso-diaminopimelyl-D-alanyl-D-alanine by the sequential addition of amino acids onto UDP-N-acetylmuramic acid (UDP-MurNAc). MurC-F enzymes have been extensively studied by biochemistry and X-ray crystallography. In gram-negative bacteria, ∼30-60% of the bacterial cell wall is recycled during each generation. Part of this recycling process involves the murein peptide ligase (Mpl), which attaches the breakdown product, the tripeptide L-alanyl-γ-D-glutamyl-meso-diaminopimelate, to UDP-MurNAc. We present the crystal structure at 1.65 Å resolution of a full-length Mpl from the permafrost bacterium Psychrobacter arcticus 273-4 (PaMpl). Although the Mpl structure has similarities to Mur enzymes, it has unique sequence and structure features that are likely related to its role in cell wall recycling, a function that differentiates it from the MurC-F enzymes. We have analyzed the sequence-structure relationships that are unique to Mpl proteins and compared them to MurC-F ligases. We have also characterized the biochemical properties of this enzyme (optimal temperature, pH and magnesium binding profiles and kinetic parameters). Although the structure does not contain any bound substrates, we have identified ∼30 residues that are likely to be important for recognition of the tripeptide and UDP-MurNAc substrates, as well as features that are unique to Psychrobacter Mpl proteins. These results provide the basis for future mutational studies for more extensive function characterization of the Mpl sequence-structure relationships.
Abstract: The crystal structures of BB2672 and SPO0826 were determined to resolutions of 1.7 and 2.1 Å by single-wavelength anomalous dispersion and multiple-wavelength anomalous dispersion, respectively, using the semi-automated high-throughput pipeline of the Joint Center for Structural Genomics (JCSG) as part of the NIGMS Protein Structure Initiative (PSI). These proteins are the first structural representatives of the PF06684 (DUF1185) Pfam family. Structural analysis revealed that both structures adopt a variant of the Bacillus chorismate mutase fold (BCM). The biological unit of both proteins is a hexamer and analysis of homologs indicates that the oligomer interface residues are highly conserved. The conformation of the critical regions for oligomerization appears to be dependent on pH or salt concentration, suggesting that this protein might be subject to environmental regulation. Structural similarities to BCM and genome-context analysis suggest a function in amino-acid synthesis.
Abstract: Proteins with the DUF2063 domain constitute a new Pfam family, PF09836. The crystal structure of a member of this family, NGO1945 from Neisseria gonorrhoeae, has been determined and reveals that the N-terminal DUF2063 domain is likely to be a DNA-binding domain. In conjunction with the rest of the protein, NGO1945 is likely to be involved in transcriptional regulation, which is consistent with genomic neighborhood analysis. Of the 216 currently known proteins that contain a DUF2063 domain, the most significant sequence homologs of NGO1945 (∼40-99% sequence identity) are from various Neisseria and Haemophilus species. As these are important human pathogens, NGO1945 represents an interesting candidate for further exploration via biochemical studies and possible therapeutic intervention.
Abstract: The crystal structure of Dhaf4260 from Desulfitobacterium hafniense DCB-2 was determined by single-wavelength anomalous diffraction (SAD) to a resolution of 2.01 Å using the semi-automated high-throughput pipeline of the Joint Center for Structural Genomics (JCSG) as part of the NIGMS Protein Structure Initiative (PSI). This protein structure is the first representative of the PF04016 (DUF364) Pfam family and reveals a novel combination of two well known domains (an enolase N-terminal-like fold followed by a Rossmann-like domain). Structural and bioinformatic analyses reveal partial similarities to Rossmann-like methyltransferases, with residues from the enolase-like fold combining to form a unique active site that is likely to be involved in the condensation or hydrolysis of molecules implicated in the synthesis of flavins, pterins or other siderophores. The genome context of Dhaf4260 and homologs additionally supports a role in heavy-metal chelation.
Abstract: The crystal structure of Jann_2411 from Jannaschia sp. strain CCS1, a member of the Pfam PF07336 family classified as a domain of unknown function (DUF1470), was solved to a resolution of 1.45 Å by multiple-wavelength anomalous dispersion (MAD). This protein is the first structural representative of the DUF1470 Pfam family. Structural analysis revealed a two-domain organization, with the N-terminal domain presenting a new fold called the ABATE domain that may bind an as yet unknown ligand. The C-terminal domain forms a treble-clef zinc finger that is likely to be involved in DNA binding. Analysis of the Jann_2411 protein and the broader ABATE-domain family suggests a role as stress-induced transcriptional regulators.
Abstract: The crystal structure of PA1994 from Pseudomonas aeruginosa, a member of the Pfam PF06475 family classified as a domain of unknown function (DUF1089), reveals a novel fold comprising a 15-stranded β-sheet wrapped around a single α-helix that assembles into a tight dimeric arrangement. The remote structural similarity to lipoprotein localization factors, in addition to the presence of an acidic pocket that is conserved in DUF1089 homologs, phospholipid-binding and sugar-binding proteins, indicate a role for PA1994 and the DUF1089 family in glycolipid metabolism. Genome-context analysis lends further support to the involvement of this family of proteins in glycolipid metabolism and indicates possible activation of DUF1089 homologs under conditions of bacterial cell-wall stress or host-pathogen interactions.
Abstract: The crystal structures of SPO0140 and Sbal_2486 were determined using the semiautomated high-throughput pipeline of the Joint Center for Structural Genomics (JCSG) as part of the NIGMS Protein Structure Initiative (PSI). The structures revealed a conserved core with domain duplication and a superficial similarity of the C-terminal domain to pleckstrin homology-like folds. The conservation of the domain interface indicates a potential binding site that is likely to involve a nucleotide-based ligand, with genome-context and gene-fusion analyses additionally supporting a role for this family in signal transduction, possibly during oxidative stress.
Abstract: SSO2064 is the first structural representative of PF01796 (DUF35), a large prokaryotic family with a wide phylogenetic distribution. The structure reveals a novel two-domain architecture comprising an N-terminal, rubredoxin-like, zinc ribbon and a C-terminal, oligonucleotide/oligosaccharide-binding (OB) fold domain. Additional N-terminal helical segments may be involved in protein-protein interactions. Domain architectures, genomic context analysis and functional evidence from certain bacterial representatives of this family suggest that these proteins form a novel fatty-acid-binding component that is involved in the biosynthesis of lipids and polyketide antibiotics and that they possibly function as acyl-CoA-binding proteins. This structure has led to a re-evaluation of the DUF35 family, which has now been split into two entries in the latest Pfam release (v.24.0).
Abstract: The structure of LP2179, a member of the PF08866 (DUF1831) family, suggests a novel α+β fold comprising two β-sheets packed against a single helix. A remote structural similarity to two other uncharacterized protein families specific to the Bacillus genus (PF08868 and PF08968), as well as to prokaryotic S-adenosylmethionine decarboxylases, is consistent with a role in amino-acid metabolism. Genomic neighborhood analysis of LP2179 supports this functional assignment, which might also then be extended to PF08868 and PF08968.
Abstract: The NIH Protein Structure Initiative centers, such as the Joint Center for Structural Genomics (JCSG), have developed highly efficient technological platforms that are capable of experimentally determining the three-dimensional structures of hundreds of proteins per year. However, the overwhelming majority of the almost 5000 protein structures determined by these centers have yet to be described in the peer-reviewed literature. In a high-throughput structural genomics environment, the process of structure determination occurs independently of any associated experimental characterization of function, which creates a challenge for the annotation and analysis of structures and the publication of these results. This challenge has been addressed by developing TOPSAN (`The Open Protein Structure Annotation Network'), which enables the generation of knowledge via collaborations among globally distributed contributors supported by automated amalgamation of available information. TOPSAN currently provides annotations for all protein structures determined by the JCSG in addition to preliminary annotations on a large number of structures from the other PSI production centers. TOPSAN-enabled collaborations have resulted in insightful structure-function analysis for many proteins and have led to numerous peer-reviewed publications, as exemplified by the articles included in this issue of Acta Crystallographica Section F.
Abstract: Mre11 nuclease plays a central role in the repair of cytotoxic and mutagenic DNA double-strand breaks. As X-ray structural information has been available only for the Pyrococcus furiosus enzyme (PfMre11), the conserved and variable features of this nuclease across the domains of life have not been experimentally defined. Our crystal structure and biochemical studies demonstrate that TM1635 from Thermotoga maritima, originally annotated as a putative nuclease, is an Mre11 endo/exonuclease (TmMre11) and the first such structure from eubacteria. TmMre11 and PfMre11 display similar overall structures, despite sequence identity in the twilight zone of only approximately 20%. However, they differ substantially in their DNA-specificity domains and in their dimeric organization. Residues in the nuclease domain are highly conserved, but those in the DNA-specificity domain are not. The structural differences likely affect how Mre11 from different organisms recognize and interact with single-stranded DNA, double-stranded DNA and DNA hairpin structures during DNA repair. The TmMre11 nuclease active site has no bound metal ions, but is conserved in sequence and structure with the exception of a histidine that is important in PfMre11 nuclease activity. Nevertheless, biochemical characterization confirms that TmMre11 possesses both endonuclease and exonuclease activities on single-stranded and double-stranded DNA substrates, respectively.
Abstract: Pleckstrin homology (PH) domains have been identified only in eukaryotic proteins to date. We have determined crystal structures for three members of an uncharacterized protein family (Pfam PF08000), which provide compelling evidence for the existence of PH-like domains in bacteria (PHb). The first two structures contain a single PHb domain that forms a dome-shaped, oligomeric ring with C(5) symmetry. The third structure has an additional helical hairpin attached at the C-terminus and forms a similar but much larger ring with C(12) symmetry. Thus, both molecular assemblies exhibit rare, higher-order, cyclic symmetry but preserve a similar arrangement of their PHb domains, which gives rise to a conserved hydrophilic surface at the intersection of the beta-strands of adjacent protomers that likely mediates protein-protein interactions. As a result of these structures, additional families of PHb domains were identified, suggesting that PH domains are much more widespread than originally anticipated. Thus, rather than being a eukaryotic innovation, the PH domain superfamily appears to have existed before prokaryotes and eukaryotes diverged.
Abstract: CvfB is a conserved regulatory protein important for the virulence of Staphylococcus aureus. We show here that CvfB binds RNA. The crystal structure of the CvfB ortholog from Streptococcus pneumoniae at 1.4 A resolution reveals a unique RNA binding protein that is formed from a concatenation of well-known structural modules that bind nucleic acids: three consecutive S1 RNA binding domains and a winged helix (WH) domain. The third S1 and the WH domains are required for cooperative RNA binding and form a continuous surface that likely contributes to the RNA interaction. The WH domain is critical to CvfB function and contains a unique sequence motif. Thus CvfB represents a novel assembly of modules for binding RNA.
Abstract: Sufu (Suppressor of Fused), a two-domain protein, plays a critical role in regulating Hedgehog signaling and is conserved from flies to humans. A few bacterial Sufu-like proteins have previously been identified based on sequence similarity to the N-terminal domain of eukaryotic Sufu proteins, but none have been structurally or biochemically characterized and their function in bacteria is unknown. We have determined the crystal structure of a more distantly related Sufu-like homolog, NGO1391 from Neisseria gonorrhoeae, at 1.4 Å resolution, which provides the first biophysical characterization of a bacterial Sufu-like protein. The structure revealed a striking similarity to the N-terminal domain of human Sufu (r.m.s.d. of 2.6 Å over 93% of the NGO1391 protein), despite an extremely low sequence identity of ∼15%. Subsequent sequence analysis revealed that NGO1391 defines a new subset of smaller, Sufu-like proteins that are present in ∼200 bacterial species and has resulted in expansion of the SUFU (PF05076) family in Pfam.
Abstract: YeaZ is involved in a protein network that is essential for bacteria. The crystal structure of YeaZ from Thermotoga maritima was determined to 2.5 Å resolution. Although this protein belongs to a family of ancient actin-like ATPases, it appears that it has lost the ability to bind ATP since it lacks some key structural features that are important for interaction with ATP. A conserved surface was identified, supporting its role in the formation of protein complexes.
Abstract: The Joint Center for Structural Genomics high-throughput structural biology pipeline has delivered more than 1000 structures to the community over the past ten years. The JCSG has made a significant contribution to the overall goal of the NIH Protein Structure Initiative (PSI) of expanding structural coverage of the protein universe, as well as making substantial inroads into structural coverage of an entire organism. Targets are processed through an extensive combination of bioinformatics and biophysical analyses to efficiently characterize and optimize each target prior to selection for structure determination. The pipeline uses parallel processing methods at almost every step in the process and can adapt to a wide range of protein targets from bacterial to human. The construction, expansion and optimization of the JCSG gene-to-structure pipeline over the years have resulted in many technological and methodological advances and developments. The vast number of targets and the enormous amounts of associated data processed through the multiple stages of the experimental pipeline required the development of variety of valuable resources that, wherever feasible, have been converted to free-access web-based tools and applications.
Abstract: The first structural representative of the domain of unknown function DUF2006 family, also known as Pfam family PF09410, comprises a lipocalin-like fold with domain duplication. The finding of the calycin signature in the N-terminal domain, combined with remote sequence similarity to two other protein families (PF07143 and PF08622) implicated in isoprenoid metabolism and the oxidative stress response, support an involvement in lipid metabolism. Clusters of conserved residues that interact with ligand mimetics suggest that the binding and regulation sites map to the N-terminal domain and to the interdomain interface, respectively.
Abstract: Archaeal membrane lipids consist of branched, saturated hydrocarbons distinct from those found in bacteria and eukaryotes. Digeranylgeranylglycerophospholipid reductase (DGGR) catalyzes the hydrogenation process that converts unsaturated 2,3-di-O-geranylgeranylglyceryl phosphate to saturated 2,3-di-O-phytanylglyceryl phosphate as a critical step in the biosynthesis of archaeal membrane lipids. The saturation of hydrocarbon chains confers the ability to resist hydrolysis and oxidation and helps archaea withstand extreme conditions. DGGR is a member of the geranylgeranyl reductase family that is also widely distributed in bacteria and plants, where the family members are involved in the biosynthesis of photosynthetic pigments. We have determined the crystal structure of DGGR from the thermophilic heterotrophic archaea Thermoplasma acidophilum at 1.6Â Ã… resolution, in complex with flavin adenine dinucleotide (FAD) and a bacterial lipid. The DGGR structure can be assigned to the well-studied, p-hydroxybenzoate hydroxylase (PHBH) SCOP superfamily of flavoproteins that include many aromatic hydroxylases and other enzymes with diverse functions. In the DGGR complex, FAD adopts the IN conformation (closed) previously observed in other PHBH flavoproteins. DGGR contains a large substrate-binding site that extends across the entire ligand-binding domain. Electron density corresponding to a bacterial lipid was found within this cavity. The cavity consists of a large opening that tapers down to two, narrow, curved tunnels that closely mimic the shape of the preferred substrate. We identified a sequence motif, PxxYxWxFP, that defines a specificity pocket in the enzyme and precisely aligns the double bond of the geranyl group with respect to the FAD cofactor, thus providing a structural basis for the substrate specificity of geranylgeranyl reductases. DGGR is likely to share a common mechanism with other PHBH enzymes in which FAD switches between two conformations that correspond to the reductive and oxidative half cycles. The structure provides evidence that substrate binding likely involves conformational changes, which are coupled to the two conformational states of the FAD.
Abstract: The JCSG has recently developed a protocol for systematic comparisons of high-quality crystal and NMR structures of proteins. In this paper, the extent to which this approach can provide function-related information on the two functionally annotated proteins TM1081, a Thermotoga maritima anti-σ factor antagonist, and A2LD1 (gi:13879369), a mouse γ-glutamylamine cyclotransferase, is explored. The NMR structures of the two proteins have been determined in solution at 313 and 298 K, respectively, using the current JCSG protocol based on the software package UNIO for extensive automation. The corresponding crystal structures were solved by the JCSG at 100 K and 1.6 Å resolution and at 100 K and 1.9 Å resolution, respectively. The NMR and crystal structures of the two proteins share the same overall molecular architectures. However, the precision of the structure determination along the amino-acid sequence varies over a significantly wider range in the NMR structures than in the crystal structures. Thereby, in each of the two NMR structures about 65% of the residues have displacements below the average and in both proteins the less well ordered residues include large parts of the active sites, in addition to some highly solvent-exposed surface areas. Whereas the latter show increased disorder in the crystal and in solution, the active-site regions display increased displacements only in the NMR structures, where they undergo local conformational exchange on the millisecond time scale that appears to be frozen in the crystals. These observations suggest that a search for molecular regions showing increased structural disorder and slow dynamic processes in solution while being well ordered in the corresponding crystal structure might be a valid initial step in the challenge of identifying putative active sites in functionally unannotated proteins with known three-dimensional structure.
Abstract: Examination of the genomic context for members of the FmdE Pfam family (PF02663), such as the protein encoded by the fmdE gene from the methanogenic archaeon Methanobacterium thermoautotrophicum, indicates that 13 of them are co-transcribed with genes encoding subunits of molybdenum formylmethanofuran dehydrogenase (EC 1.2.99.5), an enzyme that is involved in microbial methane production. Here, the first crystal structures from PF02663 are described, representing two bacterial and one archaeal species: B8FYU2_DESHY from the anaerobic dehalogenating bacterium Desulfitobacterium hafniense DCB-2, Q2LQ23_SYNAS from the syntrophic bacterium Syntrophus aciditrophicus SB and Q9HJ63_THEAC from the thermoacidophilic archaeon Thermoplasma acidophilum. Two of these proteins, Q9HJ63_THEAC and Q2LQ23_SYNAS, contain two domains: an N-terminal thioredoxin-like α+β core domain (NTD) consisting of a five-stranded, mixed β-sheet flanked by several α-helices and a C-terminal zinc-finger domain (CTD). B8FYU2_DESHY, on the other hand, is composed solely of the NTD. The CTD of Q9HJ63_THEAC and Q2LQ23_SYNAS is best characterized as a treble-clef zinc finger. Two significant structural differences between Q9HJ63_THEAC and Q2LQ23_SYNAS involve their metal binding. First, zinc is bound to the putative active site on the NTD of Q9HJ63_THEAC, but is absent from the NTD of Q2LQ23_SYNAS. Second, whereas the structure of the CTD of Q2LQ23_SYNAS shows four Cys side chains within coordination distance of the Zn atom, the structure of Q9HJ63_THEAC is atypical for a treble-cleft zinc finger in that three Cys side chains and an Asp side chain are within coordination distance of the zinc.
Abstract: The crystal structure of a putative NTPase, YP_001813558.1 from Exiguobacterium sibiricum 255-15 (PF09934, DUF2166) was determined to 1.78 Å resolution. YP_001813558.1 and its homologs (dimeric dUTPases, MazG proteins and HisE-encoded phosphoribosyl ATP pyrophosphohydrolases) form a superfamily of all-α-helical NTP pyrophosphatases. In dimeric dUTPase-like proteins, a central four-helix bundle forms the active site. However, in YP_001813558.1, an unexpected intertwined swapping of two of the helices that compose the conserved helix bundle results in a `linked dimer' that has not previously been observed for this family. Interestingly, despite this novel mode of dimerization, the metal-binding site for divalent cations, such as magnesium, that are essential for NTPase activity is still conserved. Furthermore, the active-site residues that are involved in sugar binding of the NTPs are also conserved when compared with other α-helical NTPases, but those that recognize the nucleotide bases are not conserved, suggesting a different substrate specificity.
Abstract: In the plant pathogen Xanthomonas campestris pv. campestris, the product of the tcmJ gene, XcTcmJ, encodes a protein belonging to the RmlC family of cupins. XcTcmJ was crystallized in a monoclinic space group (C2) in the presence of zinc acetate and the structure was determined to 1.6 Å resolution. Previously, the apo structure has been reported in the absence of any bound metal ion [Chin et al. (2006), Proteins, 65, 1046-1050]. The most significant difference between the apo structure and the structure of XcTcmJ described here is a reorganization of the binding site for zinc acetate, which was most likely acquired from the crystallization solution. This site is located in the conserved metal ion-binding domain at the putative active site of XcTcmJ. In addition, an acetate was also bound within coordination distance of the zinc. In order to accommodate this binding, rearrangement of a conserved histidine ligand is required as well as several nearby residues within and around the putative active site. These observations indicate that binding of zinc serves a functional role in this cupin protein.
Abstract: Dipeptidyl-peptidase VI from Bacillus sphaericus and YkfC from Bacillus subtilis have both previously been characterized as highly specific γ-D-glutamyl-L-diamino acid endopeptidases. The crystal structure of a YkfC ortholog from Bacillus cereus (BcYkfC) at 1.8 Å resolution revealed that it contains two N-terminal bacterial SH3 (SH3b) domains in addition to the C-terminal catalytic NlpC/P60 domain that is ubiquitous in the very large family of cell-wall-related cysteine peptidases. A bound reaction product (L-Ala-γ-D-Glu) enabled the identification of conserved sequence and structural signatures for recognition of L-Ala and γ-D-Glu and, therefore, provides a clear framework for understanding the substrate specificity observed in dipeptidyl-peptidase VI, YkfC and other NlpC/P60 domains in general. The first SH3b domain plays an important role in defining substrate specificity by contributing to the formation of the active site, such that only murein peptides with a free N-terminal alanine are allowed. A conserved tyrosine in the SH3b domain of the YkfC subfamily is correlated with the presence of a conserved acidic residue in the NlpC/P60 domain and both residues interact with the free amine group of the alanine. This structural feature allows the definition of a subfamily of NlpC/P60 enzymes with the same N-terminal substrate requirements, including a previously characterized cyanobacterial L-alanine-γ-D-glutamate endopeptidase that contains the two key components (an NlpC/P60 domain attached to an SH3b domain) for assembly of a YkfC-like active site.
Abstract: The NMR structures of the TM1112 and TM1367 proteins from Thermotoga maritima in solution at 298 K were determined following a new protocol which uses the software package UNIO for extensive automation. The results obtained with this novel procedure were evaluated by comparison with the crystal structures solved by the JCSG at 100 K to 1.83 and 1.90 Å resolution, respectively. In addition, the TM1112 solution structure was compared with an NMR structure solved by the NESG using a conventional largely interactive methodology. For both proteins, the newly determined NMR structure could be superimposed with the crystal structure with r.m.s.d. values of <1.0 Å for the backbone heavy atoms, which provided a starting platform to investigate local structure variations, which may arise from either the methods used or from the different chemical environments in solution and in the crystal. Thereby, these comparative studies were further explored with the use of reference NMR and crystal structures, which were computed using the NMR software with input of upper-limit distance constraints derived from the molecular models that represent the results of structure determination by NMR and by X-ray diffraction, respectively. The results thus obtained show that NMR structure calculations with the new automated UNIO software used by the JCSG compare favorably with those from a more labor-intensive and time-intensive interactive procedure. An intriguing observation is that the `bundles' of two TM1112 or three TM1367 molecules in the asymmetric unit of the crystal structures mimic the behavior of the bundles of 20 conformers used to represent the NMR solution structures when comparing global r.m.s.d. values calculated either for the polypeptide backbone, the core residues with solvent accessibility below 15% or all heavy atoms.
Abstract: The NMR structure of the protein NP_247299.1 in solution at 313 K has been determined and is compared with the X-ray crystal structure, which was also solved in the Joint Center for Structural Genomics (JCSG) at 100 K and at 1.7 Å resolution. Both structures were obtained using the current largely automated crystallographic and solution NMR methods used by the JCSG. This paper assesses the accuracy and precision of the results from these recently established automated approaches, aiming for quantitative statements about the location of structure variations that may arise from either one of the methods used or from the different environments in solution and in the crystal. To evaluate the possible impact of the different software used for the crystallographic and the NMR structure determinations and analysis, the concept is introduced of reference structures, which are computed using the NMR software with input of upper-limit distance constraints derived from the molecular models representing the results of the two structure determinations. The use of this new approach is explored to quantify global differences that arise from the different methods of structure determination and analysis versus those that represent interesting local variations or dynamics. The near-identity of the protein core in the NMR and crystal structures thus provided a basis for the identification of complementary information from the two different methods. It was thus observed that locally increased crystallographic B values correlate with dynamic structural polymorphisms in solution, including that the solution state of the protein involves a slow dynamic equilibrium on a time scale of milliseconds or slower between two ensembles of rapidly interchanging conformers that contain, respectively, the cis or trans form of the C-terminal proline and represent about 25 and 75% of the total protein.
Abstract: Chorismate mutase/prephenate dehydrogenase from Haemophilus influenzae Rd KW20 is a bifunctional enzyme that catalyzes the rearrangement of chorismate to prephenate and the NAD(P)(+)-dependent oxidative decarboxylation of prephenate to 4-hydroxyphenylpyruvate in tyrosine biosynthesis. The crystal structure of the prephenate dehydrogenase component (HinfPDH) of the TyrA protein from H. influenzae Rd KW20 in complex with the inhibitor tyrosine and cofactor NAD(+) has been determined to 2.0 Å resolution. HinfPDH is a dimeric enzyme, with each monomer consisting of an N-terminal α/β dinucleotide-binding domain and a C-terminal α-helical dimerization domain. The structure reveals key active-site residues at the domain interface, including His200, Arg297 and Ser179 that are involved in catalysis and/or ligand binding and are highly conserved in TyrA proteins from all three kingdoms of life. Tyrosine is bound directly at the catalytic site, suggesting that it is a competitive inhibitor of HinfPDH. Comparisons with its structural homologues reveal important differences around the active site, including the absence of an α-β motif in HinfPDH that is present in other TyrA proteins, such as Synechocystis sp. arogenate dehydrogenase. Residues from this motif are involved in discrimination between NADP(+) and NAD(+). The loop between β5 and β6 in the N-terminal domain is much shorter in HinfPDH and an extra helix is present at the C-terminus. Furthermore, HinfPDH adopts a more closed conformation compared with TyrA proteins that do not have tyrosine bound. This conformational change brings the substrate, cofactor and active-site residues into close proximity for catalysis. An ionic network consisting of Arg297 (a key residue for tyrosine binding), a water molecule, Asp206 (from the loop between β5 and β6) and Arg365' (from the additional C-terminal helix of the adjacent monomer) is observed that might be involved in gating the active site.
Abstract: A novel aminoacyl-tRNA synthetase that contains an iron-sulfur cluster in the tRNA anticodon-binding region and efficiently charges tRNA with tryptophan has been found in Thermotoga maritima. The crystal structure of TmTrpRS (tryptophanyl-tRNA synthetase; TrpRS; EC 6.1.1.2) reveals an iron-sulfur [4Fe-4S] cluster bound to the tRNA anticodon-binding (TAB) domain and an L-tryptophan ligand in the active site. None of the other T. maritima aminoacyl-tRNA synthetases (AARSs) contain this [4Fe-4S] cluster-binding motif (C-x₂₂-C-x₆-C-x₂-C). It is speculated that the iron-sulfur cluster contributes to the stability of TmTrpRS and could play a role in the recognition of the anticodon.
Abstract: Approximately 65% of PSI structures report some type of ligand(s) that is bound in the crystal structure. Here, a description is given of how such ligands are handled and analyzed at the JCSG and a survey of the types, variety and frequency of ligands that are observed in the PSI structures is also compiled and analyzed, including illustrations of how these bound ligands have provided functional clues for annotation of proteins with little or no previous experimental characterization. Furthermore, a web server was developed as a tool to mine and analyze the PSI structures for bound ligands and other identifying features.
Abstract: KPN03535 (gi|152972051) is a putative lipoprotein of unknown function that is secreted by Klebsiella pneumoniae MGH 78578. The crystal structure reveals that despite a lack of any detectable sequence similarity to known structures, it is a novel variant of the OB-fold and structurally similar to the bacterial Cpx-pathway protein NlpE, single-stranded DNA-binding (SSB) proteins and toxins. K. pneumoniae MGH 78578 forms part of the normal human skin, mouth and gut flora and is an opportunistic pathogen that is linked to about 8% of all hospital-acquired infections in the USA. This structure provides the foundation for further investigations into this divergent member of the OB-fold family.
Abstract: The crystal structures of the proteins encoded by the YP_749275.1 and YP_001095227.1 genes from Shewanella frigidimarina and S. loihica, respectively, have been determined at 1.8 and 2.25 Å resolution, respectively. These proteins are members of a novel family of bacterial proteins that adopt the α/β SpoIIAA-like fold found in STAS and CRAL-TRIO domains. Despite sharing 54% sequence identity, these two proteins adopt distinct conformations arising from different dispositions of their α2 and α3 helices. In the `open' conformation (YP_001095227.1), these helices are 15 Å apart, leading to the creation of a deep nonpolar cavity. In the `closed' structure (YP_749275.1), the helices partially unfold and rearrange, occluding the cavity and decreasing the solvent-exposed hydrophobic surface. These two complementary structures are reminiscent of the conformational switch in CRAL-TRIO carriers of hydrophobic compounds. It is suggested that both proteins may associate with the lipid bilayer in their `open' monomeric state by inserting their amphiphilic helices, α2 and α3, into the lipid bilayer. These bacterial proteins may function as carriers of nonpolar substances or as interfacially activated enzymes.
Abstract: The crystal structure of the Bacteroides thetaiotaomicron protein BT_3984 was determined to a resolution of 1.7 Å and was the first structure to be determined from the extensive SusD family of polysaccharide-binding proteins. SusD is an essential component of the sus operon that defines the paradigm for glycan utilization in dominant members of the human gut microbiota. Structural analysis of BT_3984 revealed an N-terminal region containing several tetratricopeptide repeats (TPRs), while the signature C-terminal region is less structured and contains extensive loop regions. Sequence and structure analysis of BT_3984 suggests the presence of binding interfaces for other proteins from the polysaccharide-utilization complex.
Abstract: Proteins that contain the DUF2874 domain constitute a new Pfam family PF11396. Members of this family have predominantly been identified in microbes found in the human gut and oral cavity. The crystal structure of one member of this family, BVU2987 from Bacteroides vulgatus, has been determined, revealing a β-lactamase inhibitor protein-like structure with a tandem repeat of domains. Sequence analysis and structural comparisons reveal that BVU2987 and other DUF2874 proteins are related to β-lactamase inhibitor protein, PepSY and SmpA_OmlA proteins and hence are likely to function as inhibitory proteins.
Abstract: BT1062 from Bacteroides thetaiotaomicron is a homolog of Mfa2 (PGN0288 or PG0179), which is a component of the minor fimbriae in Porphyromonas gingivalis. The crystal structure of BT1062 revealed a conserved fold that is widely adopted by fimbrial components.
Abstract: Membrane-attack complex/perforin (MACPF) proteins are transmembrane pore-forming proteins that are important in both human immunity and the virulence of pathogens. Bacterial MACPFs are found in diverse bacterial species, including most human gut-associated Bacteroides species. The crystal structure of a bacterial MACPF-domain-containing protein BT_3439 (Bth-MACPF) from B. thetaiotaomicron, a predominant member of the mammalian intestinal microbiota, has been determined. Bth-MACPF contains a membrane-attack complex/perforin (MACPF) domain and two novel C-terminal domains that resemble ribonuclease H and interleukin 8, respectively. The entire protein adopts a flat crescent shape, characteristic of other MACPF proteins, that may be important for oligomerization. This Bth-MACPF structure provides new features and insights not observed in two previous MACPF structures. Genomic context analysis infers that Bth-MACPF may be involved in a novel protein-transport or nutrient-uptake system, suggesting an important role for these MACPF proteins, which were likely to have been inherited from eukaryotes via horizontal gene transfer, in the adaptation of commensal bacteria to the host environment.
Abstract: BT2081 from Bacteroides thetaiotaomicron (GenBank accession code NP_810994.1) is a member of a novel protein family consisting of over 160 members, most of which are found in the different classes of Bacteroidetes. Genome-context analysis lends support to the involvement of this family in carbohydrate metabolism, which plays a key role in B. thetaiotaomicron as a predominant bacterial symbiont in the human distal gut microbiome. The crystal structure of BT2081 at 2.05 Å resolution represents the first structure from this new protein family. BT2081 consists of an N-terminal domain, which adopts a β-sandwich immunoglobulin-like fold, and a larger C-terminal domain with a β-sandwich jelly-roll fold. Structural analyses reveal that both domains are similar to those found in various carbohydrate-active enzymes. The C-terminal β-jelly-roll domain contains a potential carbohydrate-binding site that is highly conserved among BT2081 homologs and is situated in the same location as the carbohydrate-binding sites that are found in structurally similar glycoside hydrolases (GHs). However, in BT2081 this site is partially occluded by surrounding loops, which results in a deep solvent-accessible pocket rather than a shallower solvent-exposed cleft.
Abstract: ECX21941 represents a very large family (over 600 members) of novel, ocean metagenome-specific proteins identified by clustering of the dataset from the Global Ocean Sampling expedition. The crystal structure of ECX21941 reveals unexpected similarity to Sm/LSm proteins, which are important RNA-binding proteins, despite no detectable sequence similarity. The ECX21941 protein assembles as a homopentamer in solution and in the crystal structure when expressed in Escherichia coli and represents the first pentameric structure for this Sm/LSm family of proteins, although the actual oligomeric form in vivo is currently not known. The genomic neighborhood analysis of ECX21941 and its homologs combined with sequence similarity searches suggest a cyanophage origin for this protein. The specific functions of members of this family are unknown, but our structure analysis of ECX21941 indicates nucleic acid-binding capabilities and suggests a role in RNA and/or DNA processing.
Abstract: Regulatory inactivation of DnaA is dependent on Hda (homologous to DnaA), a protein homologous to the AAA+ (ATPases associated with diverse cellular activities) ATPase region of the replication initiator DnaA. When bound to the sliding clamp loaded onto duplex DNA, Hda can stimulate the transformation of active DnaA-ATP into inactive DnaA-ADP. The crystal structure of Hda from Shewanella amazonensis SB2B at 1.75 A resolution reveals that Hda resembles typical AAA+ ATPases. The arrangement of the two subdomains in Hda (residues 1-174 and 175-241) differs dramatically from that of DnaA. A CDP molecule anchors the Hda domains in a conformation that promotes dimer formation. The Hda dimer adopts a novel oligomeric assembly for AAA+ proteins in which the arginine finger, crucial for ATP hydrolysis, is fully exposed and available to hydrolyze DnaA-ATP through a typical AAA+ type of mechanism. The sliding clamp binding motifs at the N-terminus of each Hda monomer are partially buried and combine to form an antiparallel beta-sheet at the dimer interface. The inaccessibility of the clamp binding motifs in the CDP-bound structure of Hda suggests that conformational changes are required for Hda to form a functional complex with the clamp. Thus, the CDP-bound Hda dimer likely represents an inactive form of Hda.
Abstract: The crystal structures of two homologous endopeptidases from cyanobacteria Anabaena variabilis and Nostoc punctiforme were determined at 1.05 and 1.60 A resolution, respectively, and contain a bacterial SH3-like domain (SH3b) and a ubiquitous cell-wall-associated NlpC/P60 (or CHAP) cysteine peptidase domain. The NlpC/P60 domain is a primitive, papain-like peptidase in the CA clan of cysteine peptidases with a Cys126/His176/His188 catalytic triad and a conserved catalytic core. We deduced from structure and sequence analysis, and then experimentally, that these two proteins act as gamma-D-glutamyl-L-diamino acid endopeptidases (EC 3.4.22.-). The active site is located near the interface between the SH3b and NlpC/P60 domains, where the SH3b domain may help define substrate specificity, instead of functioning as a targeting domain, so that only muropeptides with an N-terminal L-alanine can bind to the active site.
Abstract: Influenza virus presents an important and persistent threat to public health worldwide, and current vaccines provide immunity to viral isolates similar to the vaccine strain. High-affinity antibodies against a conserved epitope could provide immunity to the diverse influenza subtypes and protection against future pandemic viruses. Cocrystal structures were determined at 2.2 and 2.7 angstrom resolutions for broadly neutralizing human antibody CR6261 Fab in complexes with the major surface antigen (hemagglutinin, HA) from viruses responsible for the 1918 H1N1 influenza pandemic and a recent lethal case of H5N1 avian influenza. In contrast to other structurally characterized influenza antibodies, CR6261 recognizes a highly conserved helical region in the membrane-proximal stem of HA1 and HA2. The antibody neutralizes the virus by blocking conformational rearrangements associated with membrane fusion. The CR6261 epitope identified here should accelerate the design and implementation of improved vaccines that can elicit CR6261-like antibodies, as well as antibody-based therapies for the treatment of influenza.
Abstract: SsgA-like proteins (SALPs) are a family of homologous cell division-related proteins that occur exclusively in morphologically complex actinomycetes. We show that SsgB, a subfamily of SALPs, is the archetypal SALP that is functionally conserved in all sporulating actinomycetes. Sporulation-specific cell division of Streptomyces coelicolor ssgB mutants is restored by introduction of distant ssgB orthologues from other actinomycetes. Interestingly, the number of septa (and spores) of the complemented null mutants is dictated by the specific ssgB orthologue that is expressed. The crystal structure of the SsgB from Thermobifida fusca was determined at 2.6 A resolution and represents the first structure for this family. The structure revealed similarities to a class of eukaryotic "whirly" single-stranded DNA/RNA-binding proteins. However, the electro-negative surface of the SALPs suggests that neither SsgB nor any of the other SALPs are likely to interact with nucleotide substrates. Instead, we show that a conserved hydrophobic surface is likely to be important for SALP function and suggest that proteins are the likely binding partners.
Abstract: Cell-cycle-regulated stalk biogenesis in Caulobacter crescentus is controlled by a multistep phosphorelay system consisting of the hybrid histidine kinase ShkA, the histidine phosphotransfer (HPt) protein ShpA, and the response regulator TacA. ShpA shuttles phosphoryl groups between ShkA and TacA. When phosphorylated, TacA triggers a downstream transcription cascade for stalk synthesis in an RpoN-dependent manner. The crystal structure of ShpA was determined to 1.52 A resolution. ShpA belongs to a family of monomeric HPt proteins that feature a highly conserved four-helix bundle. The phosphorylatable histidine His56 is located on the surface of the helix bundle and is fully solvent exposed. One end of the four-helix bundle in ShpA is shorter compared with other characterized HPt proteins, whereas the face that potentially interacts with the response regulators is structurally conserved. Similarities of the interaction surface around the phosphorylation site suggest that ShpA is likely to share a common mechanism for molecular recognition and phosphotransfer with yeast phosphotransfer protein YPD1 despite their low overall sequence similarity.
Abstract: Molecular tethers have a central role in the organization of the complex membrane architecture of eukaryotic cells. p115 is a ubiquitous, essential tether involved in vesicle transport and the structural organization of the exocytic pathway. We describe two crystal structures of the N-terminal domain of p115 at 2.0 A resolution. The p115 structures show a novel alpha-solenoid architecture constructed of 12 armadillo-like, tether-repeat, alpha-helical tripod motifs. We find that the H1 TR binds the Rab1 GTPase involved in endoplasmic reticulum to Golgi transport. Mutation of the H1 motif results in the dominant negative inhibition of endoplasmic reticulum to Golgi trafficking. We propose that the H1 helical tripod contributes to the assembly of Rab-dependent complexes responsible for the tether and SNARE-dependent fusion of membranes.
Abstract: Fifteen rare cancer-derived mutants of PIK3CA, the gene coding for the catalytic subunit p110alpha of phosphatidylinositol 3-kinase (PI3K), were examined for their biological and biochemical properties. Fourteen of these mutants show a gain of function: they induce rapamycin-sensitive oncogenic transformation of chicken embryo fibroblasts, constitutively activate Akt and TOR-mediated signaling, and show enhanced lipid kinase activity. Mapping of these mutants on a partial structural model of p110alpha suggests three groups of mutants, defined by their location in distinct functional domains of the protein. We hypothesize that each of these three groups induces a gain of PI3K function by a different molecular mechanism. Mutants in the C2 domain increase the positive surface charge of this domain and therefore may enhance the recruitment of p110alpha to cellular membranes. Mutants in the helical domain map to a contiguous surface of the protein and may affect the interaction with other protein(s). Mutants in the kinase domain are located near the hinge of the activation loop. They may alter the position and mobility of the activation loop. Arbitrarily introduced mutations that have no detectable phenotype map either to the interior of the protein or are positioned on a surface region that lies opposite to the exposed surfaces containing gain-of-function mutants. Engineered mutants that exchange acidic or neutral residues for basic residues on the critical surfaces show a gain of function.
Abstract: BtDyP from Bacteroides thetaiotaomicron (strain VPI-5482) and TyrA from Shewanella oneidensis are dye-decolorizing peroxidases (DyPs), members of a new family of heme-dependent peroxidases recently identified in fungi and bacteria. Here, we report the crystal structures of BtDyP and TyrA at 1.6 and 2.7 A, respectively. BtDyP assembles into a hexamer, while TyrA assembles into a dimer; the dimerization interface is conserved between the two proteins. Each monomer exhibits a two-domain, alpha+beta ferredoxin-like fold. A site for heme binding was identified computationally, and modeling of a heme into the proposed active site allowed for identification of residues likely to be functionally important. Structural and sequence comparisons with other DyPs demonstrate a conservation of putative heme-binding residues, including an absolutely conserved histidine. Isothermal titration calorimetry experiments confirm heme binding, but with a stoichiometry of 0.3:1 (heme:protein).
Abstract: Cancer-specific mutations in the catalytic subunit of phosphatidylinositol 3-kinase (PI3K) p110 alpha occur in diverse tumors in frequencies that can exceed 30%. The majority of these mutations map to one of three hot spots in the gene, and the rest are distributed over much of the PI3K coding sequence. Most of the cancer-specific mutations induce a gain of function that results in oncogenicity, elevated lipid kinase activity and constitutive signaling through the kinases Akt and TOR. The location of the mutations on a model structure of p110 alpha indicates several distinct mechanisms for the gain of function. The mutated p110 alpha proteins are promising cancer targets. Although identification of mutant-specific small-molecule inhibitors seems technically challenging, the therapeutic benefits from such inhibitors could be extremely important.
Abstract: TyrA is a member of the dye-decolorizing peroxidase (DyP) family, a new family of heme-dependent peroxidase recently identified in fungi and bacteria. Here, we report the crystal structure of TyrA in complex with iron protoporphyrin (IX) at 2.3 A. TyrA is a dimer, with each monomer exhibiting a two-domain, alpha/beta ferredoxin-like fold. Both domains contribute to the heme-binding site. Co-crystallization in the presence of an excess of iron protoporphyrin (IX) chloride allowed for the unambiguous location of the active site and the specific residues involved in heme binding. The structure reveals a Fe-His-Asp triad essential for heme positioning, as well as a novel conformation of one of the heme propionate moieties compared to plant peroxidases. Structural comparison to the canonical DyP family member, DyP from Thanatephorus cucumeris (Dec 1), demonstrates conservation of this novel heme conformation, as well as residues important for heme binding. Structural comparisons with representative members from all classes of the plant, bacterial, and fungal peroxidase superfamily demonstrate that TyrA, and by extension the DyP family, adopts a fold different from all other structurally characterized heme peroxidases. We propose that a new superfamily be added to the peroxidase classification scheme to encompass the DyP family of heme peroxidases.
Abstract: Glutathione S-transferases (GSTs) comprise a diverse superfamily of enzymes found in organisms from all kingdoms of life. GSTs are involved in diverse processes, notably small-molecule biosynthesis or detoxification, and are frequently also used in protein engineering studies or as biotechnology tools. Here, we report the high-resolution X-ray structure of Atu5508 from the pathogenic soil bacterium Agrobacterium tumefaciens (atGST1). Through use of comparative sequence and structural analysis of the GST superfamily, we identified local sequence and structural signatures, which allowed us to distinguish between different GST classes. This approach enables GST classification based on structure, without requiring additional biochemical or immunological data. Consequently, analysis of the atGST1 crystal structure suggests a new GST class, distinct from previously characterized GSTs, which would make it an attractive target for further biochemical studies.
Abstract: CD1 antigens bind a variety of self and foreign lipid and glycolipid antigens for presentation to CD1-restricted T cell receptors (TCRs). Here we report the crystal structure of human CD1a in complex with a sulfatide self antigen at a resolution of 2.15 A. The lipid adopts an S-shaped conformation, with the sphingosine chain completely buried in the A' pocket and the fatty acid chain emerging from the interface of the A' pocket into the more exposed F' pocket. The headgroup is anchored in the A'-F' junction and protrudes into the F' pocket for TCR recognition. Because the A' pocket is narrow with a fixed terminus, it can act as a molecular 'ruler' to select alkyl chains of a particular length.
Abstract: Structural genomics is emerging as a principal approach to define protein structure-function relationships. To apply this approach on a genomic scale, novel methods and technologies must be developed to determine large numbers of structures. We describe the design and implementation of a high-throughput structural genomics pipeline and its application to the proteome of the thermophilic bacterium Thermotoga maritima. By using this pipeline, we successfully cloned and attempted expression of 1,376 of the predicted 1,877 genes (73%) and have identified crystallization conditions for 432 proteins, comprising 23% of the T. maritima proteome. Representative structures from TM0423 glycerol dehydrogenase and TM0449 thymidylate synthase-complementing protein are presented as examples of final outputs from the pipeline.
Abstract: Sialidosis is an autosomal recessive disease caused by the genetic deficiency of lysosomal sialidase, which catalyzes the catabolism of sialoglycoconjugates. The disease is associated with progressive impaired vision, macular cherry-red spots, and myoclonus (sialidosis type I) or with skeletal dysplasia, Hurler-like phenotype, dysostosis multiplex, mental retardation, and hepatosplenomegaly (sialidosis type II). We analyzed the effect of the missense mutations G68V, S182G, G227R, F260Y, L270F, A298V, G328S, and L363P, which are identified in the sialidosis type I and sialidosis type II patients, on the activity, stability, and intracellular distribution of sialidase. We found that three mutations, F260Y, L270F, and A298V, which are clustered in the same region on the surface of the sialidase molecule, dramatically reduced the enzyme activity and caused a rapid intralysosomal degradation of the expressed protein. We suggested that this region might be involved in sialidase binding with lysosomal cathepsin A and/or beta-galactosidase in the multienzyme lysosomal complex required for the expression of sialidase activity. Transgenic expression of mutants followed by density gradient centrifugation of cellular extracts confirmed this hypothesis and showed that sialidase deficiency in some sialidosis patients results from disruption of the lysosomal multienzyme complex.
Abstract: Sialidosis is an autosomal recessive disease caused by the genetic deficiency of lysosomal sialidase, which catalyzes the hydrolysis of sialoglycoconjugates. The disease is associated with progressive impaired vision, macular cherry-red spots and myoclonus (sialidosis type I) or with skeletal dysplasia, Hurler-like phenotype, dysostosis multiplex, mental retardation and hepatosplenomegaly (sialidosis type II). We have analyzed the genomic DNA from nine sialidosis patients of multiple ethnic origin in order to find mutations responsible for the enzyme deficiency. The activity of the identified variants was studied by transgenic expression. One patient had a frameshift mutation (G623delG deletion), which introduced a stop codon, truncating 113 amino acids. All others had missense mutations: G679G-->A (Gly227Arg), C893C-->T (Ala298Val), G203G-->T (Gly68Val), A544A-->G (Ser182Gly) C808C-->T (Leu270Phe) and G982G-->A (Gly328Ser). We have modeled the three-dimensional structure of sialidase based on the atomic coordinates of the homologous bacterial sialidases, located the positions of mutations and estimated their potential effect. This analysis showed that five mutations are clustered in one region on the surface of the sialidase molecule. These mutations dramatically reduce the enzyme activity and cause a rapid intralysosomal degradation of the expressed protein. We hypothesize that this region may be involved in the interface of sialidase binding with lysosomal cathepsin A and/or beta-galactosidase in their high-molecular-weight complex required for the expression of sialidase activity in the lysosome.
Abstract: A key step in the targeting of soluble lysosomal enzymes is their recognition and phosphorylation by a 540 kDa multisubunit enzyme, UDP-N-acetylglucosamine-phosphotransferase (phosphotransferase). The molecular mechanism of recognition is still unknown, but previous experiments suggested that the phosphotransferase-binding sites on lysosomal proteins are represented by structurally conserved surface patches of amino acids. We identified four such regions on nonhomologous lysosomal enzymes, cathepsins A, B, and D, which were superimposed by rotating their structures around the Calpha atom of the glycosylated Asn residue. We proposed that these regions represent putative phosphotransferase-binding sites and tested synthetic peptides, derived from these regions on the basis of surface accessibility, for their ability to inhibit in vitro phosphorylation of purified cathepsins A, B, and D. Our results indicate that cathepsin A and cathepsin D have one closely related phosphotransferase recognition site represented by a structurally and topologically conserved beta-hairpin loop, similar to that previously identified in lysosomal beta-glucuronidase. The most potent inhibition of phosphorylation was demonstrated by homologous peptides derived from the regions located on cathepsin molecules opposite the oligosaccharide chains which are phosphorylated by the phosphotransferase. We propose that recognition and catalytic sites of the phosphotransferase are located on different subunits, therefore, providing an effective mechanism for binding and phosphorylation of lysosomal proteins of different molecular size.
Abstract: The green fluorescent protein (GFP) from the jellyfish Aequorea victoria has become a useful tool in molecular and cell biology. Recently, it has been found that the fluorescence spectra of most mutants of GFP respond rapidly and reversibly to pH variations, making them useful as probes of intracellular pH. To explore the structural basis for the titration behavior of the popular GFP S65T variant, we determined high-resolution crystal structures at pH 8.0 and 4.6. The structures revealed changes in the hydrogen bond pattern with the chromophore, suggesting that the pH sensitivity derives from protonation of the chromophore phenolate. Mutations were designed in yellow fluorescent protein (S65G/V68L/S72A/T203Y) to change the solvent accessibility (H148G) and to modify polar groups (H148Q, E222Q) near the chromophore. pH titrations of these variants indicate that the chromophore pKa can be modulated over a broad range from 6 to 8, allowing for pH determination from pH 5 to pH 9. Finally, mutagenesis was used to raise the pKa from 6.0 (S65T) to 7.8 (S65T/H148D). Unlike other variants, S65T/H148D exhibits two pH-dependent excitation peaks for green fluorescence with a clean isosbestic point. This raises the interesting possibility of using fluorescence at this isosbestic point as an internal reference. Practical real time in vivo applications in cell and developmental biology are proposed.
Abstract: Galactosialidosis is a recessively inherited lysosomal storage disease characterized by the combined deficiency of neuraminidase and beta-galactosidase secondary to the genetic deficiency of cathepsin A/protective protein. In lysosomes, cathepsin A forms a high-molecular-weight complex with beta-galactosidase and neuraminidase that protects these enzymes against intralysosomal proteolysis. In a patient affected with late infantile form of galactosialidosis, we found two new cathepsin A mutations, a two-nucleotide deletion, c517delTT and an intronic mutation, IVS8+9C-->G resulting in abnormal splicing and a five-nucleotide insertion in the cathepsin A cDNA. Both mutations cause frameshifts and result in the synthesis of truncated cathepsin A proteins, which, as suggested by structural modeling, are incapable of dimerization, complex formation, and catalysis. However, enzymatic assays, gel-filtration, and Western blot analysis of the patient's cultured skin fibroblast extracts showed the presence of a small amount of normal-size, catalytically active cathepsin A and cathepsin A-beta-galactosidase 680 kDa complex, suggesting that a low amount of cathepsin A mRNA is spliced normally and produces the wild-type protein. This may contribute to the relatively mild phenotype of the patient and illustrates the importance of critically comparing molecular results with clinical and biochemical phenotypes.
Abstract: Because of its ability to spontaneously generate its own fluorophore, the green fluorescent protein (GFP) from the jellyfish Aequorea victoria is used extensively as a fluorescent marker in molecular and cell biology. The yellow fluorescent proteins (YFPs) have the longest wavelength emissions of all GFP variants examined to date. This shift in the spectrum is the result of a T203Y substitution (single-letter amino acid code), a mutation rationally designed on the basis of the X-ray structure of GFP S65T.
Abstract: Sialidase (neuraminidase, EC 3.2.1.18) catalyses the hydrolysis of terminal sialic acid residues of glyconjugates. Sialidase has been well studied in viruses and bacteria where it destroys the sialic acid-containing receptors at the surface of host cells, and mobilizes bacterial nutrients. In mammals, three types of sialidases, lysosomal, plasma membrane and cytosolic, have been described. For lysosomal sialidase in humans, the primary genetic deficiency results in an autosomal recessive disease, sialidosis, associated with tissue accumulation and urinary excretion of sialylated oligosaccharides and glycolipids. Sialidosis includes two main clinical variants: late-onset, sialidosis type I, characterized by bilateral macular cherry-red spots and myoclonus, and infantile-onset, sialidosis type II, characterized by skeletal dysplasia, mental retardation and hepatosplenomegaly. We report the identification of human lysosomal sialidase cDNA, its cloning, sequencing and expression. Examination of six sialidosis patients revealed three mutations, one frameshift insertion and two missense. We mapped the lysosomal sialidase gene to human chromosome 6 (6p21.3), which is consistent with the previous chromosomal assignment of this gene in proximity to the HLA locus.
Abstract: We previously showed that human liver hydroxymethylglutaryl-CoA (HMG-CoA) lyase (HL; EC 4.1.3.4) is found in both mitochondria and peroxisomes. HL contains a 27-residue N-terminal mitochondrial targeting sequence which in cleaved on mitochondrial entry, as well as a C-terminal Cys-Lys-Leu peroxisomal targeting motif. Because peroxisomal HL has a greater molecular mass and more basic pI value than mitochondrial HL, we predicted that peroxisomal HL retains the mitochondrial leader. To test this hypothesis, we expressed both the precursor (pHL) and mature (mHL) peptides in Escherichia coli and studied their properties. pHL purified by ion-exchange and hydrophobic chromatography had a pI of 7.6 on FPLC chromatofocusing and a molecular mass of 34.5 kDa on SDS/PAGE, similar to our findings for peroxisomal HL. For purified mHL, pI (6.2) and molecular mass (32 kDa) values resemble those of mitochondrial HL. Purified pHL is similar to mHL in K(m) for HMG-CoA (44.8 microM), k(cat) (6.3 min(-1)) and pH optimum (9.0-9.5). However, the quaternary structures of pHL and mHL differ. On Superose 12 FPLC gel filtration and also on ultrafiltration, both in the presence and in the absence of HMG-CoA), pHL behaves as a monomer whereas mHL migrates as a dimer. We conclude that the HL percursor is probably identical to peroxisomal HL, that its catalytic properties resemble those of mature mitochondrial HL, and that the mitochondrial leader peptide prevents dimerization on pHL.
Abstract: Human cathepsin A ("lysosomal protective protein"; E.C.3.4.16.5) is a multifunctional lysosomal protein which forms a high-molecular-weight complex with beta-galactosidase and alpha-neuraminidase, protecting them against intralysosomal proteolysis. In addition to this protective function, cathepsin A is a serine carboxypeptidase and the understanding of its catalytic function requires a definition of its substrate specificity. For this purpose, we used a combined experimental [Pshezhetsky, A. V., Vinogradova, M. V., Elsliger, M.-A., El-Zein, F., Svedas, V.K., & Potier, M. (1995) Anal. Biochem. 230, 303-307] and theoretical approach comparing cathepsin A to two different homologous carboxypeptidases of the same family: yeast carboxypeptidase Y and wheat carboxypeptidase II. We computed the energies involved in substrate binding to the S1' subsite (C-terminal) of cathepsin A using a structural model based on the X-ray structure of the homologous wheat carboxypeptidase II. The binding energies of N-blocked Phe-Xaa dipeptide substrates to the active sites of cathepsin A, wheat carboxypeptidase II, and yeast carboxypeptidase Y were estimated using a molecular mechanics force field supplemented with a solvation energy term. This theoretical analysis showed a good correlation with the experimentally determined free energies of substrate binding. This result validates the use of this approach to analyze the energetics of substrate binding to the S1' subsite and provides a rational interpretation of serine carboxypeptidase-substrate interactions in molecular terms. We conclude that the three serine carboxypeptidases have similar affinities for substrates with hydrophobic P1' amino acid residues but that the wheat enzyme has an additional capacity for binding positively charged P1' residues. Finally, the substrate specificity of human cathepsin A is very similar to that of carboxypeptidase Y, with a high binding affinity for substrates with hydrophobic P1' residues, but the affinity of cathepsin A for P1; Phe residue is higher than for the Leu residue.
Abstract: Human lysosomal beta-galactosidase is organized as a 680-kDa complex with cathepsin A (also named carboxypeptidase L and protective protein), which is necessary to protect beta-galactosidase from intralysosomal proteolysis. To understand the molecular mechanism of beta-galactosidase protection by cathepsin A, we defined the structural organization of their complex including the beta-galactosidase-binding interface on cathepsin A. Radiation inactivation analysis suggested the existence of a 168-kDa structural subunit of the complex containing both beta-galactosidase and cathepsin A. Chemical cross-linking of the complex confirmed the existence of this subunit and showed that it is composed of one cathepsin A dimer and one beta-galactosidase monomer. The modeling of the cathepsin A dimer tertiary structure based on atomic coordinates of a wheat carboxypeptidase suggested a putative beta-galactosidase-binding cavity formed by the association of two cathepsin A monomers. According to this model two exposed loops of cathepsin A bordering the cavity were chosen as part of a putative beta-galactosidase-binding interface. Synthetic peptides corresponding to these loops were found both to dissociate the complex and to inhibit its in vitro reconstitution from purified cathepsin A and beta-galactosidase. The defined location of the GAL monomer in the complex with 35% of its surface covered by the CathA dimer may explain the stabilizing effect of CathA on GAL in lysosome.
Abstract: We describe a method to determine the substrate specificity of human lysosomal carboxypeptidase, cathepsin A/protective protein, using furylacryloyl (FA)-Phe-X dipeptides as substrates. These dipeptides contain a chromophore which allows continuous spectrophotometric assay at wavelengths above 324 nm with little interference from protein absorbance. The results obtained with cathepsin A purified from human placenta demonstrate that the enzyme has the highest affinity for substrates with large hydrophobic (Phe, Leu) or positively charged (Arg) amino acid residues in P1' position. The three substrates (FA-Phe-Phe, FA-Phe-Leu, and FA-Phe-Ala) which demonstrated the highest specificity (kcat/Km) for the purified enzyme were then used to assay cathepsin A activity in cultured skin fibroblasts from patients affected with galactosialidosis, an inherited lysosomal storage disease caused by the genetic deficiency of cathepsin A. Residual cathepsin A activity in galactosialidosis fibroblasts was lower than 6% of controls, indicating the high specificity of the assay method.
Abstract: The deficiency of the lysosomal protective protein/carboxypeptidase L (CARB L) causes the lysosomal storage disorder, galactosialidosis, characterized by neuraminidase and beta-galactosidase deficiencies in patients' cells. The three enzymes form a complex inside the lysosome, and the neuraminidase and beta-galactosidase deficiencies are secondary to CARB L deficiency. Sequence similarity and common enzymological properties suggest that the protomeric tertiary structure of CARB L is conserved within a family of serine carboxypeptidases which includes the yeast carboxypeptidase Y, killer expression I gene product and several plant carboxypeptidases. We used this homology to build a model of the CARB L structure based on the recently published X-ray atomic coordinates of the wheat carboxypeptidase II (CPDW-II) which shares 32% primary structure identity with CARB L. Small insertions and deletions were accommodated into the model structure by energy minimization using the DREIDING II force field. The C alpha atomic coordinates of the final CARB L model have a RMS shift of 1.01 A compared to the corresponding conserved residues in the CPDW-II template structure. The correct orientation of the homologous catalytic triad residues Ser150, His429 and Asp392, the potential energy calculations and the distribution of hydrophobic and hydrophillic residues in the structure all support the validity of the CARB L model. Most missense mutations identified in galactosialidosis patients were located in secondary structural elements except for the Tyr211-->Asn mutation which is in a loop. The other mutant residues have their side chains deeply buried in the central beta-sheet of the model structure except for the Phe412-->Val mutation which is located in the dimer interface. The predicted effects of specific mutations on CARB L structural stability correlates well with recently published transient expression studies of mutant CARB L (Shimmoto, M. et al., J. Clin. Invest., 91:2393-2399, 1993).
Abstract: Intestinal and renal trehalase isozymes have been distinguished in normal human amniotic fluid on the basis of their membrane-bound character and isoelectric point (pI). The intestinal trehalase was mostly membrane bound in amniotic fluid and had a pI around 4.60. In contrast, the renal form of trehalase was soluble and had a pI around 4.37. These pI values were consistent with those found in extracts of fetal intestinal (pI 4.60) and renal (pI 4.24) tissues. The determination of trehalase isozyme composition of amniotic fluid from pathological pregnancies with anal imperforation and polycystic kidney disease confirmed our findings on the origin of amniotic fluid trehalase. In the sample from a fetus with anal imperforation, low or absent intestinal trehalase isozyme was observed whereas a higher than normal level of renal trehalase activity was found in a fetus with polycystic kidney disease.
Abstract: We have used a cell-free system derived from hamster brain to investigate protein synthesis during experimental phenylketonuria. In such a system the elongation inhibitor emetine impeded translation in extracts derived from both treated and control animals. On the other hand the initiation inhibitor aurintricarboxylic acid showed no effects on protein synthesis activity of treated hamsters, although it was severely inhibiting in controls. This suggests that initiation is the altered step in brain protein synthesis failure consecutive to phenylketonuria.