Center for Gene Research Nagoya University Nagoya 464-8602 Japan
sugita@gene.nagoya-u.ac.jp
Education: Hokkaido University, Hokkaido, Japan (MS. Botany 1980) Hokkaido University, Graduate School of Science, Botany Department (Ph.D. course 1983) Degree: Doctor of Science, Nagoya University, 1984
Academic carrier: Hokkaido University, Department of Botany, Research Associate (1983. 10~) University of California, Berkeley, Department of Botany, Postdoctoral Fellow (1985.4~) Hokkaido University, Department of Botany, Lecturer (1988. 8~) Nagoya University, Center for Gene Research, Associate Professor (1989. 4~) Nagoya University, Graduate School of Human Informatics, Professor (1998.10~) Nagoya University, Center for Gene Research, Professor (2000. 12~)
Memberships: American Association for the Advancement of Science The International Society for Plant Molecular Biology The Botanical Society of Japan The Molecular Biology Society of Japan The Japanese Society of Plant Physiologists
Editorial Boards: Plant & Cell Physiology (The Japanese Society of Plant Physiologists) 1998~2001 The International Journal, Plant Biology (German Botanical Society and Royal Botanical Society of the Netherlands) 2004. 1~
Awards: Inoue Research Award for Young Scientists (1985) THE GENETICS SOCIETY OF JAPAN Young Investigator Award (1994)
Abstract: Pentatricopeptide repeat (PPR) proteins constitute a large family in land plants and are required for various post-transcriptional steps associated with RNA in plant organelles. The moss Physcomitrella patens PPR protein, PpPPR_38, is a nuclear-encoded chloroplast protein and was previously shown to be involved in the maturation step of chloroplast clpP pre-mRNA. To understand precisely the molecular function of PpPPR_38, we prepared recombinant PpPPR_38 protein and characterized it in maturation steps of clpP pre-mRNA. In vitro RNA-binding assays showed that the recombinant protein strongly bound to the clpP-5'-rps12 intergenic region, which is highly AU-rich and includes an inverted repeat sequence potentially forming a stem-loop structure. Digestion of the bound RNA region by RNase V1 was significantly accelerated by the addition of the recombinant protein. This strongly suggests that the binding of PpPPR_38 facilitates the formation of a stable stem-loop structure. An in vitro degradation assay using chloroplast lysates gave rise to the possibility that the stable stem-loop structure formed by PpPPR_38 contributes the correct intergenic RNA cleavage and protection of mature clpP mRNA against 3' to 5' exoribonuclease. Because an RNA-binding assay also showed weak binding to the clpP first exon-intron region, PpPPR_38 is likely to be related to the splicing of clpP pre-mRNA. Taking together all of the above findings, we conclude that PpPPR_38 is necessary for several steps in the clpP mRNA maturation process.
Abstract: The plant-specific DYW subclass of pentatricopeptide repeat proteins has been postulated to be involved in RNA editing of organelle transcripts. We discovered that the DYW proteins CHLORORESPIRATORY REDUCTION22 (CRR22) and CRR28 are required for editing of multiple plastid transcripts but that their DYW motifs are dispensable for editing activity in vivo. Replacement of the DYW motifs of CRR22 and CRR28 by that of CRR2, which has been shown to be capable of endonucleolytic cleavage, blocks the editing activity of both proteins. In return, the DYW motifs of neither CRR22 nor CRR28 can functionally replace that of CRR2. We propose that different DYW family members have acquired distinct functions in the divergent processes of RNA maturation, including RNA cleavage and RNA editing.
Abstract: In the unicellular cyanobacterium Synechococcus elongatus PCC 7942, essentially all promoter activities are under the control of the circadian clock under continuous light (LL) conditions. Here, we used high-density oligonucleotide arrays to explore comprehensive profiles of genome-wide Synechococcus gene expression in wild-type, kaiABC-null, and kaiC-overexpressor strains under LL and continuous dark (DD) conditions. In the wild-type strains, >30% of transcripts oscillated significantly in a circadian fashion, peaking at subjective dawn and dusk. Such circadian control was severely attenuated in kaiABC-null strains. Although it has been proposed that KaiC globally represses gene expression, our analysis revealed that dawn-expressed genes were up-regulated by kaiC-overexpression so that the clock was arrested at subjective dawn. Transfer of cells to DD conditions from LL immediately suppressed expression of most of the genes, while the clock kept even time in the absence of transcriptional feedback. Thus, the Synechococcus genome seems to be primarily regulated by light/dark cycles and is dramatically modified by the protein-based circadian oscillator.
Abstract: RNA editing of cytidine (C) to uridine (U) transitions occurs in plastids and mitochondria of most land plants. In this study, we amplified and sequenced the group I intron-containing tRNA Leu gene, trnL-CAA, from Takakia lepidozioides, a moss. DNA sequence analysis revealed that the T. lepidozioides tRNA Leu gene consisted of a 35-bp 5' exon, a 469-bp group I intron and a 50-bp 3' exon. The intron was inserted between the first and second position of the tRNA Leu anticodon. In general, plastid tRNA Leu genes with a group I intron code for a TAA anticodon in most land plants. This strongly suggests that the first nucleotide of the CAA anticodon could be edited in T. lepidozioides plastids. To investigate this possibility, we analysed cDNAs derived from the trnL-CAA transcripts. We demonstrated that the first nucleotide C of the anticodon was edited to create a canonical UAA anticodon in T. lepidozioides plastids. cDNA sequencing analyses of the spliced or unspliced tRNA Leu transcripts revealed that, while the spliced tRNA was completely edited, editing in the unspliced tRNAs were only partial. This is the first experimental evidence that the anticodon editing of tRNA occurs before RNA splicing in plastids. This suggests that this editing is a prerequisite to splicing of pre-tRNA Leu.
Abstract: Pentatricopeptide repeat (PPR) proteins form a huge family in plants (450 members in Arabidopsis and 477 in rice) defined by tandem repetitions of characteristic sequence motifs. Some of these proteins have been shown to play a role in posttranscriptional processes within organelles, and they are thought to be sequence-specific RNA-binding proteins. The origins of this family are obscure as they are lacking from almost all prokaryotes, and the spectacular expansion of the family in land plants is equally enigmatic. In this study, we investigate the growth of the family in plants by undertaking a genome-wide identification and comparison of the PPR genes of 3 organisms: the flowering plants Arabidopsis thaliana and Oryza sativa and the moss Physcomitrella patens. A large majority of the PPR genes in each of the flowering plants are intron less. In contrast, most of the 103 PPR genes in Physcomitrella are intron rich. A phylogenetic comparison of the PPR genes in all 3 species shows similarities between the intron-rich PPR genes in Physcomitrella and the few intron-rich PPR genes in higher plants. Intron-poor PPR genes in all 3 species also display a bias toward a position of their introns at their 5' ends. These results provide compelling evidence that one or more waves of retrotransposition were responsible for the expansion of the PPR gene family in flowering plants. The differing numbers of PPR proteins are highly correlated with differences in organellar RNA editing between the 3 species.
Abstract: Many plant pentatricopeptide repeat (PPR) proteins are known to contain a highly conserved C-terminal DYW domain whose function is unknown. Recently, the DYW domain has been proposed to play a role in RNA editing in plant organelles. To address this possibility, we prepared recombinant DYW proteins and tested their cytidine deaminase activity. However, we could not detect any activity in the assays we used. Instead, we found that the recombinant DYW domains possessed endoribonuclease activity and cleaved before adenosine residues in the RNA molecule. Some DYW-containing PPR proteins may catalyze site-specific cleavage of target RNA species.
Abstract: RNA editing in land plant organelles is a process primarily involving the conversion of cytidine to uridine in pre-mRNAs. The process is required for gene expression in plant organelles, because this conversion alters the encoded amino acid residues and improves the sequence identity to homologous proteins. A recent study uncovered that proteins encoded in the nuclear genome are essential for editing site recognition in chloroplasts; the mechanisms by which this recognition occurs remain unclear. To understand these mechanisms, we determined the genomic and cDNA sequences of moss Takakia lepidozioides chloroplast genes, then computationally analyzed the sequences within -30 to +10 nucleotides of RNA editing sites (neighbor sequences) likely to be recognized by trans-factors. As the T. lepidozioides chloroplast has many RNA editing sites, the analysis of these sequences provides a unique opportunity to perform statistical analyses of chloroplast RNA editing sites. We divided the 302 obtained neighbor sequences into eight groups based on sequence similarity to identify group-specific patterns. The patterns were then applied to predict novel RNA editing sites in T. lepidozioides transcripts; approximately 60% of these predicted sites are true editing sites. The success of this prediction algorithm suggests that the obtained patterns are indicative of key sites recognized by trans-factors around editing sites of T. lepidozioides chloroplast genes.
Abstract: SUMMARY: In BT-type cytoplasmic male sterile rice (Oryza sativa L.) with Chinsurah Boro II cytoplasm, cytoplasmic male sterility (CMS) is caused by an accumulation of the cytotoxic peptide ORF79. The ORF79 protein is expressed from a dicistronic gene atp6-orf79, which exists in addition to the normal atp6 gene in the BT-type mitochondrial genome. The CMS is restored by a PPR (pentatricopeptide-repeat) gene, Rf1, via RNA processing. However, it has not yet been elucidated how the accumulation of ORF79 is reduced by the action of the Rf1 protein. Here, we report that the level of processed orf79 transcripts in the restorer line was reduced to 50% of the unprocessed atp6-orf79 transcripts in the CMS line. Ninety percent of the processed orf79 transcripts, which remained after degradation, were not associated with the ribosome for translation. Our data suggests that the processing of atp6-orf79 transcripts diminishes the expression of orf79 by the translational reduction and degradation of the processed orf79 transcripts.
Abstract: The entire genome of the unicellular cyanobacterium Synechococcus elongatus PCC 6301 (formerly Anacystis nidulans Berkeley strain 6301) was sequenced. The genome consisted of a circular chromosome 2,696,255 bp long. A total of 2,525 potential protein-coding genes, two sets of rRNA genes, 45 tRNA genes representing 42 tRNA species, and several genes for small stable RNAs were assigned to the chromosome by similarity searches and computer predictions. The translated products of 56% of the potential protein-coding genes showed sequence similarities to experimentally identified and predicted proteins of known function, and the products of 35% of the genes showed sequence similarities to the translated products of hypothetical genes. The remaining 9% of genes lacked significant similarities to genes for predicted proteins in the public DNA databases. Some 139 genes coding for photosynthesis-related components were identified. Thirty-seven genes for two-component signal transduction systems were also identified. This is the smallest number of such genes identified in cyanobacteria, except for marine cyanobacteria, suggesting that only simple signal transduction systems are found in this strain. The gene arrangement and nucleotide sequence of Synechococcus elongatus PCC 6301 were nearly identical to those of a closely related strain Synechococcus elongatus PCC 7942, except for the presence of a 188.6 kb inversion. The sequences as well as the gene information shown in this paper are available in the Web database, CYORF (http://www.cyano.genome.jp/).
Abstract: In general, in higher plants, the core subunits of a bacterial-type plastid-encoded RNA polymerase (PEP) are encoded by the plastid rpoA, rpoB, rpoC1 and rpoC2 genes. However, an rpoA gene is absent from the moss Physcomitrella patens plastid genome, although the PpRpoA gene (renamed PpRpoA1) nuclear counterpart is present in the nuclear genome. In this study, we identified and characterized a second gene encoding the plastid-targeting alpha subunit (PpRpoA2). PpRpoA2 comprised 525 amino acids and showed 59% amino acid identity with PpRpoA1. Two PpRpoA proteins were present in the PEP active fractions separated from the moss chloroplast lysate, confirming that both proteins are alpha subunits of PEP. Northern blot analysis showed that PpRpoA2 was highly expressed in the light, but not in the dark, whereas PpRpoA1 was constitutively expressed. Disruption of the PpRpoA1 gene resulted in an increase in the PpRpoA2 transcript level, but most plastid gene transcript levels were not significantly altered. This indicates that transcription of most plastid genes depends on PpRpoA2-PEP rather than on PpRpoA1-PEP. In contrast, the transcript levels of petN, psbZ and ycf3 were altered in the PpRpoA1 gene disruptant, suggesting that these are PpRpoA1-PEP-dependent genes. These observations suggest that plastid genes are differentially transcribed by distinct PEP enzymes with either PpRpoA1 or PpRpoA2.
Abstract: Pentatricopeptide repeat (PPR) proteins are encoded by the nuclear genome as a large gene family in land plants. PPR proteins play essential roles in organelle-related functions, mostly in RNA-processing steps in plastids and mitochondria. In the moss Physcomitrella patens, there is also a large gene family, but the moss PPR proteins are likely to be divergent from those of higher plants. To investigate the function of plastid PPR proteins, we have generated and characterized a PPR protein gene disruptant of P. patens. The PPR531-11-disrupted mosses displayed abnormal phenotypic characteristics, such as a significantly smaller protonemal colony, different chloroplast morphology, and incomplete thylakoid membrane formation. In addition, the quantum yield of photosystem II was reduced in the disrupted mosses. To further investigate whether disruption of the PPR531-11 gene affects chloroplast gene expression, we performed Northern blot and reverse transcription polymerase chain reaction analyses. These analyses revealed that PPR531-11 has a role in intergenic RNA cleavage between clpP and 5'-rps12 and in the splicing of clpP pre-mRNA. Western blot analysis showed that disruption of PPR531-11 resulted in a reduced level of ClpP, photosystem II reaction center protein D1, and the stromal enzyme, ribulose-bisphosphate carboxylase/oxygenase. These reductions might result in the severely retarded growth of the protonemal colony. Taken together, we propose a model where PPR531-11 function affects the steady-state level of ClpP, which regulates the formation and maintenance of thylakoid membranes in chloroplasts. This is the first evidence of a PPR protein controlling the protein expression level of ClpP.
Abstract: Small, regulatory, non-coding RNA (ncRNA) is involved in various cell functions in both prokaryotes and eukaryotes. However, information on ncRNA in cyanobacteria is still scarce. We studied ncRNA genes by computational screening to compare the intergenic regions of the Synechococcus elongatus PCC 6301 genome with the genomes of three freshwater cyanobacteria. We identified an ncRNA gene in S. elongatus, which has been previously described as yfr1 in marine cyanobacteria. The S. elongatus yfr1 gene is 65 nucleotides long and is positioned between guaB and trxA. We found a high conservation of the yfr1 gene in most cyanobacterial lineages. A yfr1-deficient mutant showed reduced growth under various stress conditions, e.g. oxidative stress and high salt stress conditions, and showed unusual accumulation of sbtA mRNA. A gel shift assay demonstrated interaction of the Yfr1 RNA with sbtA mRNA in vitro. This suggests that the sbtA transcript is a target RNA for the Yfr1 RNA.
Abstract: The Synechococcus elongatus mutant lacking the nrtABCD gene cluster (NA3) is defective in active nitrate transport and requires high nitrate concentrations (>30 mm) for sustained growth. Prolonged incubation of NA3 in medium containing 2 mm nitrate led to isolation of a pseudorevertant (NA3R) capable of transport of millimolar concentrations of nitrate, from which three mutants with improved affinity for nitrate were obtained. We identified three genes responsible for the latent transport activity for nitrate: ltnA, which encodes a response regulator with no effector domain; ltnB, which encodes a hybrid histidine kinase with two receiver domains; and ltnT, which encodes a sulfate permease-like protein with a putative cyclic nucleoside monophosphate (cNMP)-binding domain. Missense mutations of the high affinity derivatives of NA3R were found in ltnT, verifying that LtnT acts as the transporter. Overexpression of truncated LtnT lacking the cNMP-binding domain (but not full-length LtnT) conferred nitrate transport activity on NA3, suggesting that the cNMP-binding domain inhibits transport under normal conditions. A nonsense mutation in ltnB that resulted in elimination of the receiver domains of the encoded protein was responsible for expression of nitrate transport activity in NA3R. Expression of LtnB derivatives lacking the receiver domains also conferred low affinity nitrate transport activity on NA3. The phosphoryl group of the histidine kinase domain of LtnB was transferred to Asp(52) of LtnA in vitro. Overexpression of LtnA (but not LtnA(D52E)) led to manifestation of the latent nitrate transport activity in NA3, indicating involvement of phosphorylated LtnA in activation of the novel transporter.
Abstract: In higher plants, RNA editing is a post-transcriptional process that converts C to U in organelle mRNAs. We have previously shown that an Arabidopsis thaliana crr4 mutant is defective with respect to RNA editing for creating the translational initial codon of the plastid ndhD gene (the ndhD-1 site). CRR4 contains 11 pentatricopeptide repeat motifs but does not contain any domains that are likely to be involved in the editing activity. The green fluorescent protein fused to the putative transit peptide of CRR4 targeted the plastid. The recombinant CRR4 expressed in Escherichia coli specifically bound to the 25 nucleotides of the upstream and the 10 nucleotides of the downstream sequences surrounding the editing site of ndhD-1. The target C nucleotide of this editing is not essential for the binding of CRR4. Taken together with the genetic evidence, we conclude that the pentatricopeptide repeat protein CRR4 is a sequence-specific RNA-binding protein that acts as a site recognition factor in plastid RNA editing.
Abstract: RNA editing is a post-transcriptional process that changes individual nucleotides in transcripts, and usually occurs in the plastids of land plants. The number of RNA editing sites in a plastid is significantly divergent in bryophytes, ranging from zero in liverworts to almost 1,000 sites in hornworts. In this study, we identified 132 RNA editing sites in the transcripts of six genes from the psbB operon and the rpoA of the moss Takakia lepidozioides. This is the highest number of RNA editing sites known in this region among land plant species. All were cytidine-to-uridine conversions. More than 91% of RNA editing occurred at the first or second codon positions, and it altered amino acid identity. Six editing sites created new translation initiation codons or stop codons. Thirty-two sites were commonly observed in the hornwort Anthoceros angustus. This finding suggests that the enigmatic bryophyte Takakia is closely related to hornworts with respect to RNA editing events.
Abstract: KaiA, KaiB, and KaiC clock proteins from cyanobacteria and ATP are sufficient to reconstitute the KaiC phosphorylation rhythm in vitro, whereas almost all gene promoters are under the control of the circadian clock. The mechanism by which the KaiC phosphorylation cycle drives global transcription rhythms is unknown. Here, we report that RpaA, a potential DNA-binding protein that acts as a cognate response regulator of the KaiC-interacting kinase SasA, mediates between KaiC phosphorylation and global transcription rhythms. Circadian transcription was severely attenuated in sasA (Synechococcus adaptive sensor A)- and rpaA (regulator of phycobilisome-associated)-mutant cells, and the phosphotransfer activity from SasA to RpaA changed dramatically depending on the circadian state of a coexisting Kai protein complex in vitro. We propose a model in which the SasA-RpaA two-component system mediates time signals from the enzymatic oscillator to drive genome-wide transcription rhythms in cyanobacteria. Moreover, our results indicate the presence of secondary output pathways from the clock to transcription control, suggesting that multiple pathways ensure a genome-wide circadian system.
Abstract: Nitrate transport activity of the LtnT permease of the cyanobacterium Synechococcus elongatus is activated when LtnA, a response regulator without an effector domain, is phosphorylated by LtnB, a hybrid histidine kinase. We identified a protein (LtnC) that is required for activation of LtnT. LtnC consists of an N-terminal histidine-containing phosphoacceptor (HisKA) domain, a receiver domain, and a unique C-terminal domain found in some cyanobacterial proteins. Because LtnC lacks an ATP-binding kinase domain of a histidine kinase, it is incapable of autophosphorylation, but LtnC is phosphorylated by LtnA. The histidine residue in the HisKA domain but not the aspartate residue in the receiver domain is essential for phosphorylation of LtnC and activation of LtnT. LtnC phosphorylation leads to oligomerization of the protein. Fusion of the C-terminal domain of LtnC to glutathione S-transferase, which forms oligomers, also activates LtnT, suggesting that oligomerization of the LtnC C-terminal domain causes LtnT activation. These results indicate that the C-terminal domain of LtnC acts as an effector domain that directs the output of the signal from the phosphorelay system. The two-step (His-Asp-His) phosphorelay system, composed of the LtnB, LtnA, and LtnC proteins, is distinct from the known phosphorelay systems, namely, the typical two-component system (His-Asp) and the multistep phosphorelay system (His-Asp-His-Asp), because the HisKA domain of LtnC is the terminal phosphoacceptor that determines the signal output. LtnC is a new class of signal transducer in His-Asp phosphorelay systems that contains a HisKA domain and an effector domain.
Abstract: The moss Physcomitrella patens is a newly established model plant that is widely used for the characterization of gene function by targeted gene knockout or over-expression. The target gene disruption occurs in both the nuclear and chloroplast genomes. We applied DNA microarray technology to the P. patens plastid genome for large-scale analysis of transcripts. A microarray was constructed containing 108 DNA fragments to detect all annotated plastid genes. We analyzed the transcript profile in a knockout transformant for the arginine tRNA gene, trnR-CCG, and confirmed previous results that rbcL and psaI transcripts accumulate in similar levels to wild-type moss, and accD transcript level is higher than those of wild-type moss. Additionally, the plastid DNA microarray revealed that most plastid genes were expressed at similar levels in wild-type and transformant mosses. This indicates that trnR-CCG is not essential for the expression of plastid genes.
Abstract: Opinions on the basal relationship of land plants vary considerably and no phylogenetic tree with significant statistical support has been obtained. Here, we report phylogenetic analyses using 51 genes from the entire chloroplast genome sequences of 20 representative green plant species. The analyses, using translated amino acid sequences, indicated that extant bryophytes (mosses, liverworts, and hornworts) form a monophyletic group with high statistical confidence and that extant bryophytes are likely sisters to extant vascular plants, although the support for monophyletic vascular plants was not strong. Analyses at the nucleotide level could not resolve the basal relationship with statistical confidence. Bryophyte monophyly inferred using amino acid sequences has a good statistical foundation and is not rejected statistically by other data sets. We propose bryophyte monophyly as the currently best hypothesis.
Abstract: A large gene family encoding proteins with a pentatricopeptide repeat (PPR) motif exists in flowering plants but not in algae, fungi, or animals. This suggests that PPR protein genes expanded vastly during the evolution of the land plants. To investigate this possibility, we analysed PPR protein genes in the basal land plant, the moss Physcomitrella patens. An extensive survey of the Physcomitrella expressed sequence tag (EST) databases revealed 36 ESTs encoding PPR proteins. This indicates that a large gene family of PPR proteins originated before the divergence of the vascular plant and moss lineages. We also characterized five full-length cDNAs encoding PPR proteins, designated PPR513-10, PPR566-6, PPR868-14, PPR986-12, and PPR423-6. Intracellular localization analysis demonstrated two PPR proteins in chloroplasts (cp), whereas the cellular localization of the other three PPR proteins is unclear. The genes of the cp-localized PPR513-10 and PPR566-6 were expressed differentially in protonemata grown under different light-dark conditions, suggesting they have distinctive functions in cp. This is the first report and analysis of genes encoding PPR proteins in bryophytes.
Abstract: Three distinct arginine tRNA genes, trnR-CCG, trnR-ACG, and trnR-UCU, are present in the plastid genome of bryophytes, whereas only the latter two trnR genes are present in the major vascular plants, except for black pine. trnR-CCG is located between rbcL and accD in the moss Physcomitrella patens and it was previously believed to be functional in plastids. However, no trnR-CCG transcript has been detected by Northern hybridization, and the codon usage of CGG is quite low in plastid protein-coding sequences. This raises the possibility that trnR-CCG is non-functional. To investigate this possibility, we integrated a foreign gene into the trnR-CCG coding region via homologous recombination, and constructed stable plastid trnR-CCG knock-out moss transformants. The trnR-CCG knock-out transformants grew normally, indicating that the P. patenstrnR-CCG gene is not essential for plastid function.
Abstract: Chloroplast gene expression is mainly regulated at the post-transcriptional level by numerous nuclear-encoded RNA-binding protein factors. In the present study, we focus on two RNA-binding proteins: cpRNP (chloroplast ribonucleoprotein) and PPR (pentatricopeptide repeat) protein. These are suggested to be major contributors to chloroplast RNA metabolism. Tobacco cpRNPs are composed of five different proteins containing two RNA-recognition motifs and an acidic N-terminal domain. The cpRNPs are abundant proteins and form heterogeneous complexes with most ribosome-free mRNAs and the precursors of tRNAs in the stroma. The complexes could function as platforms for various RNA-processing events in chloroplasts. It has been demonstrated that cpRNPs contribute to RNA stabilization, 3'-end formation and editing. The PPR proteins occur as a superfamily only in the higher plant species. They are predicted to be involved in RNA/DNA metabolism in chloroplasts or mitochondria. Nuclear-encoded HCF152 is a chloroplast-localized protein that usually has 12 PPR motifs. The null mutant of Arabidopsis, hcf152, is impaired in the 5'-end processing and splicing of petB transcripts. HCF152 binds the petB exon-intron junctions with high affinity. The number of PPR motifs controls its affinity and specificity for RNA. It has been suggested that each of the highly variable PPR proteins is a gene-specific regulator of plant organellar RNA metabolism.
Abstract: The nuclear-encoded plastid sigma factors are supposed to be a regulatory subunit of the multisubunit bacteria-type plastid RNA polymerase. We studied here whether or not three genes, PpSig1, PpSig2, and PpSig5 encoding plastid sigma factors, are controlled by the circadian clock and/or by blue light signaling in the moss Physcomitrella patens. Among the three PpSig genes, only PpSig5 was clearly controlled by the circadian clock. In contrast to the differential regulation on a daily timescale, a pulse of blue light induced the expression of all the three PpSig genes. This induction was significantly reduced in a knockout mutant that lacked the blue light photoreceptor cryptochromes PpCRY1a and PpCRY1b, indicating that PpCRY1a and/or PpCRY1b mediate the blue light signal that induces the expression of the PpSig genes. In a daily cycle of 12-h blue light/12-h dark, the timing of peak expression of PpSig5 and a chloroplast gene psbD, encoding the D2 subunit of photosystem II, advanced in the cryptochrome mutant relative to those in the wild type, suggesting the presence of regulatory interactions among the expression of PpSig5 and psbD, the circadian clock, and the blue light signaling mediated by the cryptochrome(s).
Abstract: The C to U editing event that converts an ACG codon to an AUG translation initiation codon in the chloroplast rps 14 transcript is unique to the moss Physcomitrella patens and has not been found in other species. The efficiency of RNA editing was 80% in the young protonemata and decreased to approximately 20% in old protonemata and fully developed leafy shoots. This indicates that RNA editing of this site is regulated in a tissue- and stage-specific manner. In this study, a novel C to U RNA editing site has been identified at the -1 position relative to the AUG. Because the editing site is localized in the mRNA 5' untranslated region, it may affect the efficiency of rps 14 mRNA translation.
Abstract: The complete chloroplast DNA sequence (122 890 bp) of the moss Physcomitrella patens has been determined. The genome contains 83 protein, 31 tRNA and four rRNA genes, and a pseudogene. Four protein genes (rpoA, cysA, cysT and ccsA) found in the liverwort Marchantia polymorpha and the hornwort Anthoceros formosae are absent from P.patens. The overall structure of P.patens chloroplast DNA (cpDNA) differs substantially from that of liverwort and hornwort. Compared with its close relatives, a 71 kb region from petD to rpoB of P.patens is inverted. To investigate whether this large inversion and the loss of rpoA usually occur in moss plants, we analyzed amplified cpDNA fragments from four moss species. Our data indicate that the large inversion occurs only in P.patens, whereas the loss of the rpoA gene occurs in all mosses. Moreover, we have isolated and characterized the nuclear rpoA gene encoding the alpha subunit of RNA polymerase (RNAP) from P.patens and examined its subcellular localization. When fused to green fluorescent protein, RpoA was observed in the chloroplasts of live moss protonemata cells. This indicates that chloroplast RNAP is encoded separately by chloroplast and nuclear genomes in the moss. These data provide new insights into the regulation and evolution of chloroplast transcription.
Abstract: The plastid genome of higher plants includes about 120 genes. We adopted genomic array technologies to the tobacco plastid genome. A microarray was constructed, consisting of 220 DNA fragments that cover the whole genome sequence. Each DNA fragment corresponds to a single known gene or an intergenic region. We evaluated reliability of this microarray by comparing the plastid RNA level in light- or dark-grown tobacco seedlings. The transcripts encoding photosynthetic subunits increased significantly in light-grown tissues as expected. Furthermore, we found unexpected signals in several intergenic regions, suggesting the existence of novel transcripts in tobacco plastids.
Abstract: Methionine aminopeptidase, known to be encoded by single genes in prokaryotes, is a cobalt-dependent enzyme that catalyzes the removal of N-terminal methionine residues from nascent polypeptides. Three ORFs encoding putative methionine aminopeptidases from the genome of cyanobacterium Synechocystis sp. strain PCC6803, designated as slr0786 ( map-1), slr0918 ( map-2) and sll0555 ( map-3) were cloned and expressed in Escherichia coli. The purified recombinant proteins encoded by map-1 and map-3 had much higher methionine aminopeptidase activity than the recombinant protein encoded by map-2. Comparative analysis revealed that the three recombinant enzymes differed in their substrate specificity, divalent ion requirement, pH, and temperature optima. The broad activities of the iso-enzymes are discussed in light of the structural similarities with other peptidase families and their levels of specificity in the cell. Potential application of cyanobacterial MetAPs in the production of recombinant proteins used in medicine is proposed. This is the first report of a prokaryote harboring multiple methionine aminopeptidases.
Abstract: Nine Lhcb1 genes encoding the light-harvesting chlorophyll a/b-binding proteins of photosystem II were isolated and characterized from Nicotiana sylvestris. Their nucleotide sequences are highly similar. Lhcb1 transcripts are accumulated in leaves and stems but not in roots and non-green cultured cells. RNase protection assay revealed that no transcripts were detected from the gene, Lhcb1*2, in Nicotiana tabacum. This finding raises the possibility that the amphidiploid tobacco cultivar (N. tabacum) lost one gene from the female progenitor (N. sylvestris) during evolution. Transcriptional initiation sites were mapped and found to be mostly cytidine residues, which is unique to the N. sylvestris Lhcb1 genes. Four of the nine genes have single start sites and the remaining genes possess multiple initiation sites. The TATA-like sequences of nine Lhcb1 genes can be classified into two groups; one that possesses a TTTATA sequence and the other that has a sequence diverged from it. The genes with single initiation sites belong to the first group. A consensus motif for the initiation region is CTC*A (C* for initiation site), which differs from those of other plant genes or mammalian genes.
Abstract: The rps14 transcript is edited in the moss Physcomitrella patens chloroplast by a C-to-U transition, to create a translation initiation codon, AUG. The efficiency of RNA editing was low, with approximately 20% of rps14 transcripts edited. This suggests that the translation of rps14 mRNA is strictly regulated by RNA editing. This is the first report of RNA editing in P. patens and the creation of a translation initiation codon in rps14 mRNA in chloroplasts.
Abstract: A third nuclear gene encoding a bacteriophage T7-type RNA polymerase, NsRpoT-C, was isolated and characterized from Nicotiana sylvestris. The gene, NsRpoT-C, consists of 21 exons and 20 introns and encodes a polypeptide of 977 amino acid residues. The predicted NsRpoT-C protein shows the highest identity (72% amino acid identity) with Arabidopsis thaliana RpoT;3 which is a plastid-targeted protein. Surprisingly, comparison of the deduced amino acid sequence of NsRpoT-C with that of A. thaliana RpoT;3 predicted that the NsRpoT-C starts at a CUG triplet, a rare translation initiation codon. Transient expression assays in protoplasts from tobacco leaves demonstrated that the putative N-terminal transit peptide of NsRpoT-C encodes a targeting signal directing the protein into chloroplasts. This strongly suggests that NsRpoT-C functions as an RNA polymerase transcribing plastid-encoded genes. We have designated this protein NsRpoTp.
Abstract: We isolated the cDNA for a sigma factor from the moss Physcomitrella patens, which possesses unusually large N-terminal extension and the conserved subdomains 1.2-4.2. Phylogenetic analyses indicated that this novel sigma factor and PpSIG1*(2), a plastid sigma factor previously identified from Physcomitrella, were classified into SigA and SigB groups, two major classes of higher plant plastid sigma factors, respectively. According to the nomenclature recently proposed, we renamed PpSIG1* into PpSIG2, and named the novel sigma factor PpSIG1. A transient expression assay using a green fluorescent protein showed that the N-terminal region of PpSIG1 acts as a chloroplast-targeting signal. Reverse transcription-PCR experiments showed that light induces the expression of the Sig1 and Sig2 genes encoding PpSIG1 and PpSIG2, respectively. Thus, PpSIG1 and PpSIG2 are likely plastid sigma factors regulating plastid gene expression in response to light signals.
Abstract: Post-transcriptional RNA processing is an important step in the regulation of chloroplast gene expression, and a number of chloroplast ribonucleoproteins (cpRNPs) are likely to be involved in this process. The major tobacco cpRNPs are composed of five species: cp28, cp29A, cp29B, cp31, and cp33 and these are divided into three groups (I, II, and III). By immunoprecipitation, gel filtration, and Western blot analysis, we demonstrated that these cpRNPs are abundant stromal proteins that exist as complexes with ribosome-free mRNAs. Many ribosome-free psbA mRNAs coprecipitate with cpRNPs, indicating that the majority of stromal psbA mRNAs are associated with cpRNPs. In addition, an in vitro mRNA degradation assay indicated that exogenous psbA mRNA is more rapidly degraded in cpRNP-depleted extracts than in nondepleted extracts. When the depleted extract was reconstituted with recombinant cpRNPs, the psbA mRNA in the extract was protected from degradation to a similar extent as the psbA mRNA in the nondepleted extract. Moreover, restoration of the stabilizing activity varied following addition of individual group-specific cpRNPs alone or in combination. When the five cpRNPs were supplemented in the depleted extract, full activity was restored. We propose that these cpRNPs act as stabilizing factors for nonribosome-bound mRNAs in the stroma.
Abstract: We isolated a cDNA PpSig1 encoding a plastid sigma factor from the moss Physcomitrella patens. The PpSIG1 protein is composed of the conserved subdomains for recognition of -10 and -35 promoter elements, core complex binding and DNA melting. Southern blot analysis showed that the moss sig1 gene is likely a member of a small gene family. Transient expression assay using green fluorescent protein demonstrated that the N-terminal region of PpSIG1 functions as a chloroplast-targeting signal peptide. These observations suggest that multiple nuclear-encoded sigma factors regulate chloroplast gene expression in P. patens.
Abstract: We isolated and sequenced a nuclear gene and cDNA encoding a bacteriophage T7-type RNA polymerase, NsRpoT-B, from Nicotiana sylvestris. The gene, NsRpoT-B, consists of 19 exons and 18 introns and encodes a polypeptide of 1020 amino acid residues. The predicted NsRpoT-B protein shows 71% amino acid identity with NsRpoT-A which is a mitochondrial protein. Quantitative RT-PCR revealed that steady-state NsRpoT-B mRNA accumulation is highest in the mature leaves and lowest in the cotyledons. Transient expression assays in protoplasts from N. sylvestris leaves demonstrated that the putative N-terminal transit peptide of NsRpoT-B encodes dual targeting signals directing the protein into mitochondria and plastids. This strongly suggests that NsRpoT-B functions as an RNA polymerase transcribing genes from two different plant organelle genomes. NsRpoT-B transcripts have two potential translation initiation codons. An in vitro translation assay indicated that a chimeric mRNA encoding the N-terminal NsRpoT-B fused to an sGFP produced two polypeptides translated from the first and second initiation codons. This implies that the dual targeting of NsRpoT-B protein is regulated, in part, at the level of translation. We have designated this protein NsRpoTpm.
Abstract: We have isolated and sequenced a nuclear gene and cDNA encoding bacteriophage T7-type single subunit RNA polymerase, NsRpoT-A, from Nicotiana sylvestris. NsRpoT-A consists of 19 exons and 18 introns; the first intron is 17 kb, the longest yet identified in a plant gene. Genomic Southern analysis indicated that N. sylvestris contains a small family of NsRpoT genes. Quantitative RT-PCR revealed that steady-state mRNA levels are highest in the leaves and lowest in the cotyledons. Phylogenetic analysis of NsRpoT-A and the RpoT proteins of other plant species suggested that NsRpoT-A is a mitochondrial protein. The TargetP program predicted localization of the NsRpoT-A gene product to the mitochondria. Using a transient expression assay and protoplasts from N. sylvestris mesophyll cells, we clearly demonstrated that the N-terminal sequence of NsRpoT-A targets the protein to the mitochondria. We therefore named this protein NsRpoTm.
Abstract: The effect of alteration of 5' and 3' flanking sequences on the transcription of plant tRNA genes was analysed using an RNA polymerase III-dependent in vitro transcription system derived from nuclei of cultured tobacco cells. A TATA-like sequence and the CAA motif frequently observed upstream of plant tRNA genes, and the poly(T) stretch usually present downstream, were shown to be necessary for efficient re-initiation of transcription. The CAA motif was shown to be a transcription initiation site. Introduction of the CAA and TATA-like motifs into a gene naturally lacking them greatly enhanced transcription by promoting efficient re-initiation.
Abstract: Cyanobacteria are prokaryotes that carry out plant-type photosynthesis and contain several eukaryotic-type RNA-binding proteins. Using a single-stranded DNA column, a 33-kDa protein was isolated and characterized from Synechococcus sp. PCC6301. This protein of 293 amino acids is similar in overall structure to the ribosomal protein S1 found in the same species, and contains three repeated units that are highly similar to the S1 motif originally found in the ribosomal protein S1 of Escherichia coli. However, the 33-kDa protein was found not to be associated with ribosomes and its nucleic acid binding specificity is distinct from that of the ribosomal protein S1. As this protein has high affinity for both single- and double-stranded DNA, as well as for poly(G) and poly(A), we tentatively named it nucleic acid-binding protein 1 (Nbp1).
Abstract: Tobacco chloroplasts possess five conserved ribonucleoproteins (cpRNPs). To elucidate the function of cpRNPs we analyzed their localization and target nucleic acid molecules in chloroplasts. Immunoprecipitation of the stromal extract and Northern analysis revealed that cpRNPs are associated in vivo with not only various species of chloroplast mRNAs but also intron-containing precursor (pre-) tRNAs. This observation strongly suggests that cpRNPs are involved in RNA processing, including mRNA stability and pre-tRNA splicing.
Abstract: We have isolated the 10Sa RNA (tmRNA) from the unicellular cyanobacterium Synechococcus sp. strain PCC6301. It comprises of 394 nucleotides (nt) and has 55% homology to Escherichia coli tmRNA. The cloning and sequencing of the corresponding gene have revealed that, like in many tRNA genes, the terminal CCA sequence reported in all the tmRNA species characterized so far is not encoded in the DNA. Hybridization analysis has shown that the tmRNA gene is present as a single copy. Fairly high levels of tmRNA accumulate throughout the cell cycle; however, a slight increase in its level is observed during late-log to stationary phase. This suggests that tmRNA is functional not only when cells divide actively but also when cell growth stops.
Abstract: The entire sequence (120-190 kb) of chloroplast genomes has been determined from a dozen plant species. The genome contains from 87 to 183 known genes, of which half encode components involved in translation. These include a complete set of rRNAs and about 30 tRNAs, which are likely to be sufficient to support translation in chloroplasts. RNA editing (mostly C to U base changes) occurs in some chloroplast transcripts, creating start and stop codons and changing codons to retain conserved amino acids. Many components that constitute the chloroplast translational machinery are similar to those of Escherichia coli, whereas only one third of the chloroplast mRNAs contain Shine-Dalgarno-like sequences at the correct positions. Analyses conducted in vivo and in vitro have revealed the existence of multiple mechanisms for translational initiation in chloroplasts.
Abstract: The plastid ATP synthase complex is composed of nine subunits, of which six are encoded in the plastome. The plastid-encoded genes are arranged in two transcriptional units: atpB/E and atpI/H/F/A. We have recently reported that besides containing four -10 and -35 consensus-type (CT) promoters, the atpB/E operon also contains a non-consensus type (NCII) promoter that alone is responsible for its expression in non-photosynthetic plastids. As the functionality of ATP synthase requires expression of all nine subunits, NCII promoter-driven transcription of the atpI/H/F/A operon is to be expected in non-photosynthetic plastids. Therefore, a detailed transcriptional analysis of this operon was carried out using RNA samples from tobacco leaf, cultured cells (BY-2) and seedlings grown on streptomycin and spectinomycin; which contain chloroplasts, translationally active non-photosynthetic plastids and translationally inactive plastids, respectively. We identified a total of three transcription initiation sites (TIS) and four transcript processing sites in the non-coding regions of this operon. Our results also demonstrate that rps2 is co-transcribed with the atpI/H/F/A genes. One of the TIS (-208 atpI) is characterized by an NCII type promoter, while other two primary transcripts (-131 atpI and -384 atpH) initiate from CT promoters. In non-photosynthetic plastids the atpI/H/F/A-specific transcript pool seems to be solely contributed by initiation at the -208 atpI (NCII type) promoter, because transcripts from CT promoters do not accumulate in these plastid types.
Abstract: The structure of a large gene cluster containing 22 ribosomal protein (r-protein) genes of the cyanobacterium Synechococcus sp. strain PCC6301 is presented. Based on DNA and protein sequence analyses, genes encoding r-proteins L3, L4, L23, L2, S19, L22, S3, L16, L29, S17, L14, L24, L5, S8, L6, L18, S5, L15, L36, S13, S11, L17, SecY, adenylate kinase (AK) and the alpha subunit of RNA polymerase were identified. The gene order is similar to that of the E. coli S10, spc and alpha operons. Unlike the corresponding E. coli operons, the genes for r-proteins S4, S10, S14 and L30 are not present in this cluster. The organization of Synechococcus r-protein genes also resembles that of chloroplast (cp) r-protein genes of red and brown algal species. This strongly supports the endosymbiotic theory that the cp genome evolved from an ancient photosynthetic bacterium.
Abstract: A nuclear extract derived from tobacco cultured BY-2 cells supports RNA polymerase III-dependent transcription of Arabidopsis tRNA(Ser) genes. Primer extension analysis indicated that the transcription starts at 6 bp upstream from the 5' end of tRNA coding region. Procedures for nuclear extraction and in vitro reaction conditions have been optimized for tRNA transcription, which allows direct detection of de novo synthesized tRNA by gel electrophoresis. This improved in vitro system yields a mature-sized tRNA of 85 nucleotides from the Arabidopsis tRNA(Ser) gene, indicating that efficient processing of the pre-tRNA also occurs in the tobacco nuclear extract.
Abstract: The small plastid RNA (spRNA) which includes a segment that is complementary to the pre-16S rRNA has been suggested to facilitate maturation of pre-16S rRNA in tobacco. To investigate the function of spRNA, the gene encoding it (sprA) was removed from the plastid genome using targeted gene deletion. We report here that deletion of sprA does not significantly affect pre-16S rRNA maturation, nor does it cause any obvious phenotype. Although the spRNA still may be involved in rRNA maturation, it is non-essential under normal growth conditions.
Abstract: We isolated a novel RNA species from the unicellular cyanobacterium Synechococcus PCC6301 and determined its gene sequence. This novel RNA was termed 6Sa RNA from its length (185 nt). Cross-hybridization of 6Sa RNA to other related microorganisms suggests that its existence is restricted to the Synechococcus genus or related organisms. A high level of accumulation of this RNA was observed by Northern analysis, indicating that 6Sa RNA is stable in cells. Computer-aided prediction of the 6Sa RNA secondary structure also supports its stability.
Abstract: In this study, a cDNA encoding a small RNA-binding protein was isolated from a Nicotiana sylvestris cDNA library. The predicted protein (RGP-3) is 144 amino acid residues long, and contains a consensus sequence-type RNA binding domain (CS-RBD) of 83 amino acids and a short glycine-rich region of 15 amino acids. RGP-3 synthesized in Escherichia coli has high affinity for poly(U). Immunocytochemical analysis indicated that RGP-3 is localized in the nucleoplasm, and that RGP-1b, a related protein reported previously, is localized in the nucleolus. Possible roles of these proteins in pre-mRNA or pre-rRNA processing are discussed.
Abstract: The complete nucleotide sequence of the chloroplast genome (150,613 bp) from the unicellular green alga Chlorella vulgaris C-27 has been determined. The genome contains no large inverted repeat and has one copy of rRNA gene cluster consisting of 16S, 23S, and 5S rRNA genes. It contains 31 tRNA genes, of which the tRNALeu(GAG) gene has not been found in land plant chloroplast DNAs analyzed so far. Sixty-nine protein genes and eight ORFs conserved with those found in land plant chloroplasts have also been found. The most striking is the existence of two adjacent genes homologous to bacterial genes involved in cell division, minD and minE, which are arranged in the same order in Escherichia coli. This finding suggests that the mechanism of chloroplast division is similar to bacterial division. Other than minD and minE homologues, genes encoding ribosomal proteins L5, L12, L19, and S9 (rpl5, rpl12, rpl19, and rps9); a chlorophyll biosynthesis Mg chelating subunit (chlI); and elongation factor EF-Tu (tufA), which have not been reported from land plant chloroplast DNAs, are present in this genome. However, many of the new chloroplast genes recently found in red and brown algae have not been found in C. vulgaris. Furthermore, this algal species possesses two long ORFs related to ycf1 and ycf2 that are exclusively found in land plants. These observations suggest that C. vulgaris is closer to land plants than to red and brown algae.
Abstract: A physical map of the unicellular cyanobacterium Synechococcus sp. strain PCC6301 genome has been constructed with restriction endonucleases PmeI, SwaI, and an intron-encoded endonuclease I-CeuI. The estimated size of the genome is 2.7 Mb. On the genome 49 genes or operons have been mapped. Two rRNA operons are separated by 600 kb and transcribed oppositely.
Abstract: We have previously characterized a tobacco cDNA encoding a novel type RNA-binding protein (RZ-1), which contains a zinc finger motif in addition to a consensus sequence-type RNA-binding domain and is localized in the nucleus. Here we isolated its genomic clone from a Nicotiana sylvestris genomic library. Southern blot analysis suggested that RZ-1 is coded for by a single locus per haploid genome. Comparison of the cDNA and genomic sequences indicated that the RZ-1 gene contains two introns, one in the coding region and another in the 3'-untranslated region. RT-PCR and ribonuclease protection analyses showed that splicing of RZ-1 pre-mRNA occurs efficiently. The RZ-1 protein is actively synthesized in rapidly dividing tobacco cells, as demonstrated by immunoblot analysis.
Abstract: A cDNA encoding a protein with a consensus sequence-type RNA-binding domain (CS-RBD) has been isolated from a Nicotiana sylvestris cDNA library. The deduced protein (designated 'RZ-1') contains CS-RBD in its N-terminal half, arginine/aspartic acid repeats in its center and a glycine-rich-C-terminal region in which a zinc finger motif of the CCHC type is present. The corresponding gene appears to be expressed constitutively in all tobacco organs. Immunocytochemical assays revealed that RZ-1 is localized in the nucleoplasm of tobacco cultured cells. Glycerol gradient fractionation of tobacco nuclear lysates showed that RZ-1 is associated with a large ribonucleoprotein particle of around 60 S in size. Nucleic acid-binding assays indicated that RZ-1 binds preferentially to poly (G) and both the CS-RBD and glycine-rich region are necessary for its binding activity. A possible role of RZ-1 is discussed.
Abstract: A 2.0-kbp Pst I DNA fragment of the unicellular cyanobacterium Synechococcus sp. strain PCC 6301 genome contains two open reading frames (ORFs). The first ORF of 100 codons potentially encodes a polypeptide having 47% amino acid identity to Escherichia coli ribosomal protein S14, suggesting it as a ribosomal protein S14 gene (rps14). The second ORF of 351 codons is located 81 bp downstream of rps14 and its deduced amino acid sequence is in part similar to that of the Salmonella typhimurium oligopeptide permease membrane protein OppC. Northern blot analysis showed that rps14 is expressed as a 0.48-kb transcript whereas no transcript was detected from ORF351. Pulsed-field electrophoresis and blot hybridization analysis revealed that rps14 is a single-copy gene and is found within a 165-kbp region located upstream of rrnA on the circular genome.
Abstract: Chloroplasts contain their own genetic system which has a number of prokaryotic as well as some eukaryotic features. Most chloroplast genes of higher plants are organized in clusters and are cotranscribed as polycistronic pre-RNAs which are generally processes into many shorter overlapping RNA species, each of which accumulates of steady-state RNA levels. This indicates that posttranscriptional RNA processing of primary transcripts is an important step in the control of chloroplast gene expression. Chloroplast RNA processing steps include RNA cleavage/trimming, RNA splicing, ENA editing and RNA stabilization. Several chloroplast genes are interrupted by introns and therefore require processing for gene function. In tobacco chloroplasts, 18 genes contain introns, six for tRNA genes and 12 for protein-encoding genes. A number of specific proteins and RNA factors are believed to be involved in splicing and maturation of pre-RNAs in chloroplasts. Processing enzymes and RNA-binding proteins which could be involved in posttranscriptional steps have been identified in the last several years. Our current knowledge of the regulation of gene expression in chloroplasts of higher plants is overviewed and further studies on this matter are also considered.
Abstract: Five chloroplast RNA-binding proteins with consensus sequence-type RNA-binding domains have been isolated from tobacco chloroplasts. Here we report three nuclear genes for similar chloroplast RNA-binding proteins (cp29, cp31 and cp33) from Arabidopsis thaliana. Each of the three genes consists of four exons and three introns and their exon/intron junctions were determined by sequencing respective cDNAs. In vitro import assays showed that all three proteins are located in chloroplasts. The three genes are singly-copy each and the transcription start sites were determined to be 80/82 bp (cp29) and 76/88 bp (cp31) upstream from the translational initiation codons. Northern blot analysis revealed that the three genes are transcribed both in leaves and roots, but the transcript level in leaves is higher than in roots. Phylogenetic analysis of chloroplast RNA-binding proteins so far identified shows that these proteins can be classified into three groups. Tobacco and Arabidopsis have these three types of proteins and structural features of each group are conserved between the two plants, suggesting that they are important for chloroplast functions. Interestingly, cp31 (238 amino acids) shares the identical amino acid sequence from the 30th to the last (238th) residues (including two RNA-binding domains) with the Arabidopsis nucleolin-like ribonucleoprotein, FMV3bp [11]. FMV3bp lacks a transit-peptide and must be located in the nucleus or the cytoplasm.
Abstract: Transgenic tobacco (Nicotiana tabacum L. cv SR1) with decreased activity of glutathione reductase exhibited enhanced sensitivity to paraquat in the light as evaluated by chlorophyll destruction and electrolyte leakage from leaf discs. This result indicates the involvement of glutathione reductase in the tolerance of plants to photooxidative stress caused by the herbicide.
Abstract: An in vitro transcription system derived from tobacco (Nicotiana tabacum) cultured cell (BY-2) nuclei supports transcription of the RNA polymerase I-dependent tobacco rRNA gene. The transcription initiation site determined in vitro was found at residue A which corresponds to that lying within the consensus sequence surrounding plant pre-rRNA initiation sites. The tobacco rRNA gene is actively transcribed in the tobacco nuclear extract while the broad bean rRNA gene is inactive in the heterologous system, indicating that plant RNA polymerase I-dependent transcription is species-specific.
Abstract: We isolated a 38 kDa ssDNA-binding protein from the unicellular cyanobacterium Synechococcus sp. strain PCC 6301 and determined its N-terminal amino acid sequence. A genomic clone encoding the 38 kDa protein was isolated by using a degenerate oligonucleotide probe based on the amino acid sequence. The nucleotide sequence and predicted amino acid sequence revealed that the 38 kDa protein is 306 amino acids long and homologous to the nuclear-encoded 370 amino acid chloroplast ribosomal protein CS1 of spinach (48% identity), therefore identifying it as ribosomal protein (r-protein) S1. Cyanobacterial and chloroplast S1 proteins differ in size from Escherichia coli r-protein S1 (557 amino acids). This provides an additional evidence that cyanobacteria are closely related to chloroplasts. The Synechococcus gene rps1 encoding S1 is located 1.1 kb downstream from psbB, which encodes the photosystem II P680 chlorophyll a apoprotein. An open reading frame encoding a potential protein of 168 amino acids is present between psbB and rps1 and its deduced amino acid sequence is similar to that of E. coli hypothetical 17.2 kDa protein. Northern blot analysis showed that rps1 is transcribed as a monocistronic mRNA.
Abstract: A part of the tRNA(Leu)(UAA) gene containing a 240-nucleotide group I intron was amplified by PCR from cyanobacterium Synechococcus PCC 6301 genomic DNA. The pre-tRNA synthesized from the cloned PCR product was efficiently self-spliced in vitro under physiological conditions. The gene encoding the tRNA(Leu)(UAA), trnL-UAA, was isolated from a Synechococcus PCC 6301 genomic library and the nucleotide sequence of a 2,167-bp portion was determined. The trnL-UAA consists of a 34-bp 5' exon, a 240-bp group I intron and a 50-bp 3' exon. In addition, three open reading frames (ORF1, ORF2 and ORF3) were found in the 5' and 3' flanking regions of trnL-UAA. The predicted protein sequence of ORF3, which is located 74-bp upstream from trnL-UAA on the opposite strand, shows 66.2% amino acid identity to that of the Synechocystis PCC 6803 gene encoding subunit L of NADH dehydrogenase (ndhL).
Abstract: The human P68 protein is an ATP-dependent RNA helicase and thought to be involved in cell growth and division. We have isolated a Nicotiana sylvestris cDNA which encodes a p68-like protein. Northern blot analysis showed that the transcript from the gene is accumulated in N. sylvestris leaves, roots and flowers, but not in N. tabacum-cultured cells.
Abstract: A group of proteins containing a conserved ribonucleoprotein consensus sequence (RNP-CS)-type RNA-binding domain (CS-RBD) of approximately 80 amino acids is present in eukaryotic cells and binds specifically to a wide variety of RNA molecules. We have isolated 12 kDa single-stranded DNA binding proteins from the unicellular cyanobacterium Synechococcus 6301. The amino-terminal sequence was determined and two distinct genomic clones were isolated from a Synechococcus 6301 genomic library. Sequence analysis revealed that two closely related proteins contain a single CS-RBD of 82 amino acids and are named as 12RNP1 and 12RNP2. Both of the CS-RBDs share the highest amino acid identity with those of chloroplast ribonucleoproteins (40-51%). The 12RNP proteins were expressed in Escherichia coli bearing plasmids encoding glutathione S-transferase/12RNP fusion proteins and subjected to in vitro nucleic acid-binding assay. Both 12RNP1 and 12RNP2 bind to RNA homopolymers poly(U) and poly(G), indicating that they might be RNA-binding proteins. This is the first example of such proteins in prokaryotes. The 12RNP1 and 12RNP2 genes are transcribed as monocistronic mRNAs and the steady-state mRNA level of 12RNP1 is over 20-fold than that of 12RNP2. Due to the easiness of genetic manipulations the cyanobacterium will provide an excellent system to analyze the function of not only cyanobacterial but also plant RNA-binding proteins.
Abstract: A cDNA encoding an RNA-binding protein (ribonucleoprotein or RNP) was isolated from a tobacco (Nicotiana sylvestris) cDNA library. The predicted protein (termed RGP-2) is 259 amino acids in length and consists of an N-terminal sequence of 39 amino acids, a consensus sequence type RNA-binding domain of 82 amino acids, a glycine-rich domain of 83 amino acids and an acidic C-terminal domain of 46 amino acids. It is distinct from the RGP-1 proteins previously reported, which consist of an RNA-binding domain in the N-terminal half and a glycine-rich domain in the C-terminal half. Homology searches revealed that RGP-2 is a novel consensus sequence-type RNA-binding protein. Its RNA-binding domain is structurally related to those of some chloroplast RNPs, while the amino acid composition of its glycine-rich domain (rich in glycine and asparagine) is similar to those in animal heterogeneous nuclear RNPs (hnRNP) A1 and A2/B1. The RGP-2 gene seems to be a single-copy gene, and its transcripts accumulate mainly in cultured cells and roots. A nucleic acid-binding assay using RGP-2 protein synthesized in vitro confirmed that it is an RNA-binding protein. Based on its greater affinity for total tobacco RNA than for poly(G) and poly(U), RGP-2 is suggested to bind to specific RNA sequences, probably G/U-rich regions. Quantitative analysis of the nucleic acid-binding properties of RGP-2 and RGP-1b indicates that they bind differently to nucleic acids. A possible role for RGP-2 is discussed in relation to known functions of animal hnRNP proteins.
Abstract: We have isolated two nuclear genes, tufA and tufB, encoding chloroplast EF-Tu from a tobacco (Nicotiana sylvestris) genomic library. The tufA gene encodes a polypeptide of 478 amino-acid residues, consisting of a putative transit peptide of 70 residues and a mature EF-TuA of 408 residues. The tufB gene codes for a precursor proteins of 485 residues, containing a transit peptide of 77 residues and a mature EF-TuB of 408 residues. No introns were found in either gene. The sequence similarity within the coding regions of the two genes is 84.3% for nucleotides and 89.7% for amino acids. Multiple 5' ends of transcripts were observed for both tuf genes. Northern analysis revealed that the EF-Tu mRNA accumulated at least 30-fold more in leaf than in root tissue. Ribonuclease protection assays using gene-specific probes showed that the level of tufB mRNA is three-fold higher than that of tufA mRNA in leaves but in roots the tufB mRNA levels is less than half that of tufA mRNA. The relative amount of tufB mRNA is 30-fold higher in leaves than in roots whereas tufA messages are only five-fold higher in leaves. These data suggest that expression of both tuf genes is differentially regulated according to tissue and plastid type.
Abstract: The nucleotide sequence of a 27,588-bp region of rice mitochondrial DNA was determined. This sequence contains putative genes that encode initiator methionine tRNA (trnfM), subunits III (nad3) and IV (nad4) of the NADH dehydrogenase complex, and ribosomal proteins S3 (rps3), S12 (rps12) and L16 (rpl16). An open reading frame that contains sequences homologous to parts of rps2 and atpA is also present. In addition to these regions, there are many short sequences with homology to fragments of mitochondrial DNAs from rice or other plants. These sequences may be remnants of multiple rearrangements of the genome and their presence seems to explain, in part, the large sizes of the mitochondrial genomes of higher plants.
Abstract: Three cDNAs encoding RNA-binding proteins were isolated from a tobacco (Nicotiana sylvestris) cDNA library. The predicted proteins (RGP-1) are homologous to each other and consist of a consensus-sequence type RNA-binding domain of 80 amino acids in the N-terminal half and a glycine-rich domain of 61-78 amino acids in the C-terminal half. Nucleic acid-binding assay using the in vitro synthesized RGP-1 protein confirmed that it is an RNA-binding protein. Based on its strong affinity for poly(G) and poly(U), the RGP-1 proteins are suggested to bind specifically to G and/or U rich sequences. All three genes are expressed in leaves, roots, flowers and cultured cells, however, the substantial amount of pre-mRNAs are accumulated especially in roots. Sequence analysis and ribonuclease protection assay indicated that significant amounts of alternatively spliced mRNAs, which are produced by differential selection of 5' splice sites, are also present in various tissues. Tissue-specific alternative splicing was found in two of the three genes. The alternatively spliced mRNAs are also detected in polysomal fractions and are suggested to produce truncated polypeptides. A possible role of this alternative splicing is discussed.
Abstract: Tobacco chloroplasts contain a family of ribonucleoproteins (RNPs) which appear to be involved in mRNA processing and splicing in chloroplasts. We have characterized a new cDNA, 33k-6, potentially encoding a tobacco 33 kDa chloroplast RNP (cp33) homologue. This cDNA has a 78 bp insertion near the 3' end with respect to previously characterized cp33 cDNAs, leading to the creation of an alternative C-terminal sequence. The cp33 protein is encoded by a single-copy nuclear gene in Nicotiana sylvestris, which contains three introns. No typical TATA box is present in the upstream region of the gene. Multiple transcription start sites are often observed for promoters lacking TATA boxes, and have been suggested in the cp33 gene. Sequence comparison revealed that the 78 bp insertion in 33k-6 is derived from the third intron of the cp33 gene which is not removed during pre-mRNA splicing. Ribonuclease protection analysis showed that the processing of the third intron is slow compared to the other introns. A possible role for the partially spliced mRNA (cp33k-6) is discussed.
Abstract: We have purified a chloroplast elongation factor Tu (EF-Tu) from tobacco (Nicotiana tabacum) and determined its N-terminal amino acid sequence. Two distinct cDNAs encoding EF-Tu were isolated from a leaf cDNA library of N. sylvestris (the female progenitor of N. tabacum) using an oligonucleotide probe based on the EF-Tu protein sequence. The cDNA sequence and genomic Southern analyses revealed that tobacco chloroplast EF-Tu is encoded by two distinct genes in the nuclear genome of N. sylvestris. We designated the corresponding gene products EF-Tu A and B. The mature polypeptides of EF-Tu A and B are 408 amino acids long and share 95.3% amino acid identity. They show 75-78% amino acid identity with cyanobacterial and chloroplast-encoded EF-Tu species.
Abstract: Tobacco chloroplast ribosomal protein L12 was isolated as a ssDNA-cellulose-binding protein from a chloroplast soluble protein fraction. Based on the N-terminal amino acid sequence of chloroplast L12, a cDNA clone was isolated and characterized. The precursor protein deduced from the DNA sequence consists of a transient peptide of 53 amino acid residues and a mature L12 protein of 133 amino acid residues. The chloroplast L12 protein was synthesized with a reticulocyte lysate and subjected to nucleic acid-binding assays. L12 synthesized in vitro does not bind to ssDNA, dsDNA nor ribonucleotide homopolymers, but it binds to cellulose matrix.
Abstract: We have previously identified three chloroplast ribonucleoproteins and characterized their cDNAs. Here we present the genomic organization, sequence and expression of one of their genes. The 31 kd ribonucleoprotein (cp31) from tobacco (Nicotiana sylvestris) chloroplasts is coded for by a single-copy nuclear gene. This gene was isolated and its sequence was determined. The gene contains four exons and three introns. The position of its first intron is conserved among the genes for the maize abscisic acid-induced glycine-rich protein, the human hnRNP A1 protein and cp31. The transcription start site was determined to be 168 bp upstream from the translational initiation codon in both leaf and root tissues. No alternatively spliced transcripts was detected, suggesting that a diversity of chloroplast ribonucleoproteins is generated probably by gene amplification rather than alternative splicing.
Abstract: The nucleotide sequences of tRNA(Asn) (GUU) and tRNA(Tyr) (GUA) genes from tomato mitochondria and their flanking regions have been determined. The tomato mitochondrial tRNA(Asn) gene is located 2.1 kb downstream from the tRNA(Cys) gene reported previously (Izuchi and Sugita 1989) and shows a nearly complete identity with the corresponding chloroplast gene. The tRNA(Tyr) gene, which shows only 73% homology with the corresponding chloroplast gene, has to be considered a "native" mitochondrial tRNA gene and is 535 bp from the "chloroplast-like" tRNA(Asn) gene on the same strand. Northern hybridization analysis revealed that the three tRNA genes are transcribed in tomato mitochondria. Southern hybridization analysis of tomato, sugar beet. rice and wheat mitochondrial DNAs, with oligonucleotide probes for mitochondrial or chloroplast tRNA genes, demonstrated that the mitochondrial tRNA(Cys) gene found in tomato is present in dicot plants but not in monocots. On the other hand, a chloroplast-like tRNA(Cys) gene exists in monocot plants.
Abstract: The tomato gene family for the small subunit of ribulose-1,5-bisphosphate carboxylase/oxygenase [3-phospho-D-glycerate carboxy-lyase (dimerizing); EC 4.1.1.39] has five genes, designated Rbcs-1, -2, -3A, -3B, and -3C. We have measured the steady-state mRNA levels for each of the five genes in various tomato organs using gene-specific oligonucleotides. All five genes are highly expressed in leaves, and transcripts of two genes, Rbcs-3B and Rbcs-3C, account for approximately equal to 60% of the total leaf transcripts. The relative transcript levels in the stem, in nature fruits, and etiolated seedlings (plants germinated and grown in the dark) correspond to 3.2%, 6.5%, and 4.6%, respectively, of the ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit mRNA level in leaves, and no transcripts have been detected in roots and ripe tomato fruits. Only Rbcs-1 and Rbcs-2 are expressed during the photosynthetically active phase of fruit development. Transcripts from these genes and from the Rbcs-3A locus are also present in etiolated seedlings. Rbcs-3B and Rbcs-3C transcripts, which are the most abundant mRNAs of the ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit gene family in the leaf, are undetectable in dark-grown seedlings and immature fruit. The selective expression of Rbcs-1 and Rbcs-2 in the dark and in the pericarp of green fruit and the induction and rapid mRNA accumulation for Rbcs-3B and Rbcs-3C after illumination may reflect different regulatory mechanism(s) that control the expression of individual members in the tomato ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit gene family.
Abstract: We have cloned and sequenced all five members of the gene family for the small subunit (rbcS) of ribulose-1,5-bisphosphate carboxylase/oxygenase from tomato, Lycopersicon esculentum cv. VFNT LA 1221 cherry line. Two of the five genes, designated Rbcs-1 and Rbcs-2, are present as single genes at individual loci. Three genes, designated Rbcs-3A, Rbcs-3B and Rbcs-3C, are organized in a tandem array within 10 kb at a third independent locus. The Rbcs-2 gene contains three introns; all the other members of the tomato gene family contain two introns. The coding sequence of Rbcs-1 differs by 14.0% from that of Rbcs-2 and by 13.3% from that of Rbcs-3 genes. Rbcs-2 shows 10.4% divergence from Rbcs-3. The exon and intron sequences of Rbcs-3A are identical to those of Rbcs-3C, and differ by 1.9% from those of Rbcs-3B. Nucleotide sequence analysis suggests that the five rbcS genes encode four different precursors, and three different mature polypeptides. S1 nuclease mapping of the 5' end of rbcS mRNAs revealed that the mRNA leader sequences vary in length from 8 to 75 nucleotides. Northern analysis using gene-specific oligonucleotide probes from the 3' non-coding region of each gene reveals a four to five-fold difference among the five genes in maximal steady-state mRNA levels in leaves.
Abstract: The complete nucleotide sequence (155 844 bp) of tobacco (Nicotiana tabacum var. Bright Yellow 4) chloroplast DNA has been determined. It contains two copies of an identical 25 339 bp inverted repeat, which are separated by a 86 684 bp and a 18 482 bp single-copy region. The genes for 4 different rRNAs, 30 different tRNAs, 39 different proteins and 11 other predicted protein coding genes have been located. Among them, 15 genes contain introns. Blot hybridization revealed that all rRNA and tRNA genes and 27 protein genes so far analysed are transcribed in the chloroplast and that primary transcripts of the split genes hitherto examined are spliced. Five sequences coding for proteins homologous to components of the respiratory-chain NADH dehydrogenase from human mitochondria have been found. The 30 tRNAs predicted from their genes are sufficient to read all codons if the ;two out of three' and ;U:N wobble' mechanisms operate in the chloroplast. Two sequences which autonomously replicate in yeast have also been mapped. The sequence and expression analyses indicate both prokaryotic and eukaryotic features of the chloroplast genes.
Abstract: The nucleotide sequence of a tobacco (Nicotiana tabacum) chloroplast gene cluster that encodes eight proteins homologous to Escherichia coli ribosomal proteins L23, L2, S19, L22, S3, L16, L14, and S8 has been determined. RNA gel blot hybridization revealed that all eight coding regions are expressed in the chloroplasts. The arrangement of the eight genes resembles that found in the E. coli S10 and spc operons. Among the eight genes, the L2 and L16 genes contain 666- and 1020-base-pair introns, respectively. These intron boundary sequences are consistent with the conserved boundary sequences of the chloroplast group III introns [Shinozaki, K., Deno, H., Sugita, M., Kuramitsu, S. & Sugiura, M. (1986) Mol. Gen. Genet. 202, 1-5].
Abstract: The nucleotide sequence of a tRNA(Lys)(UUU) gene on tobacco (Nicotiana tabacum) chloroplast DNA has been determined. This gene is located 215 base pairs upstream from the gene for the 32,000-dalton thylakoid membrane protein on the same DNA strand and has a 2526-base-pair intron in the anticodon loop. The intron boundary sequence does not follow the G-U/A-G rule but is similar to those of tobacco chloroplast split genes for tRNA(Gly)(UCC) and ribosomal proteins L2 and S12. The intron contains one major open reading frame of 509 codons. The codon usage in the open reading frame resembles those observed in the genes for tobacco chloroplast proteins so far analyzed. The primary transcript of this tRNA gene is 2.7 kilobases long.
Abstract: A partial sequence of a cloned 3.2 Md BamHI fragment from tobacco chloroplast DNA revealed the occurrence of a putative gene for ribosomal protein. The putative gene is located on the left margin of the large single-copy region in the chloroplast DNA. The coding region contains 276 bp (92 codons). The amino acid sequence deduced from the DNA sequence shows 55% homology with that of E. coli S19 (91 amino acid residues).
Abstract: Chromatin-bound poly(adenosine 5'-diphosphate [ADP]-ribose) synthetase activity was highest in ungerminated wheat (Triticum aestivum L., cv. Mukakomugi) embryos, and it decreased after germination within 14 hours. In contrast, transcriptional activity was lowest in ungerminated wheat embryos, and it increased during germination for 24 hours or more. Histones, H1, H2A/H2B, basic nonhistone chromosomal proteins, and acidic nonhistone chromosomal proteins (molecular weight more than 10 kilodaltons) were ADP-ribosylated in wheat germ chromatin. Specific nonhistone chromosomal protein (molecular weight of 37 kilodaltons) in seedling chromatin was not found to be ADP-ribosylated.
Abstract: The template activity of chromatin from winter wheat embryos gradually increased during germination and was regulated with some nonhistone proteins different from the two major ones, molecular weight 39k and 59k polypeptides, previously reported.To clarify chromosomal proteins which are involved in regulation of template activity of chromatin, we studied the quantitative and qualitative changes in chromosomal proteins. Differences in acid-soluble and acid-insoluble proteins between chromatins from wheat germ and embryos germinated for various times were visualized by sodium dodecyl sulfate-polyacrylamide slab gel electrophoresis.Nonhistone proteins of 39k, 41k, and 50k molecular weights were specifically present in wheat germ and in 24- or 48-hour germinated wheat embryos, thereafter greatly reduced or finally disappeared. In contrast, nonhistone protein of 37k was absent in germ and in embryos germinated for 24 hours and appeared after 48 hours of germination. Thereafter it was present in abundant amounts in 96-hour germinated winter wheat embryos and in 72-hour germinated spring embryos, corresponding to 7 and 10% of total nonhistone proteins, respectively. Histone H1, especially H1d, was slightly reduced after 48-hour germination, as much as basic nonhistone proteins having electrophoretic mobilities between H1d and H2B. Further-more, similarity and diversity of chromosomal proteins between spring and winter wheat embryos are shown in this study. A subspecies of histone H1c of spring wheat had faster electrophoretic mobility than that of winter wheat.
Abstract: To clarify how the transcriptionally inactive chromatin of dormant wheat seed embryos becomes active during germination, we studied two kinds of chromatin-associated proteins: histones and nonhistone proteins found in wheat germ.Two major nonhistone proteins were solubilized from purified germ chromatin with 5 molar urea, and were separated from histones and chromosomal RNA by BioRex-70 resin and diethylaminoethyl-cellulose chromatography, respectively. We found that purified 5 molar urea-soluble nonhistone proteins, including the two major nonhistone proteins, had no effect on the transcription of native wheat DNA.Two kinds of chromatins were reconstituted by gradient dialysis from a mixture of DNA, germ histones, and nonhistone proteins derived from germs or germinated seedlings. Reconstituted chromatins had a 1.4- to 1.6-fold higher protein content than DNA content, protein components similar to native chromatin, and had only about 17 to 25% and 37% the transcriptional activities of native seedling and germ chromatins, respectively. Using the transcriptional activity of chromatin reconstituted from DNA and histones alone as a standard, one kind of reconstituted chromatin containing nonhistone proteins of germ was only about one-fourth as active as the standard. Another with nonhistone proteins from seedlings was about 1.3 to 1.5 times more active than the standard.The ratio of histones to DNA content is approximately 1.3 during germination, but the proportion of histone H1 to the total histones is reduced.