hosted by
publicationslist.org
    

Mitali Mukerji

Institute of Genomics and Integrative Biology, Council of Scientific and Industrial Research, Mall Road, Delhi, India
mitali@igib.res.in

Journal articles

2013
Desiree C Petersen, Ondrej Libiger, Elizabeth A Tindall, Rae-Anne Hardie, Linda I Hannick, Richard H Glashoff, Mitali Mukerji, Pedro Fernandez, Wilfrid Haacke, Nicholas J Schork, Vanessa M Hayes (2013)  Complex patterns of genomic admixture within southern Africa.   PLoS genetics 9: 3. 03  
Abstract: Within-population genetic diversity is greatest within Africa, while between-population genetic diversity is directly proportional to geographic distance. The most divergent contemporary human populations include the click-speaking forager peoples of southern Africa, broadly defined as Khoesan. Both intra- (Bantu expansion) and inter-continental migration (European-driven colonization) have resulted in complex patterns of admixture between ancient geographically isolated Khoesan and more recently diverged populations. Using gender-specific analysis and almost 1 million autosomal markers, we determine the significance of estimated ancestral contributions that have shaped five contemporary southern African populations in a cohort of 103 individuals. Limited by lack of available data for homogenous Khoesan representation, we identify the Ju/'hoan (n = 19) as a distinct early diverging human lineage with little to no significant non-Khoesan contribution. In contrast to the Ju/'hoan, we identify ancient signatures of Khoesan and Bantu unions resulting in significant Khoesan- and Bantu-derived contributions to the Southern Bantu amaXhosa (n = 15) and Khoesan !Xun (n = 14), respectively. Our data further suggests that contemporary !Xun represent distinct Khoesan prehistories. Khoesan assimilation with European settlement at the most southern tip of Africa resulted in significant ancestral Khoesan contributions to the Coloured (n = 25) and Baster (n = 30) populations. The latter populations were further impacted by 170 years of East Indian slave trade and intra-continental migrations resulting in a complex pattern of genetic variation (admixture). The populations of southern Africa provide a unique opportunity to investigate the genomic variability from some of the oldest human lineages to the implications of complex admixture patterns including ancient and recently diverged human lineages.
Notes:
Amit K Mandal, Rajesh Pandey, Vineet Jha, Mitali Mukerji (2013)  Transcriptome-wide expansion of non-coding regulatory switches: evidence from co-occurrence of Alu exonization, antisense and editing.   Nucleic acids research 41: 4. 2121-2137 Feb  
Abstract: Non-coding RNAs from transposable elements of human genome are gaining prominence in modulating transcriptome dynamics. Alu elements, as exonized, edited and antisense components within same transcripts could create novel regulatory switches in response to different transcriptional cues. We provide the first evidence for co-occurrences of these events at transcriptome-wide scale through integrative analysis of data sets across diverse experimental platforms and tissues. This involved the following: (i) positional anchoring of Alu exonization events in the UTRs and CDS of 4663 transcript isoforms from RefSeq mRNAs and (ii) mapping on to them A→I editing events inferred from ∼7 million ESTs from dbEST and antisense transcripts identified from virtual serial analysis of gene expression tags represented in Cancer Genome Anatomy Project next-generation sequencing data sets across 20 tissues. We observed significant enrichment of these events in the 3'UTR as well as positional preference within the embedded Alus. More than 300 genes had co-occurrence of all these events at the exon level and were significantly enriched in apoptosis and lysosomal processes. Further, we demonstrate functional evidence of such dynamic interactions between Alu-mediated events in a time series data from Integrated Personal Omics Profiling during recovery from a viral infection. Such 'single transcript-multiple fate' opportunity facilitated by Alu elements may modulate transcriptional response, especially during stress.
Notes:
2012
Tanveer Ahmad, Manish Kumar, Ulaganathan Mabalirajan, Bijay Pattnaik, Shilpi Aggarwal, Ranjana Singh, Suchita Singh, Mitali Mukerji, Balaram Ghosh, Anurag Agrawal (2012)  Hypoxia response in asthma: differential modulation on inflammation and epithelial injury.   American journal of respiratory cell and molecular biology 47: 1. 1-10 Jul  
Abstract: Oxygen-sensing prolyl-hydroxylase (PHD)-2 negatively regulates hypoxia-inducible factor (HIF)1-α and suppresses the hypoxic response. Hypoxia signaling is thought to be proinflammatory but also attenuates cellular injury and apoptosis. Although increased hypoxic response has been noted in asthma, its functional relevance is unknown. The objectives of this study were to dissect the mechanisms and role of the hypoxic response in asthma pathophysiology. Experimental studies were conducted in mice using acute and chronic allergic models of asthma. The hypoxic response in allergically inflamed lungs was modulated by using pharmacologic PHD inhibitors (ethyl-3-4-dihydroxybenzoic acid [DHB], 1-10 mg/kg) or siRNA-mediated genetic knockdowns. Increased hypoxia response led to exacerbation of the asthma phenotype, with HIF-1α knockdown being beneficial. Chronically inflamed lungs from mice treated with 10 mg/kg DHB showed diffuse up-regulation of the hypoxia response, severe airway remodeling, and inflammation. Fatal asphyxiation during methacholine challenge was noted. However, bronchial epithelium restricted up-regulation of the hypoxia response seen with low-dose DHB (1 mg/kg) reduced epithelial injury and attenuated the asthmatic phenotype. Up-regulation of the hypoxia response was associated with increased expression of CX3CR1, a lymphocyte survival factor, and increased inflammatory cell infiltrate. This study shows that an exaggerated hypoxia response may contribute to airway inflammation, remodeling, and the development of asthma. However, the hypoxia response may also be protective of epithelial apoptosis at lower levels, and the net effects of modulating the hypoxia response may vary based on the context.
Notes:
Swarup K Parida, Mitali Mukerji, Ashok K Singh, Nagendra K Singh, Trilochan Mohapatra (2012)  SNPs in stress-responsive rice genes: validation, genotyping, functional relevance and population structure.   BMC genomics 13: 08  
Abstract: Single nucleotide polymorphism (SNP) validation and large-scale genotyping are required to maximize the use of DNA sequence variation and determine the functional relevance of candidate genes for complex stress tolerance traits through genetic association in rice. We used the bead array platform-based Illumina GoldenGate assay to validate and genotype SNPs in a select set of stress-responsive genes to understand their functional relevance and study the population structure in rice.
Notes:
Shalini Datta, Abhijit Chowdhury, Malay Ghosh, Kaushik Das, Pankaj Jha, Roshan Colah, Mitali Mukerji, Partha P Majumder (2012)  A genome-wide search for non-UGT1A1 markers associated with unconjugated bilirubin level reveals significant association with a polymorphic marker near a gene of the nucleoporin family.   Ann Hum Genet 76: 1. 33-41 Jan  
Abstract: Variants in the UGT1A1 gene and its promoter are known to determine levels of unconjugated bilirubin (UCB), but do not explain all cases of unconjugated hyperbilirubinemia. To discover associations with variants in genes other than UGT1A1, we undertook a genome-wide association study. We recruited 200 participants to cover the entire range of quantitative variation in UCB level. The data set -- after data curation, including analyses for population stratification and cryptic relatedness -- comprised genotypes at 512,349 SNP loci on 182 individuals. Quantitative trait locus (QTL) association analyses were performed, after adjusting the UCB level for effects of age, gender, and genotype at the dinucleotide (TA) insertion locus in UGT1A1 that is known to significantly modulate UCB level. A significant association of a polymorphic marker (rs2328136) near the NUP153 gene (which produces a 153 kDa nucleoporin) was obtained (p = 0.002, after multiple-testing correction). The frequency of the variant allele (A) at the rs2328136 locus in our study population is 40%, higher than most global populations. NUP153, whose product is a major regulatory factor in bidirectional transport of biomolecules across nucleus to cytosol, is associated with the transport of biliverdin reductase, which is important for bilirubin conjugation.
Notes:
Pankaj Jha, Swapnil Sinha, Kanika Kanchan, Tabish Qidwai, Ankita Narang, Prashant Kumar Singh, Sudhanshu S Pati, Sanjib Mohanty, Saroj K Mishra, Surya K Sharma, Shally Awasthi, Vimala Venkatesh, Sanjeev Jain, Analabha Basu, Shuhua Xu, Mitali Mukerji, Saman Habib (2012)  Deletion of the APOBEC3B gene strongly impacts susceptibility to falciparum malaria.   Infection, genetics and evolution : journal of molecular epidemiology and evolutionary genetics in infectious diseases 12: 1. 142-148 Jan  
Abstract: APOBEC3B, a gene involved in innate response, exhibits insertion-deletion polymorphism across world populations. We observed the insertion allele to be nearly fixed in malaria endemic regions of sub-Saharan Africa as well as populations with high malaria incidence in the past. This prompted us to investigate the possible association of the polymorphism with falciparum malaria. We studied the distribution of APOBEC3B, in 25 diverse Indian populations comprising of 500 samples and 176 severe or non-severe Plasmodium falciparum patients and 174 ethnically-matched uninfected individuals from a P. falciparum endemic and a non-endemic region of India. The deletion frequencies ranged from 0% to 43% in the Indian populations. The frequency of the insertion allele strikingly correlated with the endemicity map of P. falciparum malaria in India. A strong association of the deletion allele with susceptibility to falciparum malaria in the endemic region (non-severe vs. control, Odds ratio=4.96, P value=9.5E(-06); severe vs. control, OR=4.36, P value=5.76E(-05)) was observed. Although the frequency of deletion allele was higher in the non-endemic region, there was a significant association of the homozygous deletion genotype with malaria (OR=3.17, 95% CI=1.10-10.32, P value=0.0177). Our study also presents a case for malaria as a positive selection force for the APOBEC3B insertion and suggests a major role for this gene in innate immunity against malaria.
Notes:
2011
Meena Lall, Seema Thakur, Ratna Puri, Ishwar Verma, Mithali Mukerji, Pankaj Jha (2011)  A 54 Mb 11qter duplication and 0.9 Mb 1q44 deletion in a child with laryngomalacia and agenesis of corpus callosum.   Molecular cytogenetics 4: 09  
Abstract: Partial Trisomy 11q syndrome (or Duplication 11q) has defined clinical features and is documented as a rare syndrome by National Organization of Rare Disorders (NORD). Deletion 1q44 (or Monosomy 1q44) is a well-defined syndrome, but there is controversy about the genes lying in 1q44 region, responsible for agenesis of the corpus callosum. We report a female child with the rare Partial Trisomy 11q syndrome and Deletion 1q44 syndrome. The genomic imbalance in the proband was used for molecular characterization of the critical genes in 1q44 region for agenesis of corpus callosum. Some genes in 11q14q25 may be responsible for laryngomalacia.
Notes:
Ankita Narang, Pankaj Jha, Vimal Rawat, Arijit Mukhopadhayay, Debasis Dash, Analabha Basu, Mitali Mukerji (2011)  Recent admixture in an Indian population of african ancestry.   Am J Hum Genet 89: 1. 111-120 Jul  
Abstract: Identification and study of genetic variation in recently admixed populations not only provides insight into historical population events but also is a powerful approach for mapping disease loci. We studied a population (OG-W-IP) that is of African-Indian origin and has resided in the western part of India for 500 years; members of this population are believed to be descendants of the Bantu-speaking population of Africa. We have carried out this study by using a set of 18,534 autosomal markers common between Indian, CEPH-HGDP, and HapMap populations. Principal-components analysis clearly revealed that the African-Indian population derives its ancestry from Bantu-speaking west-African as well as Indo-European-speaking north and northwest Indian population(s). STRUCTURE and ADMIXTURE analyses show that, overall, the OG-W-IPs derive 58.7% of their genomic ancestry from their African past and have very little inter-individual ancestry variation (8.4%). The extent of linkage disequilibrium also reveals that the admixture event has been recent. Functional annotation of genes encompassing the ancestry-informative markers that are closer in allele frequency to the Indian ancestral population revealed significant enrichment of biological processes, such as ion-channel activity, and cadherins. We briefly examine the implications of determining the genetic diversity of this population, which could provide opportunities for studies involving admixture mapping.
Notes:
Rajesh Pandey, Amit K Mandal, Vineet Jha, Mitali Mukerji (2011)  Heat shock factor binding in Alu repeats expands its involvement in stress through an antisense mechanism.   Genome biology 12: 11. 11  
Abstract: Alu RNAs are present at elevated levels in stress conditions and, consequently, Alu repeats are increasingly being associated with the physiological stress response. Alu repeats are known to harbor transcription factor binding sites that modulate RNA pol II transcription and Alu RNAs act as transcriptional co-repressors through pol II binding in the promoter regions of heat shock responsive genes. An observation of a putative heat shock factor (HSF) binding site in Alu led us to explore whether, through HSF binding, these elements could further contribute to the heat shock response repertoire.
Notes:
Abhimanyu, Irengbam Rocky Mangangcha, Pankaj Jha, Komal Arora, Mitali Mukerji, Jayant Nagesh Banavaliker, Vani Brahmachari, Mridula Bose (2011)  Differential serum cytokine levels are associated with cytokine gene polymorphisms in north Indians with active pulmonary tuberculosis.   Infect Genet Evol 11: 5. 1015-1022 Jul  
Abstract: Globally only 5-10% of people encountering Mycobacterium tuberculosis have a lifetime risk of active disease indicating a strong host genetic bias towards development of tuberculosis. In the current study we investigated genotype variants pertaining to five cytokine genes namely IFNG, TNFA, IL4, IL10 and IL12 in the north Indian population with active pulmonary tuberculosis (APTB) and correlated the serum cytokine levels with the corresponding genotypes. Twenty five single nucleotide polymorphisms (SNPs) including six loci examined for the first time in tuberculosis were selected for genotyping in 108 patients with APTB from north India and 48 healthy regional controls (HC). Applying exclusion criteria 12 SNPs passed all the filters and were analysed further. The serum cytokine concentrations were measured by ELISA. Compared to HC mean serum IFN-γ, IL-12, IL-4, and IL-10 levels were higher in APTB (p = 0.3661, p = 0.0186, p = 0.003, p = 0.7, respectively). In contrast the mean serum TNF-α level was higher in HC (p = 0.007). Comparison of genotypes and serum levels of the corresponding cytokine genes reveal that though IFN-γ and IL-4 levels were higher in APTB the genotype variants showed no difference between HC and APTB. In contrast the genotypes of the selected rsIDs in the TNFA, IL12 and IL10 genes showed significant association with the varying serum levels of corresponding cytokines. The variant of the TNFA gene at rs3093662, the IL12 gene at rs3213094 and rs3212220 and the IL10 gene at rs3024498 did show a strong indication to be of relevance to the immunity to tuberculosis. To our knowledge this is the first report from this region relating genotypes and serum cytokine levels in north Indian population.
Notes:
M Gupta, H Kaur, A Jajodia, S Jain, K Satyamoorthy, M Mukerji, J Thirthalli, R Kukreti (2011)  Diverse facets of COMT: from a plausible predictive marker to a potential drug target for schizophrenia.   Current molecular medicine 11: 9. 732-743 Dec  
Abstract: Dopaminergic system in the prefrontal cortex (PFC) is known to regulate the cognitive functions. Catechol-O-methyl transferase (COMT), one of the major modulators of prefrontal dopamine function, has emerged as an important determinant of schizophrenia associated cognitive dysfunction and response to antipsychotics. A common Val->Met polymorphism (rs4680) in the COMT gene, associated with increased prefrontal dopamine catabolism, impairs prefrontal cognition and might increase risk for schizophrenia. Further, the degree of cognitive improvement observed with antipsychotics in schizophrenia patients is influenced by the COMT activity, and Val/Met has been proposed as a potential pharmacogenetic marker. However, studies evaluating the role of COMT have been equivocal. The presence of other functional polymorphisms in the gene, and the observed ethnic variations in the linkage disequilibrium structure at COMT locus, suggest that COMT activity regulation might be complex. Despite these lacunae in our current understanding, the influence of COMT on PFC mediated cognitive tasks is undeniable. COMT thus represents an attractive candidate for novel therapeutic interventions for cognitive dysfunction. The COMT activity inhibiting drugs including tolcapone and entacapone, have shown promising potential as they selectively modulate dopaminergic transmission. This review is an attempt to summarize the rapidly evolving literature exploring the diverse facets of COMT biology, its functional relevance as a predictive marker and a therapeutic target for schizophrenia.
Notes:
Pramod Gautam, Pankaj Jha, Dhirendra Kumar, Shivani Tyagi, Binuja Varma, Debasis Dash, Arijit Mukhopadhyay, Mitali Mukerji (2011)  Spectrum of large copy number variations in 26 diverse Indian populations: potential involvement in phenotypic diversity.   Hum Genet Jul  
Abstract: Copy number variations (CNVs) have provided a dynamic aspect to the apparently static human genome. We have analyzed CNVs larger than 100 kb in 477 healthy individuals from 26 diverse Indian populations of different linguistic, ethnic and geographic backgrounds. These CNVRs were identified using the Affymetrix 50K Xba 240 Array. We observed 1,425 and 1,337 CNVRs in the deletion and amplification sets, respectively, after pooling data from all the populations. More than 50% of the genes encompassed entirely in CNVs had both deletions and amplifications. There was wide variability across populations not only with respect to CNV extent (ranging from 0.04-1.14% of genome under deletion and 0.11-0.86% under amplification) but also in terms of functional enrichments of processes like keratinization, serine proteases and their inhibitors, cadherins, homeobox, olfactory receptors etc. These did not correlate with linguistic, ethnic, geographic backgrounds and size of populations. Certain processes were near exclusive to deletion (serine proteases, keratinization, olfactory receptors, GPCRs) or duplication (homeobox, serine protease inhibitors, embryonic limb morphogenesis) datasets. Populations having same enriched processes were observed to contain genes from different genomic loci. Comparison of polymorphic CNVRs (5% or more) with those cataloged in Database of Genomic Variants revealed that 78% (2473) of the genes in CNVRs in Indian populations are novel. Validation of CNVs using Sequenom MassARRAY revealed extensive heterogeneity in CNV boundaries. Exploration of CNV profiles in such diverse populations would provide a widely valuable resource for understanding diversity in phenotypes and disease.
Notes:
Rajesh Pandey, Mitali Mukerji (2011)  From 'JUNK' to just unexplored noncoding knowledge: the case of transcribed Alus.   Briefings in functional genomics 10: 5. 294-311 Sep  
Abstract: Non-coding RNAs (ncRNAs) are increasingly being implicated in diverse functional roles. Majority of these ncRNAs have their origin in the repetitive elements of genome. Significantly, increase in genomic complexity has been correlated with increase in repetitive content of the genome. Primate-specific Alu repeats, belonging to SINE class of repeats, is the most abundant repeat class inhabiting the human genome. Of the many possible functional roles of Alu repeats, they have been shown to modulate human transcriptome by virtue of harboring diverse array of functional RNA pol II TFBS, cryptic splice-site-mediated Alu exonization and as probable miRNA targets. Retro-transposition of Alu harboring TFBS has shaped up gene-specific regulatory networks. Alu exonized transcripts are raw material for dsRNA-mediated A-I editing leading to nuclear retention of transcripts and change in miRNA target. miRNA targets within Alu may titrate the effective miRNA or transcript concentration, thus acting as 'miRNA sponge'. Differential levels of Alu RNA during different conditions of stress also await clear functional understanding. These have contributed toward evolution of complex regulatory repertoire leading to the evolution of primate-specific functions. Recent reports of co-localization of pol II and pol III binding sites near the gene and elsewhere in the genome, increase the possibility of dynamic co-ordination between both pol II and pol III determining the ultimate transcriptional outcome. Dynamic and functional Alu repeats seem to be centrally placed to modulate the transcriptional landscape of human genome.
Notes:
2010
Shilpi Aggarwal, Sapna Negi, Pankaj Jha, Prashant K Singh, Tsering Stobdan, M A Qadar Pasha, Saurabh Ghosh, Anurag Agrawal, Bhavana Prasher, Mitali Mukerji (2010)  EGLN1 involvement in high-altitude adaptation revealed through genetic analysis of extreme constitution types defined in Ayurveda.   Proc Natl Acad Sci U S A 107: 44. 18961-18966 Nov  
Abstract: It is being realized that identification of subgroups within normal controls corresponding to contrasting disease susceptibility is likely to lead to more effective predictive marker discovery. We have previously used the Ayurvedic concept of Prakriti, which relates to phenotypic differences in normal individuals, including response to external environment as well as susceptibility to diseases, to explore molecular differences between three contrasting Prakriti types: Vata, Pitta, and Kapha. EGLN1 was one among 251 differentially expressed genes between the Prakriti types. In the present study, we report a link between high-altitude adaptation and common variations rs479200 (C/T) and rs480902 (T/C) in the EGLN1 gene. Furthermore, the TT genotype of rs479200, which was more frequent in Kapha types and correlated with higher expression of EGLN1, was associated with patients suffering from high-altitude pulmonary edema, whereas it was present at a significantly lower frequency in Pitta and nearly absent in natives of high altitude. Analysis of Human Genome Diversity Panel-Centre d'Etude du Polymorphisme Humain (HGDP-CEPH) and Indian Genome Variation Consortium panels showed that disparate genetic lineages at high altitudes share the same ancestral allele (T) of rs480902 that is overrepresented in Pitta and positively correlated with altitude globally (P < 0.001), including in India. Thus, EGLN1 polymorphisms are associated with high-altitude adaptation, and a genotype rare in highlanders but overrepresented in a subgroup of normal lowlanders discernable by Ayurveda may confer increased risk for high-altitude pulmonary edema.
Notes:
Ankita Narang, Rishi Das Roy, Amit Chaurasia, Arijit Mukhopadhyay, Mitali Mukerji, Debasis Dash (2010)  IGVBrowser--a genomic variation resource from diverse Indian populations.   Database (Oxford) 2010: 09  
Abstract: The Indian Genome Variation Consortium (IGVC) project, an initiative of the Council for Scientific and Industrial Research, has been the first large-scale comprehensive study of the Indian population. One of the major aims of the project is to study and catalog the variations in nearly thousand candidate genes related to diseases and drug response for predictive marker discovery, founder identification and also to address questions related to ethnic diversity, migrations, extent and relatedness with other world population. The Phase I of the project aimed at providing a set of reference populations that would represent the entire genetic spectrum of India in terms of language, ethnicity and geography and Phase II in providing variation data on candidate genes and genome wide neutral markers on these reference set of populations. We report here development of the IGVBrowser that provides allele and genotype frequency data generated in the IGVC project. The database harbors 4229 SNPs from more than 900 candidate genes in contrasting Indian populations. Analysis shows that most of the markers are from genic regions. Further, a large fraction of genes are implicated in cardiovascular, metabolic, cancer and immune system-related diseases. Thus, the IGVC data provide a basal level variation data in Indian population to study genetic diseases and pharmacology. Additionally, it also houses data on ∼50,000 (Affy 50 K array) genome wide neutral markers in these reference populations. In IGVBrowser one can analyze and compare genomic variations in Indian population with those reported in HapMap along with annotation information from various primary data sources. Database URL: http://igvbrowser.igib.res.in.
Notes:
Inder Singh, Mohammed Faruq, Odity Mukherjee, Sanjeev Jain, Pramod Kumar Pal, M V Padma Srivastav, Madhuri Behari, Achal K Srivastava, Mitali Mukerji (2010)  North and South Indian populations share a common ancestral origin of Friedreich's ataxia but vary in age of GAA repeat expansion.   Ann Hum Genet 74: 3. 202-210 May  
Abstract: Friedreich's ataxia (FRDA) is caused by expansion of GAA repeats in the frataxin (FXN) gene on chromosome 9q13-q21.1. We analysed the origin of FRDA in 21 North Indian (NI) and eight South Indian (SI) families using five single nucleotide polymorphisms (SNPs) and a microsatellite marker spanning the GAA repeats. The NI and SI families were derived from Indo-European and Dravidian linguistic backgrounds respectively. The frequency of large normal (LNs) alleles of the GAA repeat correlate with the overall lower prevalence of FRDA in India compared to the European population. All of the expanded alleles in the Indian population share a common core haplotype suggesting a founder effect. The expanded alleles in the NI population demonstrate more similarity to those of Europeans in terms of age of GAA repeat expansion (15975 +/- 2850 years) and association of LNs with expanded alleles. FRDA seems to have been introduced recently in the South Indian population since the average estimated age of the mutation in SI is 5425 +/- 1750 years and unlike NI some of the haplotypes of LNs are not associated with the expanded alleles.
Notes:
2009
Anshu Bhardwaj, Mitali Mukerji, Shipra Sharma, Jinny Paul, Chaitanya S Gokhale, Achal K Srivastava, Shrish Tiwari (2009)  MtSNPscore: a combined evidence approach for assessing cumulative impact of mitochondrial variations in disease.   BMC Bioinformatics 10 Suppl 8: 08  
Abstract: BACKGROUND: Human mitochondrial DNA (mtDNA) variations have been implicated in a broad spectrum of diseases. With over 3000 mtDNA variations reported across databases, establishing pathogenicity of variations in mtDNA is a major challenge. We have designed and developed a comprehensive weighted scoring system (MtSNPscore) for identification of mtDNA variations that can impact pathogenicity and would likely be associated with disease. The criteria for pathogenicity include information available in the literature, predictions made by various in silico tools and frequency of variation in normal and patient datasets. The scoring scheme also assigns scores to patients and normal individuals to estimate the cumulative impact of variations. The method has been implemented in an automated pipeline and has been tested on Indian ataxia dataset (92 individuals), sequenced in this study, and other publicly available mtSNP dataset comprising of 576 mitochondrial genomes of Japanese individuals from six different groups, namely, patients with Parkinson's disease, patients with Alzheimer's disease, young obese males, young non-obese males, and type-2 diabetes patients with or without severe vascular involvement. MtSNPscore, for analysis can extract information from variation data or from mitochondrial DNA sequences. It has a web-interface http://bioinformatics.ccmb.res.in/cgi-bin/snpscore/Mtsnpscore.pl that provides flexibility to update/modify the parameters for estimating pathogenicity. RESULTS: Analysis of ataxia and mtSNP data suggests that rare variants comprise the largest part of disease associated variations. MtSNPscore predicted possible role of eight and 79 novel variations in ataxia and mtSNP datasets, respectively, in disease etiology. Analysis of cumulative scores of patient and normal data resulted in Matthews Correlation Coefficient (MCC) of ~0.5 and accuracy of ~0.7 suggesting that the method may also predict involvement of mtDNA variation in diseases. CONCLUSION: We have developed a novel and comprehensive method for evaluation of mitochondrial variation and their involvement in disease. Our method has the most comprehensive set of parameters to assess mtDNA variations and overcomes the undesired bias generated as a result of better-studied diseases and genes. These variations can be prioritized for functional assays to confirm their pathogenic status.
Notes:
Mahmood Ameen Abdulla, Ikhlak Ahmed, Anunchai Assawamakin, Jong Bhak, Samir K Brahmachari, Gayvelline C Calacal, Amit Chaurasia, Chien-Hsiun Chen, Jieming Chen, Yuan-Tsong Chen, Jiayou Chu, Eva Maria C Cutiongco-de la Paz, Maria Corazon A De Ungria, Frederick C Delfin, Juli Edo, Suthat Fuchareon, Ho Ghang, Takashi Gojobori, Junsong Han, Sheng-Feng Ho, Boon Peng Hoh, Wei Huang, Hidetoshi Inoko, Pankaj Jha, Timothy A Jinam, Li Jin, Jongsun Jung, Daoroong Kangwanpong, Jatupol Kampuansai, Giulia C Kennedy, Preeti Khurana, Hyung-Lae Kim, Kwangjoong Kim, Sangsoo Kim, Woo-Yeon Kim, Kuchan Kimm, Ryosuke Kimura, Tomohiro Koike, Supasak Kulawonganunchai, Vikrant Kumar, Poh San Lai, Jong-Young Lee, Sunghoon Lee, Edison T Liu, Partha P Majumder, Kiran Kumar Mandapati, Sangkot Marzuki, Wayne Mitchell, Mitali Mukerji, Kenji Naritomi, Chumpol Ngamphiw, Norio Niikawa, Nao Nishida, Bermseok Oh, Sangho Oh, Jun Ohashi, Akira Oka, Rick Ong, Carmencita D Padilla, Prasit Palittapongarnpim, Henry B Perdigon, Maude Elvira Phipps, Eileen Png, Yoshiyuki Sakaki, Jazelyn M Salvador, Yuliana Sandraling, Vinod Scaria, Mark Seielstad, Mohd Ros Sidek, Amit Sinha, Metawee Srikummool, Herawati Sudoyo, Sumio Sugano, Helena Suryadi, Yoshiyuki Suzuki, Kristina A Tabbada, Adrian Tan, Katsushi Tokunaga, Sissades Tongsima, Lilian P Villamor, Eric Wang, Ying Wang, Haifeng Wang, Jer-Yuarn Wu, Huasheng Xiao, Shuhua Xu, Jin Ok Yang, Yin Yao Shugart, Hyang-Sook Yoo, Wentao Yuan, Guoping Zhao, Bin Alwi Zilfalil (2009)  Mapping human genetic diversity in Asia.   Science 326: 5959. 1541-1545 Dec  
Abstract: Asia harbors substantial cultural and linguistic diversity, but the geographic structure of genetic variation across the continent remains enigmatic. Here we report a large-scale survey of autosomal variation from a broad geographic sample of Asian human populations. Our results show that genetic ancestry is strongly correlated with linguistic affiliations as well as geography. Most populations show relatedness within ethnic/linguistic groups, despite prevalent gene flow among populations. More than 90% of East Asian (EA) haplotypes could be found in either Southeast Asian (SEA) or Central-South Asian (CSA) populations and show clinal structure with haplotype diversity decreasing from south to north. Furthermore, 50% of EA haplotypes were found in SEA only and 5% were found in CSA only, indicating that SEA was a major geographic source of EA populations.
Notes:
Mohammed Faruq, Vinod Scaria, Inder Singh, Shivani Tyagi, Achal K Srivastava, Mitali Mukerji (2009)  SCA-LSVD: a repeat-oriented locus-specific variation database for genotype to phenotype correlations in spinocerebellar ataxias.   Hum Mutat 30: 7. 1037-1042 Jul  
Abstract: Repeat expansion has been implicated in 10 out of 17 candidate genes identified for autosomal dominant cerebellar ataxias (ADCAs)-commonly referred as spinocerebellar ataxias (SCAs). Though genetically distinct, the SCAs share a large number of features that confound their clinical classification. In addition, there is a difference in the prevalence and phenotypic expression of ataxias between different ethnic groups. We have created a new SCA-locus-specific variation database (LSVD) that aims to catalog and integrate information on SCAs associated with trinucleotide repeat expansion (SCA1, SCA 2, SCA 3, SCA 6, SCA 7, SCA 8, SCA 12, SCA 17, Friedreich's ataxia [FRDA], and dentatorubral-pallidoluysian atrophy [DRPLA]) from all over the world. The database has been developed using the Leiden Open (source) Variation Database (LOVD) software (Leiden University Medical Center, Leiden, the Netherlands). The database houses detailed information on clinical features, such as age and symptom at onset, mode of inheritance, and genotype information, pertaining to the SCA patients from more than 400 families across India. All the compiled genotype data conforms to the HGVS Nomenclature guidelines. This would be a very useful starting point for understanding the molecular correlates of phenotypes in ataxia-a multilocus disease in which related molecular mechanisms converge to overlapping phenotypes.
Notes:
Samira Bahl, Ikhlak Ahmed, Mitali Mukerji (2009)  Utilizing linkage disequilibrium information from Indian Genome Variation Database for mapping mutations: SCA12 case study.   J Genet 88: 1. 55-60 Apr  
Abstract: Stratification in heterogeneous populations poses an enormous challenge in linkage disequilibrium (LD) based identification of causal loci using surrogate markers. In this study, we demonstrate the enormous potential of endogamous Indian populations for mapping mutations in candidate genes using minimal SNPs, mainly due to larger regions of LD. We show this by a case study of the PPP2R2B gene (approximately 400 kb) that harbours a CAG repeat, expansion of which has been implicated in spinocerebellar ataxia type 12 (SCA12). Using LD information derived from Indian Genome Variation database (IGVdb) on populations which share similar ethnic and linguistic backgrounds as the SCA12 study population, we could map the causal loci using a minimal set of three SNPs, without the generation of additional basal data from the ethnically matched population. We could also demonstrate transferability of tagSNPs from a related HapMap population for mapping the mutation.
Notes:
M Netravathi, Pramod Kumar Pal, Meera Purushottam, Kandavel Thennarasu, Mitali Mukherjee, Sanjeev Jain (2009)  Spinocerebellar ataxias types 1, 2 and 3: age adjusted clinical severity of disease at presentation correlates with size of CAG repeat lengths.   J Neurol Sci 277: 1-2. 83-86 Feb  
Abstract: OBJECTIVE: The objective of this study was to determine the correlation, if any, between the clinical rate of disease progression at presentation with the CAG repeat size in patients with spinocerebellar ataxias 1, 2 and 3 (SCA1, SCA2 and SCA3). METHODS: The severity of ataxia was measured using the International Cooperative Ataxia Rating Scale (IARS) in 31 patients of SCA1 (mean+/-SD age: 35.1+/-12.6 years; age at onset (AAO): 29.9+/-10.7 years), 25 patients of SCA2 (age: 34.9+/-14.9 years; AAO: 29.7+/-14.0 years) and 15 patients of SCA3 (age: 40.9+/-8.6 years; AAO: 36.9+/-10.1). The rate of disease progression at presentation was measured by the age adjusted IARS (IARS/Age). For each SCA, correlations of AAO, raw scores of IARS, age adjusted IARS and duration adjusted IARS (IARS/Duration) with the CAG repeat size were determined. RESULTS: The number of CAG repeats of the abnormal allele ranged from 42 to 67 in SCA1, 38 to 66 in SCA2, and 69 to 79 in SCA3. In all the three types of SCAs, there were significant inverse correlations of AAO with CAG repeat size (SCA1: r=-0.9, p<0.0001; SCA2: r=-0.7, p<0.0001; SCA3:-0.8, p=0.0003) and significant positive correlations of IARS/Age with CAG repeat size (SCA1: r=0.6, p=0.0015; SCA2: r=0.9, p<0.0001; SCA3:0.7, p=0.0057). However, the raw IARS scores and the duration adjusted IARS scores did not correlate significantly with the CAG repeat sizes. CONCLUSIONS: These data suggest that the rate of clinical disease progression at presentation, especially in SCA2, is dependent on the CAG repeat size, and may commence linearly from birth.
Notes:
Ravi Shankar, Bhavesh Kataria, Mitali Mukerji (2009)  Finding Alu in primate genomes with AF-1.   Bioinformation 3: 7. 287-288 02  
Abstract: Repetitive sequences occupy more than 40% of the human genome which is much larger compared to the 2% occupied by the coding DNA. Amongst these Alu elements are the second largest class of repeats, occupying nearly 10% of the whole genome. Alus have been implicated in many genomic processes, sometimes giving rise to aberrations while many times playing as silent player in genomic and regulatory evolution. Here we present a web server, AF1, exclusively developed for finding Alu like elements. Besides alignment based methodology, this server utilizes probabilistic scanning to find more diverged elements and employs a more precise way of element classification based on unequal weighting of sequence through sequence encoding. AVAILABILITY: AF1 is freely available at http://software.iiar.res.in/af1/. The standalone is also available for download.
Notes:
2008
Bhavana Prasher, Sapna Negi, Shilpi Aggarwal, Amit K Mandal, Tav P Sethi, Shailaja R Deshmukh, Sudha G Purohit, Shantanu Sengupta, Sangeeta Khanna, Farhan Mohammad, Gaurav Garg, Samir K Brahmachari, Mitali Mukerji (2008)  Whole genome expression and biochemical correlates of extreme constitutional types defined in Ayurveda.   J Transl Med 6: 09  
Abstract: BACKGROUND: Ayurveda is an ancient system of personalized medicine documented and practiced in India since 1500 B.C. According to this system an individual's basic constitution to a large extent determines predisposition and prognosis to diseases as well as therapy and life-style regime. Ayurveda describes seven broad constitution types (Prakritis) each with a varying degree of predisposition to different diseases. Amongst these, three most contrasting types, Vata, Pitta, Kapha, are the most vulnerable to diseases. In the realm of modern predictive medicine, efforts are being directed towards capturing disease phenotypes with greater precision for successful identification of markers for prospective disease conditions. In this study, we explore whether the different constitution types as described in Ayurveda has molecular correlates. METHODS: Normal individuals of the three most contrasting constitutional types were identified following phenotyping criteria described in Ayurveda in Indian population of Indo-European origin. The peripheral blood samples of these individuals were analysed for genome wide expression levels, biochemical and hematological parameters. Gene Ontology (GO) and pathway based analysis was carried out on differentially expressed genes to explore if there were significant enrichments of functional categories among Prakriti types. RESULTS: Individuals from the three most contrasting constitutional types exhibit striking differences with respect to biochemical and hematological parameters and at genome wide expression levels. Biochemical profiles like liver function tests, lipid profiles, and hematological parameters like haemoglobin exhibited differences between Prakriti types. Functional categories of genes showing differential expression among Prakriti types were significantly enriched in core biological processes like transport, regulation of cyclin dependent protein kinase activity, immune response and regulation of blood coagulation. A significant enrichment of housekeeping, disease related and hub genes were observed in these extreme constitution types. CONCLUSION: Ayurveda based method of phenotypic classification of extreme constitutional types allows us to uncover genes that may contribute to system level differences in normal individuals which could lead to differential disease predisposition. This is a first attempt towards unraveling the clinical phenotyping principle of a traditional system of medicine in terms of modern biology. An integration of Ayurveda with genomics holds potential and promise for future predictive medicine.
Notes:
Billie-Jo Hardy, Béatrice Séguin, Peter A Singer, Mitali Mukerji, Samir K Brahmachari, Abdallah S Daar (2008)  From diversity to delivery: the case of the Indian Genome Variation initiative.   Nat Rev Genet 9 Suppl 1: S9-14 Oct  
Abstract: India currently has the world's second-largest population along with a fast-growing economy and significant economic disparity. It also continues to experience a high rate of infectious disease and increasingly higher rates of chronic diseases. However, India cannot afford to import expensive technologies and therapeutics nor can it, as an emerging economy, emulate the health-delivery systems of the developed world. Instead, to address these challenges it is looking to biotechnology-based innovation in the field of genomics. The Indian Genome Variation (IGV) consortium, a government-funded collaborative network among seven local institutions, is a reflection of these efforts. The IGV has recently developed the first large-scale database of genomic diversity in the Indian population that will facilitate research on disease predisposition, adverse drug reactions and population migration.
Notes:
(2008)  Genetic landscape of the people of India: a canvas for disease gene exploration.   J Genet 87: 1. 3-20 Apr  
Abstract: Analyses of frequency profiles of markers on disease or drug-response related genes in diverse populations are important for the dissection of common diseases. We report the results of analyses of data on 405 SNPs from 75 such genes and a 5.2 Mb chromosome, 22 genomic region in 1871 individuals from diverse 55 endogamous Indian populations. These include 32 large (>10 million individuals) and 23 isolated populations, representing a large fraction of the people of India. We observe high levels of genetic divergence between groups of populations that cluster largely on the basis of ethnicity and language. Indian populations not only overlap with the diversity of HapMap populations, but also contain population groups that are genetically distinct. These data and results are useful for addressing stratification and study design issues in complex traits especially for heterogeneous populations.
Notes:
2007
Nithin Krishna, Surendra Mohan, B S Yashavantha, A Rammurthy, H B Kiran Kumar, Uma Mittal, Shivani Tyagi, Mitali Mukerji, Sanjeev Jain, Pramod Kumar Pal, Meera Purushottam (2007)  SCA 1, SCA 2 & SCA 3/MJD mutations in ataxia syndromes in southern India.   Indian J Med Res 126: 5. 465-470 Nov  
Abstract: BACKGROUND & OBJECTIVE: Spinocerebellar ataxias (SCAs) are often caused by expansions of CTG/ CAG trinucleotide repeat in the genome. Expansions at the SCA1, 2 and 3 loci are the most frequent, but differences in their relative proportion in regions occur across the world. We carried out this study to assess the occurrence of SCA1, 2 and 3, at a tertiary neuro-psychiatric center in Bangalore, Karnataka. METHODS: Probands (N=318) who were diagnosed to have an ataxia syndrome (progressive degenerative ataxia of unknown cause) attending the clinical services of the National Institute of Mental Health and Neuro Sciences (NIMHANS), Bangalore, were evaluated over a period of three years. Standard protocols were used for both clinical and molecular diagnosis. RESULTS: Genotyping established that SCA1, 2 and 3 accounted for more than one third of the ataxia cases seen in the clinic. In the cases with established family history and autosomal dominant inheritance SCA1 was most prevalent followed by SCA2 and SCA3. INTERPRETATION & CONCLUSION: Our findings suggested SCA1 rather than SCA2 to be the more common mutation in southern India. Large numbers of SCA3 probands were also identified. Differences in prevalence of these syndromes within India need to be explored further for founder effects, correlations with phenotype, and patterns of outcome. Family history was not apparent in almost a fifth of those tested positive, highlighting the value of testing even in the absence of family history. Molecular testing should be extended to cover the other forms of ataxia, of which a large number are now known. Combined efforts to confirm the presence of these less common forms, as well as family studies to detect novel mutations, are necessary in this context in India.
Notes:
Ravi Shankar, Amit Chaurasia, Biswaroop Ghosh, Dmitry Chekmenev, Evgeny Cheremushkin, Alexander Kel, Mitali Mukerji (2007)  Non-random genomic divergence in repetitive sequences of human and chimpanzee in genes of different functional categories.   Mol Genet Genomics 277: 4. 441-455 Apr  
Abstract: Sequencing of the human and chimpanzee genomes has revealed approximately 99% similarity in the coding sequence between both the species, which in no way parallels the observable phenotypic differences. Contribution of the non-coding sequences which comprise a bulk of the genome, in functional divergence between human and chimpanzee, is largely understudied. In this context, we have compared extents of divergence in the non-coding repetitive DNA in a data set of well-classified neuronal and housekeeping genes between human and chimpanzee. The coding regions of these genes have earlier been extensively compared between the two species. It was revealed that the neurodevelopmental genes show accelerated evolution compared to neurophysiology and housekeeping genes in human. In this study, comparative analysis in terms of repeat spectrum, divergence in dinucleotide content density, JC divergence and its partitioning in repeats versus unique regions and transcription factor binding sites indicate different extents of functional constraints associated with the non-coding repeat regions. The constraints are also different when the upstream and downstream genic regions are compared across the functional categories. The neurodevelopmental genes seem to diverge more in the genic regions, whereas the neurophysiology genes show higher divergence in the upstream 2 kb region. Most of the divergence observed in the housekeeping genes is contributed by repeats. We also observe an accumulation of function-specific transcription factor profiles in the human lineage. Interestingly, a major fraction of the regulatory sites in these regions is differently partitioned in the repetitive sequences which in turn is dependant upon the relative distribution of the repeats across the functional categories. Thus, differential distribution of repeats across the various functional categories could have substantial effects on genome wide regulation and structure. The insights obtained from this study further add a new facet to the contribution of non-coding factors especially repeats in divergence of human and chimpanzee.
Notes:
2006
Pooja Rawal, Veera Bhadra Rao Kummarasetti, Jinoy Ravindran, Nirmal Kumar, Kangkan Halder, Rakesh Sharma, Mitali Mukerji, Swapan Kumar Das, Shantanu Chowdhury (2006)  Genome-wide prediction of G4 DNA as regulatory motifs: role in Escherichia coli global regulation.   Genome Res 16: 5. 644-655 May  
Abstract: The role of nonlinear DNA in replication, recombination, and transcription has become evident in recent years. Although several studies have predicted and characterized regulatory elements at the sequence level, very few have investigated DNA structure as regulatory motifs. Here, using G-quadruplex or G4 DNA motifs as a model, we have researched the role of DNA structure in transcription on a genome-wide scale. Analyses of >61,000 open reading frames (ORFs) across 18 prokaryotes show enrichment of G4 motifs in regulatory regions and indicate its predominance within promoters of genes pertaining to transcription, secondary metabolite biosynthesis, and signal transduction. Based on this, we predict that G4 DNA may present regulatory signals. This is supported by conserved G4 motifs in promoters of orthologous genes across phylogenetically distant organisms. We hypothesized a regulatory role of G4 DNA during supercoiling stress, when duplex destabilization may result in G4 formation. This is in line with our observations from target site analysis for 55 DNA-binding proteins in Escherichia coli, which reveals significant (P<0.001) association of G4 motifs with target sites of global regulators FIS and Lrp and the sigma factor RpoD (sigma70). These factors together control >1000 genes in the early growth phase and are believed to be induced by supercoiled DNA. We also predict G4 motif-induced supercoiling sensitivity for >30 operons in E. coli, and our findings implicate G4 DNA in DNA-topology-mediated global gene regulation in E. coli.
Notes:
2005
Ranjana Verma, Mitali Mukerji, Deepak Grover, Chandrika B-Rao, Swapan Kumar Das, Shobana Kubendran, Sanjeev Jain, Samir K Brahmachari (2005)  MLC1 gene is associated with schizophrenia and bipolar disorder in Southern India.   Biol Psychiatry 58: 1. 16-22 Jul  
Abstract: BACKGROUND: Chromosome 22q13 has shown linkage with schizophrenia (SCZ) and bipolar affective disorder (BPAD). A missense mutation in MLC1 (putative cation-channel gene on 22q13) co-segregating with periodic catatonic schizophrenia has been reported. We have investigated the relationship of MLC1 with SCZ and BPAD in Southern India. METHODS: All exons and flanking intronic sequences of MLC1 were screened for novel variations. Case-control (216 BPAD, 193 SCZ, 116 control subjects) and family-based analyses (113 BPAD, 107 SCZ families) were performed to evaluate association of MLC1 with these disorders. RESULTS: We found 33 MLC1 sequence variations, including three novel mutations: Val210Ile, Leu308Gln, and Arg328His in six BPAD cases and Val210Ile in one control individual. Minor allele of a common variation, ss16339182 (in approximately 6 Kb Linkage-Disequilibrium [LD]-block) was associated with BPAD in case-control (p = .03) and family-based analyses (transmitted/nontransmitted [T/NT]-44/20; p = .003). Association was observed for rs2235349 and rs2076137 with SCZ and ss16339163 with BPAD in case-control study. Using Block 2 haplotype tagging single nucleotide polymorphisms (htSNPs), GC haplotype revealed association (p = .02) and excess transmission (p = .002) with BPAD. CONCLUSIONS: Association of MLC1 with SCZ and BPAD suggests involvement of a common pathway. Rare missense mutations and common variants associated with BPAD favors hypothesis about likely involvement of both rare and common polymorphisms in etiology of this complex disorder.
Notes:
Uma Mittal, Achal K Srivastava, Satish Jain, Sanjeev Jain, Mitali Mukerji (2005)  Founder haplotype for Machado-Joseph disease in the Indian population: novel insights from history and polymorphism studies.   Arch Neurol 62: 4. 637-640 Apr  
Abstract: BACKGROUND: The ACA haplotype is associated with 72% of the expanded repeats in Machado-Joseph disease (MJD) worldwide and has been traced to a Portuguese ancestry. It is present in only 5% of the normal chromosomes in the Portuguese population. OBJECTIVE: To trace the origin of expanded alleles of MJD in the Indian population. METHODS: We performed CAG repeat size determination and haplotype analysis for 9 families with MJD and 263 unrelated chromosomes with unexpanded CAG sequences from the Indian population. RESULTS: All the expanded alleles were exclusively associated with the ACA haplotype in the Indian population. Interestingly, this haplotype was very common in normal alleles (40%) as compared with the Portuguese population (5%) and was significantly associated with large normal alleles (Pearson chi(2)1 = 87.1, P<.001) in the Indian population. We also observed a rare intermediate allele of MJD with the ACA haplotype but with a CAG variant instead of CAA at the sixth position in the repeat tract. CONCLUSIONS: Overrepresentation of the ACA haplotype in large normal alleles in India as compared with the Portuguese population suggests that the expansion-prone large normal alleles with the ACA haplotype may have been introduced in the Portuguese population through admixture with South Asian populations. Detailed haplotype analysis of a CAG variant within the repeat tract in an intermediate allele of MJD suggests a mechanism of gene conversion in the expansion.
Notes:
Uma Mittal, Sanghamitra Roy, Satish Jain, Achal K Srivastava, Mitali Mukerji (2005)  Post-zygotic de novo trinucleotide repeat expansion at spinocerebellar ataxia type 7 locus: evidence from an Indian family.   J Hum Genet 50: 3. 155-157 03  
Abstract: Spinocerebellar ataxia type 7 (SCA7) is an autosomal dominant cerebellar ataxia caused by CAG repeat expansion. We found expansion at SCA7 locus in only two out of 235 Indian families clinically diagnosed for ataxia. In one of the families, a de novo mutation was observed wherein a paternal allele in intermediate range of 31 CAG repeats expanded to 59 in the offspring leading to the disease. No expanded alleles were observed in the sperm of the transmitting parent by small pool PCR. This suggests that de novo expansion by a pre-zygotic event is unlikely and could be post-zygotic. SCA7 expanded alleles from the two families were present on different genetic backgrounds, indicating multiple origins of the mutation.
Notes:
Uma Mittal, Sangeeta Sharma, Rupali Chopra, Kalladka Dheeraj, Pramod Kr Pal, Achal K Srivastava, Mitali Mukerji (2005)  Insights into the mutational history and prevalence of SCA1 in the Indian population through anchored polymorphisms.   Hum Genet 118: 1. 107-114 Oct  
Abstract: There is a wide variation in prevalence of spinocerebellar ataxia type 1 (SCA1) in different populations. In the present study, we observed SCA1 in approximately 22% (37/167 families) of the autosomal dominant cerebellar ataxias (ADCAs) in the Indian population. We investigated the role of various genetic factors like repeat length, interruption pattern and chromosomal background in predisposing the repeats to instability in these families. We analyzed 12 markers (9 SNPs and 3 microsatellite markers) and found 3 of them, spanning a region of approximately 65 kbp to be linked with the disease locus in the Indian population. The haplotype C-4-C defined by rs1476464 (SNP9)-D6S288-rs2075974 (SNP1), which was extremely rare in nonaffected chromosomes (approximately 3%), was observed to be significantly (P<0.0000) associated with the expanded chromosomes in approximately 44% of SCA1 families. This haplotype was found in all nonhuman primates. SNP1 (C/T), which showed a skewed allelic distribution between large (LN > 30 repeats) and small normal (SN <or= 30 repeats) alleles (P<0.0000) had similar allelic distribution (P=0.3477) in LN and expanded alleles. Our study suggested that LN and expanded chromosomes linked with the ancestral C allele of SNP1 might have originated simultaneously during evolution by the lengthening of repeats. The LN alleles might have accumulated repeat stabilizing non-CAG interruptions during this process. Similar proportions of T allele in SN with single interruptions, LN and expanded chromosomes lend credence to the origin of expanded alleles from singly-interrupted chromosomes. Our analyses using markers linked (anchoring) to SCA1 suggest that prevalence of SCA1 is correlated to both repeat length and number of interruptions in the Indian population. The spectrum of these alleles also points toward the antiquity of SCA1 mutation in the Indian population.
Notes:
Deepak Grover, K Kannan, Samir K Brahmachari, Mitali Mukerji (2005)  ALU-ring elements in the primate genomes.   Genetica 124: 2-3. 273-289 Jul  
Abstract: Elucidation of complete nucleotide sequence of the human has revealed that coding sequences that store the information needed to synthesize functional proteins, occupy only 2% of the genomic region. The remaining 98%, barring few regulatory sequences, has been referred to as non-functional or junk DNA and consists of many kinds of repeat elements. In fact, human genome is the most repeat rich genome sequenced so far, in which more than half of the region is occupied by such sequences. Determination of significance of these repeats in the human genome has become the focus of many studies all over the world, especially after genome sequencing did not reveal any significant difference in coding regions between lower eukaryotes and human. In this article, we have focused on Alu repeats that are primate specific elements with many interesting biological properties. Moreover, these are the repeats with highest copy number in the human genome. We have highlighted different facets of their interaction with the genome and changing paradigms regarding their role in genome organization.
Notes:
S Bahl, K Virdi, U Mittal, M P Sachdeva, A K Kalla, S E Holmes, E O'Hearn, R L Margolis, S Jain, A K Srivastava, M Mukerji (2005)  Evidence of a common founder for SCA12 in the Indian population.   Ann Hum Genet 69: Pt 5. 528-534 Sep  
Abstract: Spinocerebellar ataxia type 12 (SCA12) is an autosomal dominant cerebellar ataxia associated with the expansion of an unstable CAG repeat in the 5' region of the PPP2R2B gene on chromosome 5q31-5q32. We found that it accounts for approximately 16% (20/124) of all the autosomal dominant ataxia cases diagnosed in AIIMS, a major tertiary referral centre in North India. The length of the expanded allele in this population ranges from 51-69 CAG triplets. Interestingly, all the affected families belong to an endogamous population, which originated in the state of Haryana, India. We identified four novel SNPs and a dinucleotide marker spanning approximately 137 kb downstream of CAG repeat in the PPP2R2B gene. Analysis of 20 Indian SCA12 families and ethnically matched normal unrelated individuals revealed one haplotype to be significantly associated with the affected alleles (P= 0.000), clearly indicating the presence of a common founder for SCA12 in the Indian population. This haplotype was not shared by the American pedigree with SCA12. Therefore, the SCA12 expansion appears to have originated at least twice.
Notes:
2004
Ravi Shankar, Deepak Grover, Samir K Brahmachari, Mitali Mukerji (2004)  Evolution and distribution of RNA polymerase II regulatory sites from RNA polymerase III dependant mobile Alu elements.   BMC Evol Biol 4: 1. Oct  
Abstract: BACKGROUND: The primate-specific Alu elements, which originated 65 million years ago, exist in over a million copies in the human genome. These elements have been involved in genome shuffling and various diseases not only through retrotransposition but also through large scale Alu-Alu mediated recombination. Only a few subfamilies of Alus are currently retropositionally active and show insertion/deletion polymorphisms with associated phenotypes. Retroposition occurs by means of RNA intermediates synthesised by a RNA polymerase III promoter residing in the A-Box and B-Box in these elements. Alus have also been shown to harbour a number of transcription factor binding sites, as well as hormone responsive elements. The distribution of Alus has been shown to be non-random in the human genome and these elements are increasingly being implicated in diverse functions such as transcription, translation, response to stress, nucleosome positioning and imprinting. RESULTS: We conducted a retrospective analysis of putative functional sites, such as the RNA pol III promoter elements, pol II regulatory elements like hormone responsive elements and ligand-activated receptor binding sites, in Alus of various evolutionary ages. We observe a progressive loss of the RNA pol III transcriptional potential with concomitant accumulation of RNA pol II regulatory sites. We also observe a significant over-representation of Alus harboring these sites in promoter regions of signaling and metabolism genes of chromosome 22, when compared to genes of information pathway components, structural and transport proteins. This difference is not so significant between functional categories in the intronic regions of the same genes. CONCLUSIONS: Our study clearly suggests that Alu elements, through retrotransposition, could distribute functional and regulatable promoter elements, which in the course of subsequent selection might be stabilized in the genome. Exaptation of regulatory elements in the preexisting genes through Alus could thus have contributed to evolution of novel regulatory networks in the primate genomes. With such a wide spectrum of regulatory sites present in Alus, it also becomes imperative to screen for variations in these sites in candidate genes, which are otherwise repeat-masked in studies pertaining to identification of predisposition markers.
Notes:
Neeraj Pandey, Uma Mittal, Achal K Srivastava, Mitali Mukerji (2004)  SMARCA2 and THAP11: potential candidates for polyglutamine disorders as evidenced from polymorphism and protein-folding simulation studies.   J Hum Genet 49: 11. 596-602 09  
Abstract: CAG repeat expansion is the cause of an ever-increasing list of neurodegenerative disorders, especially hereditary ataxias. However, genes responsible for 10-50% of the clinically diagnosed ataxias are still unidentified in different populations. Traditional linkage and repeat expansion-detection based methods complemented with human genome sequence and expression information can now accelerate the pace of identification of putative disease candidates. We have analyzed two CAG repeat containing loci, human SMARCA2 and THAP11, which are expressed in the brain as putative candidates for SCAs, using computational as well as polymorphism scanning approaches. Both loci exhibited features characteristic of genes associated with repeat disorders. These loci are polymorphic with respect to size and interruption pattern in the Indian population. Furthermore, computational analysis of glutamine-stretch embedded domains in the respective proteins predicted these regions to be "natively unfolded" beyond a threshold of 40 glutamines. Comparative genome analysis suggested a stabilizing influence of CAA interspersions in repeat tract in THAP11 but not in SMARCA2. Although repeat expansion could not be detected within these genes in unidentified ataxia patients reported in India, we suggest that these loci be screened in other populations, as there is a wide heterogeneity in the prevalence of these disorders in different populations.
Notes:
Mona Ragothaman, Nagaraja Sarangmath, Shashi Chaudhary, Vishwamohini Khare, Uma Mittal, Sangeeta Sharma, Sreelatha Komatireddy, Subhabrata Chakrabarti, Mitali Mukerji, Ramesh C Juyal, B K Thelma, Uday B Muthane (2004)  Complex phenotypes in an Indian family with homozygous SCA2 mutations.   Ann Neurol 55: 1. 130-133 Jan  
Abstract: We describe a consanguineous Indian family having spinocerebellar ataxia type 2 (SCA2) expansions with complex phenotypes (early-onset, dopa-responsive parkinsonism, ataxia and retinitis pigmentosa). The two probands having homozygous SCA2 mutations presenting with early-onset dopa-responsive parkinsonism without ataxia develop dyskinesias within a year of starting levodopa. Their siblings, heterozygous for SCA2 mutations, had retinitis pigmentosa with or without ataxia. Approximately 38% of family members with SCA2 mutations were asymptomatic.
Notes:
Deepak Grover, Mitali Mukerji, Pankaj Bhatnagar, K Kannan, Samir K Brahmachari (2004)  Alu repeat analysis in the complete human genome: trends and variations with respect to genomic composition.   Bioinformatics 20: 6. 813-817 Apr  
Abstract: MOTIVATION: Transposon-derived Alu repeats are exclusively associated with primate genomes. They have gained considerable importance in the recent times with evidence of their involvement in various aspects of gene regulation, e.g. alternative splicing, nucleosome positioning, CpG methylation, binding sites for transcription factors and hormone receptors, etc. The objective of this study is to investigate the factors that influence the distribution of Alu repeat elements in the human genome. Such analysis is expected to yield insights into various aspects of gene regulation in primates. RESULTS: Analysis of Alu repeat distribution for the human genome build 32 (released in January 2003) reveals that they occupy nearly one-tenth portion of the sequenced regions. Huge variations in Alu frequencies were seen across the genome with chromosome 19 being the most and chromosome Y being the least Alu dense chromosomes. The highlights of the analysis are as follows: (1). three-fourth of the total genes in the genome are associated with Alus. (2). Alu density is higher in genes as compared with intergenic regions in all the chromosomes except 19 and 22. (3). Alu density in human genome is highly correlated with GC content, gene density and intron density with GC content being major deterministic factor compared with other two. (4). Alu densities were correlated more with gene density than intron density indicating the insertion of Alus in untranslated regions of exons.
Notes:
2003
Deepak Grover, Partha P Majumder, Chandrika B Rao, Samir K Brahmachari, Mitali Mukerji (2003)  Nonrandom distribution of alu elements in genes of various functional categories: insight from analysis of human chromosomes 21 and 22.   Mol Biol Evol 20: 9. 1420-1424 Sep  
Abstract: The first draft of the human genome has revealed enormous variability in the global distribution of Alu repeat elements. There are regions such as the four homeobox gene clusters, which are nearly devoid of these repeats that contrast with repeat dense regions in other transcriptionally active regions of the genome. Our analysis of the completely sequenced chromosomes 21 and 22 revealed a striking bias in Alu distribution. These elements are more clustered in genes which are involved in metabolism, transport, and signaling processes. In contrast, they are significantly fewer in genes coding for information pathway components as well as structural proteins. This bias in Alu distribution is independent of the effect of Alu density of the flanking genomic region and is also not affected by the GC content of the gene and its upstream and downstream regions. The relative proportions of Alu subfamilies (Alu J, Alu S, and Alu Y) are not significantly different in genes with high Alu density belonging to the functional categories of transport, metabolism, and signaling. However, in the structural proteins and information genes, these proportions are lower than the other three categories. We suggest that Alu elements might be involved in regulatory mechanisms and are therefore differentially selected in primate genomes.
Notes:
2002
Chitra Chauhan, Debasis Dash, Deepak Grover, Jaya Rajamani, Mitali Mukerji (2002)  Origin and instability of GAA repeats: insights from Alu elements.   J Biomol Struct Dyn 20: 2. 253-263 Oct  
Abstract: Expansion of GAA repeats in the intron of the frataxin gene is involved in the autosomal recessive Friedreich's ataxia (FRDA). The GAA repeats arise from a stretch of adenine residues of an Alu element. These repeats have a size ranging from 7- 38 in the normal population, and expand to thousands in the affected individuals. The mechanism of origin of GAA repeats, their polymorphism and stability are not well understood. In this study, we have carried out an extensive analysis of GAA repeats at several loci in the humans. This analysis indicates the association of a majority of GAA repeats with the 3' end of an "A" stretch present in the Alu repeats. Further, the prevalence of GAA repeats correlates with the evolutionary age of Alu subfamilies as well as with their relative frequency in the genome. Our study on GAA repeat polymorphism at some loci in the normal population reveals that the length of the GAA repeats is determined by the relative length of the flanking A stretch. Based on these observations, a possible mechanism for origin of GAA repeats and modulatory effects of flanking sequences on repeat instability mediated by DNA triplex is proposed.
Notes:
2001
S Choudhry, M Mukerji, A K Srivastava, S Jain, S K Brahmachari (2001)  CAG repeat instability at SCA2 locus: anchoring CAA interruptions and linked single nucleotide polymorphisms.   Hum Mol Genet 10: 21. 2437-2446 Oct  
Abstract: Spinocerebellar ataxia 2 (SCA2) is an autosomal dominant neurodegenerative disorder that results from the expansion of a cryptic CAG repeat within the exon 1 of the SCA2 gene. The CAG repeat in normal individuals varies in length from 14 to 31 repeats and is frequently interrupted by one or more CAA triplets, whereas the expanded alleles contain a pure uninterrupted stretch of 34 to 59 CAG repeats. We have previously reported the presence of a limited pool of 'ancestral' or 'at risk' haplotypes for the expanded SCA2 alleles in the Indian population. We now report the identification of two novel single nucleotide polymorphisms (SNPs) in exon 1 of the SCA2 gene and their characterization in 215 normal and 64 expanded chromosomes. The two biallelic SNPs distinguished two haplotypes, GT and CC, each of which formed a predominant haplotype associated with normal and expanded SCA2 alleles. All the expanded alleles segregated with CC haplotype, which otherwise was associated with only 29.3% of the normal chromosomes. CAA interspersion analysis revealed that majority of the normal alleles with CC haplotype were either pure or lacked the most proximal 5' CAA interruption. The repeat length variation at SCA2 locus also appeared to be polar with changes occurring mostly at the 5' end of the repeat. Our results demonstrate that CAA interruptions play an important role in conferring stability to SCA2 repeat and their absence predisposes alleles towards instability and pathological expansion. Our study also provides new haplotypes associated with SCA2 that should prove useful in further understanding the mutational history and mechanism of repeat instability at the SCA2 locus.
Notes:
2000
M Mukerji, S Choudhry, Q Saleem, M V Padma, M C Maheshwari, S Jain (2000)  Molecular analysis of Friedreich's ataxia locus in the Indian population.   Acta Neurol Scand 102: 4. 227-229 Oct  
Abstract: OBJECTIVES: Friedreich's ataxia (FRDA) is an autosomal recessive neurodegenerative disorder caused by expansion of GAA repeats in the frataxin gene. We have carried out the first molecular analysis at the Friedreich's ataxia locus in the Indian population. MATERIALS AND METHODS: Three families clinically diagnosed for Friedreich's ataxia were analyzed for GAA expansion at the FRDA locus. The distribution of GAA repeats was also estimated in normal individuals of Indian origin. RESULTS: All patients clinically diagnosed for Friedreich's ataxia were found to be homozygous for GAA repeat expansion. The GAA repeat in the normal population show a bimodal distribution with 94% of alleles ranging from 7-16 repeats. CONCLUSION: Indian patients with expansion at the FRDA locus showed typical clinical features of Friedreich's ataxia. The low frequency of large normal alleles (6%) could indicate that the prevalence of this disease in the Indian population is likely to be low.
Notes:
Q Saleem, S Choudhry, M Mukerji, L Bashyam, M V Padma, A Chakravarthy, M C Maheshwari, S Jain, S K Brahmachari (2000)  Molecular analysis of autosomal dominant hereditary ataxias in the Indian population: high frequency of SCA2 and evidence for a common founder mutation.   Hum Genet 106: 2. 179-187 Feb  
Abstract: Expansion of CTG/CAG trinucleotide repeats has been shown to cause a number of autosomal dominant cerebellar ataxias (ADCA) such as SCA1, SCA2, SCA3/ MJD, SCA6, SCA7, SCA8 and DRPLA. There is a wide variation in the clinical phenotype and prevalence of these ataxias in different populations. An analysis of ataxias in 42 Indian families indicates that SCA2 is the most frequent amongst all the ADCAs we have studied. In the SCA2 families, together with an intergenerational increase in repeat size, a horizontal increase with the birth order of the offspring was also observed, indicating an important role for parental age in repeat instability. This was strengthened by the detection of a pair of dizygotic twins with expanded alleles showing the same repeat number. Haplotype analysis indicates the presence of a common founder chromosome for the expanded allele in the Indian population. Polymorphism of CAG repeats in 135 normal individuals at the SCA loci studied showed similarity to the Caucasian population but was significantly different from the Japanese population.
Notes:
1997
M Mukerji, S Mahadevan (1997)  Characterization of the negative elements involved in silencing the bgl operon of Escherichia coli: possible roles for DNA gyrase, H-NS, and CRP-cAMP in regulation.   Mol Microbiol 24: 3. 617-627 May  
Abstract: The bgl operon of Escherichia coli is rendered cryptic and uninducible in wild-type cells by the presence of DNA structural elements that negatively regulate transcription. We have carried out a detailed analysis of the sequences implicated in negative regulation. Fine-structure deletion analysis of the upstream sequences showed the presence of at least two elements involved in silencing the promoter. Chemical probing of genomic DNA in vivo showed that a region of dyad symmetry, present upstream of the promoter, is hypersensitive to KMnO4. The hypersensitive region detected corresponds to the potential cruciform structure implicated earlier in negative regulation. Enhancement of transcription from the wild-type promoter, observed in the presence of the gyrase inhibitor novobiocin, was absent in a mutant that carried point mutations in the inverted repeat. This observation suggests that the activation seen in a gyrase mutant is mediated by destabilization of the cruciform because of reduced supercoiling. Deletion of sequences downstream of the potential cruciform also resulted in an increase in transcription, indicating the presence of a second regulatory element. Measurement of transcription from the bgl promoter carrying the deletion, in a strain that has a mutation in the hns gene, indicated that this region is likely to be involved in binding to H-NS or a protein regulated by H-NS, which acts as a non-specific repressor. We also provide evidence which suggests that transcriptional activation by mutations at the cAMP receptor protein (CRP)-binding site is mediated partly by antagonization of the negative effect of H-NS by CRP-cAMP as a result of its increased affinity for the mutant site.
Notes:
1995
J Singh, M Mukerji, S Mahadevan (1995)  Transcriptional activation of the Escherichia coli bgl operon: negative regulation by DNA structural elements near the promoter.   Mol Microbiol 17: 6. 1085-1092 Sep  
Abstract: The bgl operon of Escherichia coli is transcriptionally inactive in wild-type cells. DNA insertion sequences (IS) constitute a major class of spontaneous mutations that activate the cryptic bgl promoter. In an attempt to study the molecular mechanism of activation mediated by insertion sequences, transcription of the bgl promoter was carried out in vitro. Stimulation of transcription is observed when a plasmid containing an insertionally activated bgl promoter is used as a template in the absence of proteins other than RNA polymerase. Deletions that remove sequences upstream of the bgl promoter, and insertion of a 1.2 kb DNA fragment encoding resistance to kanamycin, activate the promoter. Point mutations within a region of dyad symmetry upstream of the promoter, which has the potential to extrude into a cruciform structure under torsional stress, also lead to activation. Introduction of a sequence with dyad symmetry, upstream of an activated bgl promoter carrying a deletion of upstream sequences, results in a fourfold reduction in transcription. These results suggest that the cryptic nature of the bgl promoter is because of the presence of DNA structural elements near the promoter that negatively affect transcription.
Notes:
Powered by PublicationsList.org.