hosted by
publicationslist.org
    

Iain M Wallace


iain.m.wallace@gmail.com

Journal articles

2011
Genna M Luciani, Lilia Magomedova, Rachel Puckrin, Malene L Urbanus, Iain M Wallace, Guri Giaever, Corey Nislow, Carolyn L Cummins, Peter J Roy (2011)  Dafadine inhibits DAF-9 to promote dauer formation and longevity of Caenorhabditis elegans.   Nat Chem Biol 7: 12. 891-893 11  
Abstract: The DAF-9 cytochrome P450 is a key regulator of dauer formation, developmental timing and longevity in the nematode Caenorhabditis elegans. Here we describe the first identified chemical inhibitor of DAF-9 and the first reported small-molecule tool that robustly induces dauer formation in typical culture conditions. This molecule (called dafadine) also inhibits the mammalian ortholog of DAF-9(CYP27A1), suggesting that dafadine can be used to interrogate developmental control and longevity in other animals.
Notes:
Iain M Wallace, Gary D Bader, Guri Giaever, Corey Nislow (2011)  Displaying chemical information on a biological network using Cytoscape.   Methods Mol Biol 781: 363-376  
Abstract: Cytoscape is an open-source software package that is widely used to integrate and visualize diverse data sets in biology. This chapter explains how to use Cytoscape to integrate open-source chemical information with a biological network. By visualizing information about known compound-target interactions in the context of a biological network of interest, one can rapidly identify novel avenues to perturb the system with compounds and, for example, potentially identify therapeutically relevant targets. Herein, two different protocols are explained in detail, with no prior knowledge of Cytoscape assumed, which demonstrate how to incorporate data from the ChEMBL database with either a gene-gene or a protein-protein interaction network. ChEMBL is a very large, open-source repository of compound-target information available from the European Molecular Biology Laboratory.
Notes:
Michael Klein, Montse Morillas, Alexandre Vendrell, Lars Brive, Marinella Gebbia, Iain M Wallace, Guri Giaever, Corey Nislow, Francesc Posas, Morten Grøtli (2011)  Design, synthesis and characterization of a highly effective inhibitor for analog-sensitive (as) kinases.   PLoS One 6: 6. 06  
Abstract: Highly selective, cell-permeable and fast-acting inhibitors of individual kinases are sought-after as tools for studying the cellular function of kinases in real time. A combination of small molecule synthesis and protein mutagenesis, identified a highly potent inhibitor (1-Isopropyl-3-(phenylethynyl)-1H-pyrazolo[3,4-d]pyrimidin-4-amine) of a rationally engineered Hog1 serine/threonine kinase (Hog1(T100G)). This inhibitor has been successfully used to study various aspects of Hog1 signaling, including a transient cell cycle arrest and gene expression changes mediated by Hog1 in response to stress. This study also underscores that the general applicability of this approach depends, in part, on the selectivity of the designed the inhibitor with respect to activity versus the engineered and wild type kinases. To explore this specificity in detail, we used a validated chemogenetic assay to assess the effect of this inhibitor on all gene products in yeast in parallel. The results from this screen emphasize the need for caution and for case-by-case assessment when using the Analog-Sensitive Kinase Allele technology to assess the physiological roles of kinases.
Notes:
Iain M Wallace, Malene L Urbanus, Genna M Luciani, Andrew R Burns, Mitchell K L Han, Hao Wang, Kriti Arora, Lawrence E Heisler, Michael Proctor, Robert P St Onge, Terry Roemer, Peter J Roy, Carolyn L Cummins, Gary D Bader, Corey Nislow, Guri Giaever (2011)  Compound prioritization methods increase rates of chemical probe discovery in model organisms.   Chem Biol 18: 10. 1273-1283 Oct  
Abstract: Preselection of compounds that are more likely to induce a phenotype can increase the efficiency and reduce the costs for model organism screening. To identify such molecules, we screened ∼81,000 compounds in Saccharomyces cerevisiae and identified ∼7500 that inhibit cell growth. Screening these growth-inhibitory molecules across a diverse panel of model organisms resulted in an increased phenotypic hit-rate. These data were used to build a model to predict compounds that inhibit yeast growth. Empirical and in silico application of the model enriched the discovery of bioactive compounds in diverse model organisms. To demonstrate the potential of these molecules as lead chemical probes, we used chemogenomic profiling in yeast and identified specific inhibitors of lanosterol synthase and of stearoyl-CoA 9-desaturase. As community resources, the ∼7500 growth-inhibitory molecules have been made commercially available and the computational model and filter used are provided.
Notes:
2010
Michael Costanzo, Anastasia Baryshnikova, Jeremy Bellay, Yungil Kim, Eric D Spear, Carolyn S Sevier, Huiming Ding, Judice L Y Koh, Kiana Toufighi, Sara Mostafavi, Jeany Prinz, Robert P St Onge, Benjamin VanderSluis, Taras Makhnevych, Franco J Vizeacoumar, Solmaz Alizadeh, Sondra Bahr, Renee L Brost, Yiqun Chen, Murat Cokol, Raamesh Deshpande, Zhijian Li, Zhen-Yuan Lin, Wendy Liang, Michaela Marback, Jadine Paw, Bryan-Joseph San Luis, Ermira Shuteriqi, Amy Hin Yan Tong, Nydia van Dyk, Iain M Wallace, Joseph A Whitney, Matthew T Weirauch, Guoqing Zhong, Hongwei Zhu, Walid A Houry, Michael Brudno, Sasan Ragibizadeh, Balázs Papp, Csaba Pál, Frederick P Roth, Guri Giaever, Corey Nislow, Olga G Troyanskaya, Howard Bussey, Gary D Bader, Anne-Claude Gingras, Quaid D Morris, Philip M Kim, Chris A Kaiser, Chad L Myers, Brenda J Andrews, Charles Boone (2010)  The genetic landscape of a cell.   Science 327: 5964. 425-431 Jan  
Abstract: A genome-scale genetic interaction map was constructed by examining 5.4 million gene-gene pairs for synthetic genetic interactions, generating quantitative genetic interaction profiles for approximately 75% of all genes in the budding yeast, Saccharomyces cerevisiae. A network based on genetic interaction profiles reveals a functional map of the cell in which genes of similar biological processes cluster together in coherent subsets, and highly correlated profiles delineate specific pathways to define gene function. The global network identifies functional cross-connections between all bioprocesses, mapping a cellular wiring diagram of pleiotropy. Genetic interaction degree correlated with a number of different gene attributes, which may be informative about genetic network hubs in other organisms. We also demonstrate that extensive and unbiased mapping of the genetic landscape provides a key for interpretation of chemical-genetic interactions and drug target identification.
Notes:
Andrew R Burns, Iain M Wallace, Jan Wildenhain, Mike Tyers, Guri Giaever, Gary D Bader, Corey Nislow, Sean R Cutler, Peter J Roy (2010)  A predictive model for drug bioaccumulation and bioactivity in Caenorhabditis elegans.   Nat Chem Biol 6: 7. 549-557 Jul  
Abstract: The resistance of Caenorhabditis elegans to pharmacological perturbation limits its use as a screening tool for novel small bioactive molecules. One strategy to improve the hit rate of small-molecule screens is to preselect molecules that have an increased likelihood of reaching their target in the worm. To learn which structures evade the worm's defenses, we performed the first survey of the accumulation and metabolism of over 1,000 commercially available drug-like small molecules in the worm. We discovered that fewer than 10% of these molecules accumulate to concentrations greater than 50% of that present in the worm's environment. Using our dataset, we developed a structure-based accumulation model that identifies compounds with an increased likelihood of bioavailability and bioactivity, and we describe structural features that facilitate small-molecule accumulation in the worm. Preselecting molecules that are more likely to reach a target by first applying our model to the tens of millions of commercially available compounds will undoubtedly increase the success of future small-molecule screens with C. elegans.
Notes:
Andrew M Smith, Lawrence E Heisler, Robert P St Onge, Eveline Farias-Hesson, Iain M Wallace, John Bodeau, Adam N Harris, Kathleen M Perry, Guri Giaever, Nader Pourmand, Corey Nislow (2010)  Highly-multiplexed barcode sequencing: an efficient method for parallel analysis of pooled samples.   Nucleic Acids Res 38: 13. Jul  
Abstract: Next-generation sequencing has proven an extremely effective technology for molecular counting applications where the number of sequence reads provides a digital readout for RNA-seq, ChIP-seq, Tn-seq and other applications. The extremely large number of sequence reads that can be obtained per run permits the analysis of increasingly complex samples. For lower complexity samples, however, a point of diminishing returns is reached when the number of counts per sequence results in oversampling with no increase in data quality. A solution to making next-generation sequencing as efficient and affordable as possible involves assaying multiple samples in a single run. Here, we report the successful 96-plexing of complex pools of DNA barcoded yeast mutants and show that such 'Bar-seq' assessment of these samples is comparable with data provided by barcode microarrays, the current benchmark for this application. The cost reduction and increased throughput permitted by highly multiplexed sequencing will greatly expand the scope of chemogenomics assays and, equally importantly, the approach is suitable for other sequence counting applications that could benefit from massive parallelization.
Notes:
2008
Gordon Blackshields, Mark Larkin, Iain M Wallace, Andreas Wilm, Desmond G Higgins (2008)  Fast embedding methods for clustering tens of thousands of sequences.   Comput Biol Chem 32: 4. 282-286 Aug  
Abstract: Most sequence clustering methods require a full distance matrix to be computed between all pairs of sequences. This requires computer memory and time proportional to N(2) for N sequences. For small N or say up to 10000 or so, this can be accomplished in reasonable times for sequences of moderate length. For very large N, however, this becomes increasingly prohibitive. In this paper, we have tested variations on a class of published embedding methods that have been designed for clustering large numbers of complex objects where the individual distance calculations are expensive. These methods involve embedding the sequences in a space where the similarities within a set of sequences can be closely approximated without having to compute all pair-wise distances. We show how this approach greatly reduces computation time and memory requirements for clustering large numbers of sequences and demonstrate the quality of the clusterings by benchmarking them as guide trees for multiple alignments. Source code is available on request from the authors.
Notes:
Shawn Hoon, Andrew M Smith, Iain M Wallace, Sundari Suresh, Molly Miranda, Eula Fung, Michael Proctor, Kevan M Shokat, Chao Zhang, Ronald W Davis, Guri Giaever, Robert P St Onge, Robert P StOnge, Corey Nislow (2008)  An integrated platform of genomic assays reveals small-molecule bioactivities.   Nat Chem Biol 4: 8. 498-506 Aug  
Abstract: Bioactive compounds are widely used to modulate protein function and can serve as important leads for drug development. Identifying the in vivo targets of these compounds remains a challenge. Using yeast, we integrated three genome-wide gene-dosage assays to measure the effect of small molecules in vivo. A single TAG microarray was used to resolve the fitness of strains derived from pools of (i) homozygous deletion mutants, (ii) heterozygous deletion mutants and (iii) genomic library transformants. We demonstrated, with eight diverse reference compounds, that integration of these three chemogenomic profiles improves the sensitivity and specificity of small-molecule target identification. We further dissected the mechanism of action of two protein phosphatase inhibitors and in the process developed a framework for the rational design of multidrug combinations to sensitize cells with specific genotypes more effectively. Finally, we applied this platform to 188 novel synthetic chemical compounds and identified both potential targets and structure-activity relationships.
Notes:
2007
Iain M Wallace, Desmond G Higgins (2007)  Supervised multivariate analysis of sequence groups to identify specificity determining residues.   BMC Bioinformatics 8: 04  
Abstract: Proteins that evolve from a common ancestor can change functionality over time, and it is important to be able identify residues that cause this change. In this paper we show how a supervised multivariate statistical method, Between Group Analysis (BGA), can be used to identify these residues from families of proteins with different substrate specifities using multiple sequence alignments.
Notes:
Sebastien Moretti, Fabrice Armougom, Iain M Wallace, Desmond G Higgins, Cornelius V Jongeneel, Cedric Notredame (2007)  The M-Coffee web server: a meta-method for computing multiple sequence alignments by combining alternative alignment methods.   Nucleic Acids Res 35: Web Server issue. W645-W648 Jul  
Abstract: The M-Coffee server is a web server that makes it possible to compute multiple sequence alignments (MSAs) by running several MSA methods and combining their output into one single model. This allows the user to simultaneously run all his methods of choice without having to arbitrarily choose one of them. The MSA is delivered along with a local estimation of its consistency with the individual MSAs it was derived from. The computation of the consensus multiple alignment is carried out using a special mode of the T-Coffee package [Notredame, Higgins and Heringa (T-Coffee: a novel method for fast and accurate multiple sequence alignment. J. Mol. Biol. 2000; 302: 205-217); Wallace, O'Sullivan, Higgins and Notredame (M-Coffee: combining multiple sequence alignment methods with T-Coffee. Nucleic Acids Res. 2006; 34: 1692-1699)] Given a set of sequences (DNA or proteins) in FASTA format, M-Coffee delivers a multiple alignment in the most common formats. M-Coffee is a freeware open source package distributed under a GPL license and it is available either as a standalone package or as a web service from www.tcoffee.org.
Notes:
M A Larkin, G Blackshields, N P Brown, R Chenna, P A McGettigan, H McWilliam, F Valentin, I M Wallace, A Wilm, R Lopez, J D Thompson, T J Gibson, D G Higgins (2007)  Clustal W and Clustal X version 2.0.   Bioinformatics 23: 21. 2947-2948 Nov  
Abstract: SUMMARY: The Clustal W and Clustal X multiple sequence alignment programs have been completely rewritten in C++. This will facilitate the further development of the alignment algorithms in the future and has allowed proper porting of the programs to the latest versions of Linux, Macintosh and Windows operating systems. AVAILABILITY: The programs can be run on-line from the EBI web server: http://www.ebi.ac.uk/tools/clustalw2. The source code and executables for Windows, Linux and Macintosh computers are available from the EBI ftp site ftp://ftp.ebi.ac.uk/pub/software/clustalw2/
Notes:
2006
Iain M Wallace, Orla O'Sullivan, Desmond G Higgins, Cedric Notredame (2006)  M-Coffee: combining multiple sequence alignment methods with T-Coffee.   Nucleic Acids Res 34: 6. 1692-1699 03  
Abstract: We introduce M-Coffee, a meta-method for assembling multiple sequence alignments (MSA) by combining the output of several individual methods into one single MSA. M-Coffee is an extension of T-Coffee and uses consistency to estimate a consensus alignment. We show that the procedure is robust to variations in the choice of constituent methods and reasonably tolerant to duplicate MSAs. We also show that performances can be improved by carefully selecting the constituent methods. M-Coffee outperforms all the individual methods on three major reference datasets: HOMSTRAD, Prefab and Balibase. We also show that on a case-by-case basis, M-Coffee is twice as likely to deliver the best alignment than any individual method. Given a collection of pre-computed MSAs, M-Coffee has similar CPU requirements to the original T-Coffee. M-Coffee is a freeware open-source package available from http://www.tcoffee.org/.
Notes:
Gordon Blackshields, Iain M Wallace, Mark Larkin, Desmond G Higgins (2006)  Analysis and comparison of benchmarks for multiple sequence alignment.   In Silico Biol 6: 4. 321-339  
Abstract: The most popular way of comparing the performance of multiple sequence alignment programs is to use empirical testing on sets of test sequences. Several such test sets now exist, each with potential strengths and weaknesses. We apply several different alignment packages to 6 benchmark datasets, and compare their relative performances. HOMSTRAD, a collection of alignments of homologous proteins, is regularly used as a benchmark for sequence alignment though it is not designed as such, and lacks annotation of reliable regions within the alignment. We introduce this annotation into HOMSTRAD using protein structural superposition. Results on each database show that method performance is dependent on the input sequences. Alignment benchmarks are regularly used in combination to measure performance across a spectrum of alignment problems. Through combining benchmarks, it is possible to detect whether a program has been over-optimised for a single dataset, or alignment problem type.
Notes:
2005
Iain M Wallace, Orla O'Sullivan, Desmond G Higgins (2005)  Evaluation of iterative alignment algorithms for multiple alignment.   Bioinformatics 21: 8. 1408-1414 Apr  
Abstract: Iteration has been used a number of times as an optimization method to produce multiple alignments, either alone or in combination with other methods. Iteration has a great advantage in that it is often very simple both in terms of coding the algorithms and the complexity of the time and memory requirements. In this paper, we systematically test several different iteration strategies by comparing the results on sets of alignment test cases.
Notes:
Iain M Wallace, Gordon Blackshields, Desmond G Higgins (2005)  Multiple sequence alignments.   Curr Opin Struct Biol 15: 3. 261-266 Jun  
Abstract: Multiple sequence alignments are very widely used in all areas of DNA and protein sequence analysis. The main methods that are still in use are based on 'progressive alignment' and date from the mid to late 1980s. Recently, some dramatic improvements have been made to the methodology with respect either to speed and capacity to deal with large numbers of sequences or to accuracy. There have also been some practical advances concerning how to combine three-dimensional structural information with primary sequences to give more accurate alignments, when structures are available.
Notes:
Powered by PublicationsList.org.