hosted by
publicationslist.org
    
Mario Cannataro
University "Magna Græcia" of Catanzaro,
Bioinformatics Laboratory,
Viale Europa (Località Germaneto),
88100 CATANZARO,
ITALY
cannataro@unicz.it

Journal articles

2007
 
PMID 
Mario Cannataro, Annalisa Barla, Roberto Flor, Giuseppe Jurman, Stefano Merler, Silvano Paoli, Giuseppe Tradigo, Pierangelo Veltri, Cesare Furlanello (2007)  A grid environment for high-throughput proteomics.   IEEE Trans Nanobioscience 6: 2. 117-123 Jun  
Abstract: We connect in a grid-enabled pipeline an ontology-based environment for proteomics spectra management with a machine learning platform for unbiased predictive analysis. We exploit two existing software platforms (MS-Analyzer and BioDCV), the emerging proteomics standards, and the middleware and computing resources of the EGEE Biomed VO grid infrastructure. In the setup, BioDCV is accessed by the MS-Analyzer workflow as a Web service, thus providing a complete grid environment for proteomics data analysis. Predictive classification studies on MALDI-TOF data based on this environment are presented.
Notes:
 
DOI   
PMID 
Mario Cannataro, Giovanni Cuda, Marco Gaspari, Sergio Greco, Giuseppe Tradigo, Pierangelo Veltri (2007)  The EIPeptiDi tool: enhancing peptide discovery in ICAT-based LC MS/MS experiments.   BMC Bioinformatics 8: 07  
Abstract: BACKGROUND: Isotope-coded affinity tags (ICAT) is a method for quantitative proteomics based on differential isotopic labeling, sample digestion and mass spectrometry (MS). The method allows the identification and relative quantification of proteins present in two samples and consists of the following phases. First, cysteine residues are either labeled using the ICAT Light or ICAT Heavy reagent (having identical chemical properties but different masses). Then, after whole sample digestion, the labeled peptides are captured selectively using the biotin tag contained in both ICAT reagents. Finally, the simplified peptide mixture is analyzed by nanoscale liquid chromatography-tandem mass spectrometry (LC-MS/MS). Nevertheless, the ICAT LC-MS/MS method still suffers from insufficient sample-to-sample reproducibility on peptide identification. In particular, the number and the type of peptides identified in different experiments can vary considerably and, thus, the statistical (comparative) analysis of sample sets is very challenging. Low information overlap at the peptide and, consequently, at the protein level, is very detrimental in situations where the number of samples to be analyzed is high. RESULTS: We designed a method for improving the data processing and peptide identification in sample sets subjected to ICAT labeling and LC-MS/MS analysis, based on cross validating MS/MS results. Such a method has been implemented in a tool, called EIPeptiDi, which boosts the ICAT data analysis software improving peptide identification throughout the input data set. Heavy/Light (H/L) pairs quantified but not identified by the MS/MS routine, are assigned to peptide sequences identified in other samples, by using similarity criteria based on chromatographic retention time and Heavy/Light mass attributes. EIPeptiDi significantly improves the number of identified peptides per sample, proving that the proposed method has a considerable impact on the protein identification process and, consequently, on the amount of potentially critical information in clinical studies. The EIPeptiDi tool is available at http://bioingegneria.unicz.it/~veltri/projects/eipeptidi/ with a demo data set. CONCLUSION: EIPeptiDi significantly increases the number of peptides identified and quantified in analyzed samples, thus reducing the number of unassigned H/L pairs and allowing a better comparative analysis of sample data sets.
Notes:
2005
 
PMID 
Mario Cannataro, Pietro Hiram Guzzi, Tommaso Mazza, Giuseppe Tradigo, Pierangelo Veltri (2005)  Using ontologies in PROTEUS for modeling proteomics data mining applications.   Stud Health Technol Inform 112: 17-26  
Abstract: Bioinformatics applications are often characterized by a combination of (pre) processing of raw data representing biological elements, (e.g. sequence alignment, structure prediction), and an high level data mining analysis. Developing such applications needs knowledge of both data mining and bioinformatics domains, that can be effectively achieved by combining ontology about the application domain and ontology about the approaches and processes to solve the given problem. In this paper we talk about using ontologies to model proteomics in silico experiments. In particular data mining of mass spectrometry proteomics data is considered.
Notes:
 
DOI   
PMID 
M Cannataro, G Cuda, P Veltri (2005)  Modeling and designing a proteomics application on PROTEUS.   Methods Inf Med 44: 2. 221-226  
Abstract: OBJECTIVES: Biomedical applications, such as analysis and management of mass spectrometry proteomics experiments, involve heterogeneous platforms and knowledge, massive data sets, and complex algorithms. Main requirements of such applications are semantic modeling of the experiments and data analysis, as well as high performance computational platforms. In this paper we propose a software platform allowing to model and execute biomedical applications on the Grid. METHODS: Computational Grids offer the required computational power, whereas ontologies and workflow help to face the heterogeneity of biomedical applications. In this paper we propose the use of domain ontologies and workflow techniques for modeling biomedical applications, whereas Grid middleware is responsible for high performance execution. As a case study, the modeling of a proteomics experiment is discussed. RESULTS: The main result is the design and first use of PROTEUS, a Grid-based problem-solving environment for biomedical and bioinformatics applications. CONCLUSION: To manage the complexity of biomedical experiments, ontologies help to model applications and to identify appropriate data and algorithms, workflow techniques allow to combine the elements of such applications in a systematic way. Finally, translation of workflow into execution plans allows the exploitation of the computational power of Grids. Along this direction, in this paper we present PROTEUS discussing a real case study in the proteomics domain.
Notes:
 
PMID 
Vincent Breton, Kevin Dean, Tony Solomonides, I Blanquer, V Hernandez, E Medico, N Maglaveras, S Benkner, G Lonsdale, S Lloyd, K Hassan, R McClatchey, S Miguet, J Montagnat, X Pennec, W De Neve, C De Wagter, G Heeren, L Maigne, K Nozaki, M Taillet, H Bilofsky, R Ziegler, M Hoffman, C Jones, M Cannataro, P Veltri, G Aloisio, S Fiore, M Mirto, I Chouvarda, V Koutkias, A Malousi, V Lopez, I Oliveira, J P Sanchez, F Martin-Sanchez, G De Moor, B Claerhout, J A M Herveg (2005)  The Healthgrid White Paper.   Stud Health Technol Inform 112: 249-321  
Abstract: Over the last four years, a community of researchers working on Grid and High Performance Computing technologies started discussing the barriers and opportunities that grid technologies must face and exploit for the development of health-related applications. This interest lead to the first Healthgrid conference, held in Lyon, France, on January 16th-17th, 2003, with the focus of creating increased awareness about the possibilities and advantages linked to the deployment of grid technologies in health, ultimately targeting the creation of a European/international grid infrastructure for health. The topics of this conference converged with the position of the eHealth division of the European Commission, whose mandate from the Lisbon Meeting was "To develop an intelligent environment that enables ubiquitous management of citizens' health status, and to assist health professionals in coping with some major challenges, risk management and the integration into clinical practice of advances in health knowledge." In this context "Health" involves not only clinical procedures but covers the whole range of information from molecular level (genetic and proteomic information) over cells and tissues, to the individual and finally the population level (social healthcare). Grid technology offers the opportunity to create a common working backbone for all different members of this large "health family" and will hopefully lead to an increased awareness and interoperability among disciplines. The first HealthGrid conference led to the creation of the Healthgrid association, a non-profit research association legally incorporated in France but formed from the broad community of European researchers and institutions sharing expertise in health grids. After the second Healthgrid conference, held in Clermont-Ferrand on January 29th-30th, 2004, the need for a "white paper" on the current status and prospective of health grids was raised. Over fifty experts from different areas of grid technologies, eHealth applications and the medical world were invited to contribute to the preparation of this document.
Notes:
2004
 
PMID 
Mario Cannataro, Antonio Congiusta, Andrea Pugliese, Domenico Talia, Paolo Trunfio (2004)  Distributed data mining on grids: services, tools, and applications.   IEEE Trans Syst Man Cybern B Cybern 34: 6. 2451-2465 Dec  
Abstract: Data mining algorithms are widely used today for the analysis of large corporate and scientific datasets stored in databases and data archives. Industry, science, and commerce fields often need to analyze very large datasets maintained over geographically distributed sites by using the computational power of distributed and parallel systems. The grid can play a significant role in providing an effective computational support for distributed knowledge discovery applications. For the development of data mining applications on grids we designed a system called Knowledge Grid. This paper describes the Knowledge Grid framework and presents the toolset provided by the Knowledge Grid for implementing distributed knowledge discovery. The paper discusses how to design and implement data mining applications by using the Knowledge Grid tools starting from searching grid resources, composing software and data components, and executing the resulting data mining process on a grid. Some performance results are also discussed.
Notes:
Powered by publicationslist.org.