hosted by
publicationslist.org
    

Gualberto Asencio Cortés

Pablo de Olavide University. Building 44. Office 44.B.01.
guaasecor@upo.es
Welcome to my publications web page. I am a Msc. in Computer Science and PhD. Student at Pablo de Olavide University (Seville, Spain). I research in Bioinformatics and my investigation are focused in Protein Structure Prediction and Data Mining.

Journal articles

2011
G Asencio, J S Aguilar-Ruiz (2011)  Predicting protein distance maps according to physicochemical properties   Journal of Integrative Bioinformatics 8(3): 181.  
Abstract: The prediction of protein structures is a current issue of great significance in structural bioinformatics. More specifically, the prediction of the tertiary structure of a protein consists in determining its three-dimensional conformation based solely on its amino acid sequence. This study proposes a method in which protein fragments are assembled according to their physicochemical similarities, using information extracted from known protein structures. Many approaches cited in the literature use the physicochemical properties of amino acids, generally hydrophobicity, polarity and charge, to predict structure. In our method, implemented with parallel multithreading, we used a set of 30 physicochemical amino acid properties selected from the AAindex database. Several protein tertiary structure prediction methods produce a contact map. Our proposed method produces a distance map, which provides more information about the structure of a protein than a contact map. We performed several preliminary analysis of the protein physicochemical data distributions using 3D surfaces. Three main pattern types were found in 3D surfaces, thus it is possible to extract rules in order to predict distances between amino acids according to their physicochemical properties. We performed an experimental validation of our method using five non-homologous protein sets and we showed the generality of this method and its prediction quality using the amino acid properties considered. Finally, we included a study of the algorithm efficiency according to the number of most similar fragments considered and we notably improved the precision with the studied proteins sets.
Notes:

Book chapters

2010
A E Márquez, F Divina, J S Aguilar-Ruiz, G Asencio (2010)  Alpha Helix Prediction Based on Evolutionary Computation   In: Lecture Notes in Computer Science, Vol. 6282. pp. 358-367 Berlin, Germany: Springer Verlag  
Abstract: Multiple approaches have been developed in order to predict the protein secondary structure. In this paper, we propose an approach to such a problem based on evolutionary computation. The proposed approach considers various amino acids properties in order to predict the secondary structure of a protein. In particular, we will consider the hydrophobicity, the polarity and the charge of amino acids. In this study, we focus on predicting a particular kind of secondary structure: -helices. The results of our proposal will be a set of rules that will identify the beginning or an end of such a structure.
Notes:

Conference papers

2011
A E Márquez, F Divina, J S Aguilar-Ruiz, G Asencio (2011)  An Evolutionary Approach for Protein Contact Map Prediction   In: 9th European Conference on Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics, EvoBio 2011 Edited by:Clara Pizzuti and Marylyn D. Ritchie and Mario Giacobini. 101-110 Torino, Italy:  
Abstract: In this study, we present a residue-residue contact prediction approach based on evolutionary computation. Some amino acid properties are employed according to their importance in the folding process: hydrophobicity, polarity, charge and residue size. Our evolutionary algorithm provides a set of rules which determine different cases where two amino acids are in contact. A rule represents two windows of three amino acids. Each amino acid is characterized by these four properties. We also include a statistical study for the propensities of contacts between each pair of amino acids, according to their types, hydrophobicity and polarity. Different experiments were also performed to determine the best selection of properties for the structure prediction among the cited properties.
Notes:
CE Santiesteban Toca, AE Marquez Chamorro, G Asencio, J S Aguilar-Ruiz (2011)  A Decision Tree-Based Method for Protein Contact Map Prediction   In: Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics - 9th European Conference, EvoBIO 2011 Edited by:Clara Pizzuti and Marylyn D. Ritchie and Mario Giacobini. 153-158 Torino, Italy:  
Abstract: In this paper, we focus on protein contact map prediction. We describe a method where contact maps are predicted using decision tree-based model. The algorithm includes the subsequence information between the couple of analyzed amino acids. In order to evaluate the method generalization capabilities, we carry out an experiment using 173 non-homologous proteins of known structures. Our results indicate that the method can assign protein contacts with an average accuracy of 0.34, superior to the 0.25 obtained by the FNETCSS method. This shows that our algorithm improves the accuracy with respect to the methods compared, especially with the increase of protein length.
Notes:
A E Márquez, F Divina, J S Aguilar-Ruiz, G Asencio (2011)  Residue-residue Contact Prediction based on Evolutionary Computation   In: 5th International Conference on Practical Applications of Computational Biology & Bioinformatics Edited by:Miguel P. Rocha and Juan M. Corchado Rodriguez and Florentino Fdez-Riverola and Alfonso Valencia. 279-283 Salamanca, España:  
Abstract: In this study, a novel residue-residue contacts prediction approach based on evolutionary computation is presented. The prediction is based on four amino acids properties. In particular, we consider the hydrophobicity, the polarity, the charge and residues size. The prediction model consists of a set of rules that identifies contacts between amino acids.
Notes:
G Asencio, J S Aguilar-Ruiz, A E Márquez (2011)  A nearest neighbour-based approach for viral protein structure prediction   In: 9th European Conference on Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics, EvoBio 2011 Edited by:Clara Pizzuti and Marylyn D. Ritchie and Mario Giacobini. 69-76 Torino, Italy:  
Abstract: Protein tertiary structure prediction consists of determining the three-dimensional conformation of a protein based solely on its amino acid sequence. This study proposes a method in which protein fragments are assembled according to their physicochemical similarities, using information extracted from known protein structures. Several existing protein tertiary structure prediction methods produce contact maps as their output. Our proposed method produces a distance map, which provides more information about the structure of a protein than a contact map. In addition, many existing approaches use the physicochemical properties of amino acids, generally hydrophobicity, polarity and charge, to predict structure. In our method, we used three different physicochemical properties of amino acids obtained from the literature. Using this method, we performed tertiary structure predictions on 63 viral capsid proteins with a maximum identity of 30% obtained from the Protein Data Bank. We achieved a precision of 0.75 with an 8-angstrom cut-off and a minimum sequence separation of 7 amino acids. Thus, for the studied proteins, our results provide a notable improvement over those of other methods.
Notes:
G Asencio, J S Aguilar-Ruiz, A E Márquez (2011)  Prediction of protein distance maps by assembling fragments according to physicochemical similarities   In: 5th International Conference on Practical Applications of Computational Biology & Bioinformatics Edited by:Miguel P. Rocha and Juan M. Corchado Rodriguez and Florentino Fdez-Riverola and Alfonso Valencia. 271-277 Salamanca, España:  
Abstract: The prediction of protein structures is a current issue of great significance in structural bioinformatics. More specifically, the prediction of the tertiary structure of a protein consists of determining its three-dimensional conformation based solely on its amino acid sequence. This study proposes a method in which protein fragments are assembled according to their physicochemical similarities, using information extracted from known protein structures. Many approaches cited in the literature use the physicochemical properties of amino acids, generally hydrophobicity, polarity and charge, to predict structure. In our method, implemented with parallel multithreading, a set of 30 physicochemical amino acid properties selected from the AAindex database were used. Several protein tertiary structure prediction methods produce a contact map. Our proposed method produces a distance map, which provides more information about the structure of a protein than a contact map. The results of experiments with several non-homologous protein sets demonstrate the generality of this method and its prediction quality using the amino acid properties considered.
Notes:
2010
G Asencio, J S Aguilar-Ruiz (2010)  Importancia de las Propiedades Físico-Químicas de los Aminoácidos en la Predicción de Estructuras de Proteínas usando Vecinos más Cercanos   In: XV Congreso Español sobre Tecnologías y Lógica Fuzzy, ISBN: 978-84-92944-02-6. pp. 459-464 Huelva, España:  
Abstract: La predicción de estructuras de proteínas es actualmente un importante campo de investigación dentro de la bioinformática. En esta área, existen numerosos estudios realizados en los que se ha usado la información de la separación entre los aminoácidos de una cadena para predecir la estructura de las proteínas, utilizándose en otros trabajos ciertas propiedades físico-químicas de aminoácidos. En este trabajo se han usado ambas informaciones y se ha estudiado cómo influyen en la predicción de estructuras de proteínas empleando el algoritmo de vecinos más cercanos. Hemos comprobado que la información proporcionada por las propiedades físico-químicas es de mayor interés que la separación, obteniéndose mejores tasas de acierto. Se han realizado cuatro experimentos en los que se ha usado como atributos, la separación entre aminoácidos y un conjunto determinado de propiedades físico-químicas de los mismos y, como ejemplos, todas las subsecuencias posibles encontradas en un conjunto de más de 5000 proteínas reales. Finalmente se demuestra empíricamente que la separación entre aminoácidos, ampliamente usada en la literatura, puede ser reemplazada por propiedades físico-químicas de amino-ácidos, produciendo mejores predicciones. La tasa de acierto conseguida usando sólo la separación está en torno al 59%, ascendiendo este valor hasta el 79% al usarse un conjunto de propiedades físico-químicas de aminoácidos.
Notes:
2009
G Asencio, J S Aguilar-Ruiz (2009)  Predicción de Estructuras de Proteínas mediante Vecinos más Cercanos usando Características Inherentes a los Aminoácidos   In: II Workshop Español sobre Extracción y Validación de Conocimientos en Base de Datos Biomédicas pp. 1-10 Sevilla, España:  
Abstract: En este trabajo se ha estudiado c´omo influye la separaci´on entre amino´acidos de una prote´ına y ciertas caracter´ısticas inherentes a su naturaleza en la predicci´on de estructuras de prote´ınas mediante un algoritmo de vecinos m´as cercanos. En el proceso de predicci´on se han generado todas las subsecuencias posibles de amino´acidos procedentes de un conjunto de prote´ınas reales. Se demuestra emp´ıricamente que la separaci´on entre amino´acidos produce peores predicciones que las propiedades naturales inherentes a los mismos. Esto plantea la hip´otesis de que la informaci´on que suministra dicha separaci´on se encuentra impl´ıcita en la informaci´on proporcionada por las propiedades f´ısico-qu´ımicas, ya que se han obtenido iguales resultados tanto en presencia como en ausencia del atributo separaci´on.
Notes:
2003
G Asencio, J C Riquelme, J S Aguilar-Ruiz, F Ferrer-Troyano (2003)  Extendiendo la Semántica de los datos en Aprendizaje Supervisado   In: X Conferencia de la Asociación Española para la Inteligencia Artificial (CAEPIA 2003) San Sebastián, España:  
Abstract: En este trabajo, centrado en el área del aprendizaje supervisado, pretendemos extender la información que proporcionan los datos etiquetados. Basándonos en la técnica de los vecinos más cercanos, se amplía la información contenida en las etiquetas discretas de las instancias, fortaleciendo su semántica y perfeccionando una clasificación posterior. Para ello, por cada instancia del conjunto de entrenamiento se obtiene información acerca de las distancias a las instancias más cercanas de clase contraria. Mediante un modelo geométrico se transforma cada etiqueta y es extendida y representada por un punto en ℜn-1 (donde n es el número de valores distintos de la clase discreta). Utilizando el conjunto de datos de UCI, hemos realizado dos experimentos para validar nuestra propuesta y verificar que no existe pérdida de información y que es posible nuevas prestaciones al reemplazar una clase discreta por una continua.
Notes:

Masters theses

2010
G Asencio (2010)  Minería de datos aplicada a la predicción de estructuras de proteínas mediante vecinos más cercanos.   Escuela Técnica Superior de Ingeniería Informática. Universidad de Sevilla:  
Abstract: La predicción de la estructura de las proteínas consiste en determinar la estructura de las mismas únicamente a partir de su secuencia de aminoácidos. En el presente trabajo se han estudiado las principales aproximaciones algorítmicas existentes para llevar a cabo la tarea de predicción de estructuras de proteínas. Se ha aportado una solución para este problema de predicción que utiliza el esquema de vecinos más cercanos junto a una selección de propiedades físico-químicas de los aminoácidos. Se han realizado cuatro experimentos y se han analizado los resultados. Finalmente, se han indicado las líneas de trabajo abiertas y las conclusiones del trabajo realizado.
Notes:
Powered by PublicationsList.org.