Gualberto Asencio Cortés


Journal articles

2011	DOI G Asencio, J S Aguilar-Ruiz (2011) Predicting protein distance maps according to physicochemical properties Journal of Integrative Bioinformatics 8(3): 181. Abstract: The prediction of protein structures is a current issue of great significance in structural bioinformatics. More specifically, the prediction of the tertiary structure of a protein consists in determining its three-dimensional conformation based solely on its amino acid sequence. This study proposes a method in which protein fragments are assembled according to their physicochemical similarities, using information extracted from known protein structures. Many approaches cited in the literature use the physicochemical properties of amino acids, generally hydrophobicity, polarity and charge, to predict structure. In our method, implemented with parallel multithreading, we used a set of 30 physicochemical amino acid properties selected from the AAindex database. Several protein tertiary structure prediction methods produce a contact map. Our proposed method produces a distance map, which provides more information about the structure of a protein than a contact map. We performed several preliminary analysis of the protein physicochemical data distributions using 3D surfaces. Three main pattern types were found in 3D surfaces, thus it is possible to extract rules in order to predict distances between amino acids according to their physicochemical properties. We performed an experimental validation of our method using five non-homologous protein sets and we showed the generality of this method and its prediction quality using the amino acid properties considered. Finally, we included a study of the algorithm efficiency according to the number of most similar fragments considered and we notably improved the precision with the studied proteins sets. Notes:
Book chapters

2010	DOI A E Márquez, F Divina, J S Aguilar-Ruiz, G Asencio (2010) Alpha Helix Prediction Based on Evolutionary Computation In: Lecture Notes in Computer Science, Vol. 6282. pp. 358-367 Berlin, Germany: Springer Verlag Abstract: Multiple approaches have been developed in order to predict the protein secondary structure. In this paper, we propose an approach to such a problem based on evolutionary computation. The proposed approach considers various amino acids properties in order to predict the secondary structure of a protein. In particular, we will consider the hydrophobicity, the polarity and the charge of amino acids. In this study, we focus on predicting a particular kind of secondary structure: -helices. The results of our proposal will be a set of rules that will identify the beginning or an end of such a structure. Notes:
Conference papers

2011	DOI A E Márquez, F Divina, J S Aguilar-Ruiz, G Asencio (2011) An Evolutionary Approach for Protein Contact Map Prediction In: 9th European Conference on Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics, EvoBio 2011 Edited by:Clara Pizzuti and Marylyn D. Ritchie and Mario Giacobini. 101-110 Torino, Italy: Abstract: In this study, we present a residue-residue contact prediction approach based on evolutionary computation. Some amino acid properties are employed according to their importance in the folding process: hydrophobicity, polarity, charge and residue size. Our evolutionary algorithm provides a set of rules which determine different cases where two amino acids are in contact. A rule represents two windows of three amino acids. Each amino acid is characterized by these four properties. We also include a statistical study for the propensities of contacts between each pair of amino acids, according to their types, hydrophobicity and polarity. Different experiments were also performed to determine the best selection of properties for the structure prediction among the cited properties. Notes:
	DOI CE Santiesteban Toca, AE Marquez Chamorro, G Asencio, J S Aguilar-Ruiz (2011) A Decision Tree-Based Method for Protein Contact Map Prediction In: Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics - 9th European Conference, EvoBIO 2011 Edited by:Clara Pizzuti and Marylyn D. Ritchie and Mario Giacobini. 153-158 Torino, Italy: Abstract: In this paper, we focus on protein contact map prediction. We describe a method where contact maps are predicted using decision tree-based model. The algorithm includes the subsequence information between the couple of analyzed amino acids. In order to evaluate the method generalization capabilities, we carry out an experiment using 173 non-homologous proteins of known structures. Our results indicate that the method can assign protein contacts with an average accuracy of 0.34, superior to the 0.25 obtained by the FNETCSS method. This shows that our algorithm improves the accuracy with respect to the methods compared, especially with the increase of protein length. Notes:
	A E Márquez, F Divina, J S Aguilar-Ruiz, G Asencio (2011) A multi-objective genetic algorithm for the protein structure prediction In: 11th International Conference on Intelligent Systems Design and Applications (ISDA), Córdoba, Spain (accepted) Abstract: Notes:
	DOI A E Márquez, F Divina, J S Aguilar-Ruiz, G Asencio (2011) Residue-residue Contact Prediction based on Evolutionary Computation In: 5th International Conference on Practical Applications of Computational Biology & Bioinformatics Edited by:Miguel P. Rocha and Juan M. Corchado Rodriguez and Florentino Fdez-Riverola and Alfonso Valencia. 279-283 Salamanca, España: Abstract: In this study, a novel residue-residue contacts prediction approach based on evolutionary computation is presented. The prediction is based on four amino acids properties. In particular, we consider the hydrophobicity, the polarity, the charge and residues size. The prediction model consists of a set of rules that identifies contacts between amino acids. Notes:
	DOI G Asencio, J S Aguilar-Ruiz, A E Márquez (2011) A nearest neighbour-based approach for viral protein structure prediction In: 9th European Conference on Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics, EvoBio 2011 Edited by:Clara Pizzuti and Marylyn D. Ritchie and Mario Giacobini. 69-76 Torino, Italy: Abstract: Protein tertiary structure prediction consists of determining the three-dimensional conformation of a protein based solely on its amino acid sequence. This study proposes a method in which protein fragments are assembled according to their physicochemical similarities, using information extracted from known protein structures. Several existing protein tertiary structure prediction methods produce contact maps as their output. Our proposed method produces a distance map, which provides more information about the structure of a protein than a contact map. In addition, many existing approaches use the physicochemical properties of amino acids, generally hydrophobicity, polarity and charge, to predict structure. In our method, we used three different physicochemical properties of amino acids obtained from the literature. Using this method, we performed tertiary structure predictions on 63 viral capsid proteins with a maximum identity of 30% obtained from the Protein Data Bank. We achieved a precision of 0.75 with an 8-angstrom cut-off and a minimum sequence separation of 7 amino acids. Thus, for the studied proteins, our results provide a notable improvement over those of other methods. Notes:
	DOI G Asencio, J S Aguilar-Ruiz, A E Márquez (2011) Prediction of protein distance maps by assembling fragments according to physicochemical similarities In: 5th International Conference on Practical Applications of Computational Biology & Bioinformatics Edited by:Miguel P. Rocha and Juan M. Corchado Rodriguez and Florentino Fdez-Riverola and Alfonso Valencia. 271-277 Salamanca, España: Abstract: The prediction of protein structures is a current issue of great significance in structural bioinformatics. More specifically, the prediction of the tertiary structure of a protein consists of determining its three-dimensional conformation based solely on its amino acid sequence. This study proposes a method in which protein fragments are assembled according to their physicochemical similarities, using information extracted from known protein structures. Many approaches cited in the literature use the physicochemical properties of amino acids, generally hydrophobicity, polarity and charge, to predict structure. In our method, implemented with parallel multithreading, a set of 30 physicochemical amino acid properties selected from the AAindex database were used. Several protein tertiary structure prediction methods produce a contact map. Our proposed method produces a distance map, which provides more information about the structure of a protein than a contact map. The results of experiments with several non-homologous protein sets demonstrate the generality of this method and its prediction quality using the amino acid properties considered. Notes:
2010	G Asencio, J S Aguilar-Ruiz (2010) Importancia de las Propiedades Físico-Químicas de los Aminoácidos en la Predicción de Estructuras de Proteínas usando Vecinos más Cercanos In: XV Congreso Español sobre Tecnologías y Lógica Fuzzy, ISBN: 978-84-92944-02-6. pp. 459-464 Huelva, España: Abstract: La predicciÃ³n de estructuras de proteÃnas es actualmente un importante campo de investigaciÃ³n dentro de la bioinformÃ¡tica. En esta Ã¡rea, existen numerosos estudios realizados en los que se ha usado la informaciÃ³n de la separaciÃ³n entre los aminoÃ¡cidos de una cadena para predecir la estructura de las proteÃnas, utilizÃ¡ndose en otros trabajos ciertas propiedades fÃsico-quÃmicas de aminoÃ¡cidos. En este trabajo se han usado ambas informaciones y se ha estudiado cÃ³mo influyen en la predicciÃ³n de estructuras de proteÃnas empleando el algoritmo de vecinos mÃ¡s cercanos. Hemos comprobado que la informaciÃ³n proporcionada por las propiedades fÃsico-quÃmicas es de mayor interÃ©s que la separaciÃ³n, obteniÃ©ndose mejores tasas de acierto. Se han realizado cuatro experimentos en los que se ha usado como atributos, la separaciÃ³n entre aminoÃ¡cidos y un conjunto determinado de propiedades fÃsico-quÃmicas de los mismos y, como ejemplos, todas las subsecuencias posibles encontradas en un conjunto de mÃ¡s de 5000 proteÃnas reales. Finalmente se demuestra empÃricamente que la separaciÃ³n entre aminoÃ¡cidos, ampliamente usada en la literatura, puede ser reemplazada por propiedades fÃsico-quÃmicas de amino-Ã¡cidos, produciendo mejores predicciones. La tasa de acierto conseguida usando sÃ³lo la separaciÃ³n estÃ¡ en torno al 59%, ascendiendo este valor hasta el 79% al usarse un conjunto de propiedades fÃsico-quÃmicas de aminoÃ¡cidos. Notes:
2009	G Asencio, J S Aguilar-Ruiz (2009) Predicción de Estructuras de Proteínas mediante Vecinos más Cercanos usando Características Inherentes a los Aminoácidos In: II Workshop Español sobre Extracción y Validación de Conocimientos en Base de Datos Biomédicas pp. 1-10 Sevilla, España: Abstract: En este trabajo se ha estudiado cÂ´omo influye la separaciÂ´on entre aminoÂ´acidos de una proteÂ´Ä±na y ciertas caracterÂ´Ä±sticas inherentes a su naturaleza en la predicciÂ´on de estructuras de proteÂ´Ä±nas mediante un algoritmo de vecinos mÂ´as cercanos. En el proceso de predicciÂ´on se han generado todas las subsecuencias posibles de aminoÂ´acidos procedentes de un conjunto de proteÂ´Ä±nas reales. Se demuestra empÂ´Ä±ricamente que la separaciÂ´on entre aminoÂ´acidos produce peores predicciones que las propiedades naturales inherentes a los mismos. Esto plantea la hipÂ´otesis de que la informaciÂ´on que suministra dicha separaciÂ´on se encuentra implÂ´Ä±cita en la informaciÂ´on proporcionada por las propiedades fÂ´Ä±sico-quÂ´Ä±micas, ya que se han obtenido iguales resultados tanto en presencia como en ausencia del atributo separaciÂ´on. Notes:
2003	G Asencio, J C Riquelme, J S Aguilar-Ruiz, F Ferrer-Troyano (2003) Extendiendo la Semántica de los datos en Aprendizaje Supervisado In: X Conferencia de la Asociación Española para la Inteligencia Artificial (CAEPIA 2003) San Sebastián, España: Abstract: En este trabajo, centrado en el Ã¡rea del aprendizaje supervisado, pretendemos extender la informaciÃ³n que proporcionan los datos etiquetados. BasÃ¡ndonos en la tÃ©cnica de los vecinos mÃ¡s cercanos, se amplÃa la informaciÃ³n contenida en las etiquetas discretas de las instancias, fortaleciendo su semÃ¡ntica y perfeccionando una clasificaciÃ³n posterior. Para ello, por cada instancia del conjunto de entrenamiento se obtiene informaciÃ³n acerca de las distancias a las instancias mÃ¡s cercanas de clase contraria. Mediante un modelo geomÃ©trico se transforma cada etiqueta y es extendida y representada por un punto en â��n-1 (donde n es el nÃºmero de valores distintos de la clase discreta). Utilizando el conjunto de datos de UCI, hemos realizado dos experimentos para validar nuestra propuesta y verificar que no existe pÃ©rdida de informaciÃ³n y que es posible nuevas prestaciones al reemplazar una clase discreta por una continua. Notes:
Masters theses

2010	G Asencio (2010) Minería de datos aplicada a la predicción de estructuras de proteínas mediante vecinos más cercanos. Escuela Técnica Superior de Ingeniería Informática. Universidad de Sevilla: Abstract: La predicciÃ³n de la estructura de las proteÃnas consiste en determinar la estructura de las mismas Ãºnicamente a partir de su secuencia de aminoÃ¡cidos. En el presente trabajo se han estudiado las principales aproximaciones algorÃtmicas existentes para llevar a cabo la tarea de predicciÃ³n de estructuras de proteÃnas. Se ha aportado una soluciÃ³n para este problema de predicciÃ³n que utiliza el esquema de vecinos mÃ¡s cercanos junto a una selecciÃ³n de propiedades fÃsico-quÃmicas de los aminoÃ¡cidos. Se han realizado cuatro experimentos y se han analizado los resultados. Finalmente, se han indicado las lÃneas de trabajo abiertas y las conclusiones del trabajo realizado. Notes:

Journal articles

Book chapters

Conference papers

Masters theses