.: Marcos Gestal :.     -----
Principal
Trabajo
Publicaciones
Docencia
Enlaces
Blog
GaleriaFotos
Contacto


Visitas:

Datos de la Publicación:


Autores:
Carlos Fernandez-Lozano, Marcos Gestal, Humberto González-Díaz, Julián Dorado, Alejandro Pazos, Cristian Robert Munteanu
Título: Markov Mean Properties for Cell Death-Related Protein Classification
Revista: Journal of Theoretical Biology
ISSN: 0022-5193
Volumen: 349
Páginas: 12-21
Editorial: Elsevier
Fecha Publicación: Mayo 2014
Factor de Impacto: 2.303
SCIMago Journal Rank: 0.801
PubMed ID: 24491256
doi: 10.1016/j.jtbi.2014.01.033
Categorías WoS: Mathematical & Computational Biology - Cuartil: Q1 - Tercil: T1 - Posición 10 de 52
Biology - Cuartil: Q2 - Tercil: T1 - Posición 23 de 85

Abstract:

The cell death (CD) is a dynamic biological function involved in physiological and pathological processes. Due to the complexity of CD, there is a demand for fast theoretical methods that can help to find new CD molecular targets. The current work presents the first classification model to predict CD-related proteins based on Markov Mean Properties. These protein descriptors have been calculated with the MInD-Prot tool using the topological information of the amino acid contact networks of the 2423 protein chains, five atom physicochemical properties and the protein 3D regions. The Machine Learning algorithms from Weka were used to find the best classification model for CD-related protein chains using all 20 attributes. The most accurate algorithm to solve this problem was K*. After several feature subset methods, the best model found is based on only 11 variables and is characterized by the Area Under the Receiver Operating Characteristic Curve (AUROC) of 0.992 and the true positive rate (TP Rate) of 88.2% (validation set). 7409 protein chains labeled with “unknown function” in the PDB Databank were analyzed with the best model in order to predict the CD-related biological activity. Thus, several proteins have been predicted to have CD-related function in Homo sapiens: 3DRX–involved in virus-host interaction biological process, protein homooligomerization; 4DWF–involved in cell differentiation, chromatin modification, DNA damage response, protein stabilization; 1IUR–involved in ATP binding, chaperone binding; 1J7D–involved in DNA double-strand break processing, histone ubiquitination, nucleotide-binding oligomerization; 1UTU–linked with DNA repair, regulation of transcription; 3EEC–participating to the cellular membrane organization, egress of virus within host cell, class mediator resulting in cell cycle arrest, negative regulation of ubiquitin-protein ligase activity involved in mitotic cell cycle and apoptotic process. Other proteins from bacteria predicted as CD-related are 2G3V - a CAG pathogenicity island protein 13 from Helicobacter pylori, 4G5A - a hypothetical protein in Bacteroides thetaiotaomicron, 1YLK–involved in the nitrogen metabolism of Mycobacterium tuberculosis, and 1XSV - with possible DNA/RNA binding domains. The results demonstrated the possibility to predict CD-related proteins using molecular information encoded into the protein 3D structure. Thus, the current work demonstrated the possibility to predict new molecular targets involved in cell-death processes.

Download