HRAS physical feature analysis: Predicting protein activation through a random forest classifier
Visualitza/Obre
Autor/a
Altres autors/es
Data de publicació
2023-09-10Resum
Over the past decade, increased computational capabilities have enabled us to address biological questions using data-driven methods, particularly where traditional techniques have been limiting. We hypothesize these computer-based methods can be used to predict enzyme activation status. To verify this claim, we have selected a benchmark protein for study, Human HRAS, sourcing a comprehensive set of experimentally labelled structures from available databases. Seven physical, computationally inexpensive features were extracted from these structures at the amino acid alpha carbon level and aligned to the canonical sequence to convey their metrics locally. Subsequently, three-dimensional tensors were generated with them from the set of all possible combinations of the obtained features. A random forest model was then trained on t-SNE preprocessed tensors to look for the highest performing combination of features. Our results strongly suggest that activation status in Human HRAS, and probably other proteins, is mainly codified in the electrostatic and Van der Waals forces, with solvation forces playing a lesser role. These forces, when processed through machine learning models, offer substantial predictive capability. In contrast, methods based on the physical three-dimensional position of residues, such as coordinate-based data and pairwise Root Mean Standard Deviation, were not independently effective in distinguishing activation states.
Tipus de document
Treball fi de màster
Versió del document
Academic tutor: Jordi Villà i Freixa.
Llengua
Anglès
Paraules clau
Proteïnes -- Investigació
Pàgines
41 p.
Nota
Curs 2022-2023
Aquest element apareix en la col·lecció o col·leccions següent(s)
Drets
Aquest document està subjecte a aquesta llicència Creative Commons
Excepte que s'indiqui una altra cosa, la llicència de l'ítem es descriu com https://creativecommons.org/licenses/by-nc-nd/4.0/deed.ca