Codon frequency is modulated by proteic selection, resulting in a coding profile in Archaea and Yeast
dc.contributor | Universitat de Vic - Universitat Central de Catalunya. Facultat de Ciències i Tecnologia | |
dc.contributor | Universitat de Vic - Universitat Central de Catalunya. Màster Universitari en Anàlisi de Dades Òmiques | |
dc.contributor.author | Roginski, Paul Luc Maxime | |
dc.date.accessioned | 2021-12-22T08:54:51Z | |
dc.date.available | 2021-12-22T08:54:51Z | |
dc.date.created | 2021-08 | |
dc.date.issued | 2021-08 | |
dc.identifier.uri | http://hdl.handle.net/10854/6877 | |
dc.description | Academic year 2020-2021 | es |
dc.description.abstract | Codons, as fragments of the genetic code, articulate both nucleotidic and proteic constraints. Although codon usage bias is now accepted to be mainly influenced by GC content, codon frequencies in general may reflect a more subtle compromise between base composition and selection at the proteic level. To investigate the factors other than GC content that shape codon frequencies, we compared the coding sequences (CDS) of 280 Archaea genomes plus S. cerevisiae to their randomized versions (same base composition and same length). Through dedicated counts, we identified several CDS vs. random patterns in Archaea, some of which reflect a probable or evident proteic constraint: in particular, the systematic enrichment of CDS in negatively charged amino acids, and the strong constraint on codons with a T in the second position, which, on the basis of hydrophobic cluster analysis, attests to a folding constraint. The sum of these patterns constitutes a coding profile that makes it possible to accurately classify about 99% of individual Archaea sequences as either CDS or randomized CDS. In S. cerevisiae, whose coding profile shares similarities with that of Archaea of close GC content, phylostratigraphic methods allowed us to investigate the coding profile of CDS according to their relative age. This analysis reveals that, contrary to other genes, the youngest genes (found only in S. cerevisiae) as a whole do not have a strong coding profile. This can be explained by their relative shortness in comparison with other genes. But even when length is taken into account, a clear enrichment of misclassified sequences appears among the youngest S. cerevisiae genes. This enrichment may reflect insufficient proteic optimization by selection. | es |
dc.format | application/pdf | es |
dc.format.extent | 18 p. | es |
dc.language.iso | eng | es |
dc.rights | All rights reserved | es |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/4.0/deed.ca | es |
dc.subject.other | Nucleotides | es |
dc.subject.other | Amino acids | es |
dc.subject.other | Saccharomyces cerevisiae | es |
dc.subject.other | Proteins -- Research | es |
dc.subject.other | Coding region | es |
dc.title | Codon frequency is modulated by proteic selection, resulting in a coding profile in Archaea and Yeast | es |
dc.type | info:eu-repo/semantics/masterThesis | es |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | es |