dc.contributor | Universitat de Vic - Universitat Central de Catalunya. Facultat de Ciències i Tecnologia | |
dc.contributor | Universitat de Vic - Universitat Central de Catalunya. Màster Universitari en Anàlisi de Dades Òmiques | |
dc.contributor.author | Roginski, Paul Luc Maxime | |
dc.date.accessioned | 2021-12-22T08:54:51Z | |
dc.date.available | 2021-12-22T08:54:51Z | |
dc.date.created | 2021-08 | |
dc.date.issued | 2021-08 | |
dc.identifier.uri | http://hdl.handle.net/10854/6877 | |
dc.description | Academic year 2020-2021 | es |
dc.description.abstract | Codons, as units of the genetic code, are subject to both nucleotide-level and protein-level constraints.
Although codon usage bias is now accepted to be influenced mainly by GC content, codon
frequencies in general may reflect a more subtle compromise between base composition and
selection at the protein level. To investigate the factors other than GC content that shape codon
frequencies, we compared the coding sequences (CDS) of 280 archaeal genomes and the S. cerevisiae genome
with randomized versions of the same sequences (same base composition and same length). Through dedicated
counts, we identified several patterns distinguishing CDS from random sequences in Archaea, some of which reflect
probable or evident protein-level constraints: in particular, the systematic enrichment of CDS in
negatively charged amino acids, and the strong constraint on codons with a T in the
second position, which, on the basis of hydrophobic cluster analysis, points to a folding
constraint. Together, these patterns constitute a coding profile that correctly
classifies about 99% of individual archaeal sequences as CDS or randomized CDS. In
S. cerevisiae, whose coding profile resembles that of Archaea with similar GC content,
phylostratigraphic methods allowed us to investigate the coding profile of CDS according to their
relative age. This analysis reveals that, unlike other genes, the youngest genes (only
found in S. cerevisiae) as a whole do not have a strong coding profile. This can be explained
by their short length relative to other genes. However, even when length is taken
into account, the youngest S. cerevisiae genes are clearly enriched in misclassified
sequences. This enrichment may reflect insufficient optimization at the protein level
by selection. | es |
dc.format | application/pdf | es |
dc.format.extent | 18 p. | es |
dc.language.iso | eng | es |
dc.rights | All rights reserved | es |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/4.0/deed.ca | es |
dc.subject.other | Nucleotides | es |
dc.subject.other | Amino acids | es |
dc.subject.other | Saccharomyces cerevisiae | es |
dc.subject.other | Proteins -- Research | es |
dc.subject.other | Coding region | es |
dc.title | Codon frequency is modulated by proteic selection, resulting in a coding profile in Archaea and Yeast | es |
dc.type | info:eu-repo/semantics/masterThesis | es |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | es |
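
The abstract describes comparing each CDS with a randomized version of the same base composition and length, then counting codon-level patterns such as codons with a T in the second position. The following is only a minimal sketch of that idea, not the thesis's actual pipeline: it assumes the randomization is a plain shuffle of nucleotides (the record does not state the procedure), and the function names and the toy sequence are hypothetical.

```python
import random
from collections import Counter
from typing import Optional


def randomize_cds(cds: str, seed: Optional[int] = None) -> str:
    """Shuffle the nucleotides of a sequence.

    Assumption: a nucleotide-level shuffle, which preserves base
    composition and length but destroys codon structure.
    """
    rng = random.Random(seed)
    bases = list(cds)
    rng.shuffle(bases)
    return "".join(bases)


def codon_counts(seq: str) -> Counter:
    """Count in-frame codons, ignoring a trailing incomplete codon."""
    usable = len(seq) - len(seq) % 3
    return Counter(seq[i:i + 3] for i in range(0, usable, 3))


def second_position_t_fraction(seq: str) -> float:
    """Fraction of in-frame codons whose second base is T."""
    counts = codon_counts(seq)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    with_t = sum(n for codon, n in counts.items() if codon[1] == "T")
    return with_t / total


# Hypothetical toy sequence, invented for illustration only.
toy_cds = "ATGGTTCTGATTGAAGACCTGTTTGAAGTTTAA"
shuffled = randomize_cds(toy_cds, seed=0)

print("CDS      :", second_position_t_fraction(toy_cds))
print("shuffled :", second_position_t_fraction(shuffled))
```

Repeating such a comparison over many sequences and several count types would give per-sequence features of the kind a CDS versus randomized-CDS classifier could use; the specific counts and the classifier used in the thesis are not given in this record.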