Polygenic Risk Score in complex diseases
Author
Other authors
Publication date
2018-09Abstract
Motivation: Plenty genome-wide datasets are produced from complex diseases by
traditional GWAS studies, but they are limited. A new approach has emerged in the
last decade, the Polygenic Risk Scores (PRS), to combine several SNP into a single
predictor to try to explain the complex genetic behind diseases like Asthma or Autism
Spectrum Disorders.
Results: Here we analyse genome-wide data from these two diseases a compute PRS
with three different approaches, PLINK’s method, a machine learning approach
(biglasso) and a targeted-based method using SFARI database. We find that this kind of
analysis are quite complex like the diseases they try to predict, and PRS only explain a
very low percentage of the variance of the disease. The validation analysis we
performed show us that the parameters used to compute the PRS have to be optimize
using bigger datasets. We also used a machine learning approach (XGBoost) to impute
the data in certain analysis.
Document Type
Master's final project
Document version
Supervisor/a: Juan R González
Director/a: M. Luz Calle
Language
English
Keywords
Genomes
Malalties congènites
Pages
18 p.
Note
Curs 2017-2018
This item appears in the following Collection(s)
Rights
Aquest document està subjecte a aquesta llicència Creative Commons
Except where otherwise noted, this item's license is described as http://creativecommons.org/licenses/by-nc-nd/3.0/es/