Subtopic Deep Dive

Polygenic Risk Scores
Research Guide

What is Polygenic Risk Scores?

Polygenic Risk Scores (PRS) aggregate effects of many genetic variants from GWAS summary statistics to predict individual disease risk.

PRS construction typically involves weighting SNPs by GWAS effect sizes and summing weighted allele counts. Validation assesses predictive performance across ancestries and traits. Over 100 papers since 2013 explore PRS, with foundational work by Dudbridge (2013, 1608 citations) on power and accuracy.

15
Curated Papers
3
Key Challenges

Why It Matters

PRS stratify disease risk for precision medicine, identifying high-risk individuals equivalent to monogenic mutation carriers (Khera et al., 2018, Nature Genetics, 2878 citations). They support population screening for schizophrenia (Ripke et al., 2014, 7954 citations), depression (Wray et al., 2018, 3174 citations), and Alzheimer's (Jansen et al., 2019, 2403 citations). Clinical utility aids risk assessment in diverse ancestries, enhancing epidemiological studies (Manolio, 2010, 1512 citations).

Key Research Challenges

Ancestry Transferability

PRS trained in European ancestries underperform in non-European groups due to linkage disequilibrium differences. FinnGen highlights isolated population insights but limits generalizability (Kurki et al., 2023, 3679 citations). Improving cross-ancestry PRS requires diverse GWAS.

Non-linear Modeling

Standard linear PRS miss gene-environment interactions and rare variant effects. Dudbridge (2013) notes predictive accuracy limits from linear assumptions (1608 citations). Advanced models incorporating non-linearity boost utility.

Clinical Utility Validation

Translating PRS to actionable risk needs longitudinal trials beyond GWAS validation. Khera et al. (2018) show high-risk equivalence but clinical endpoints remain sparse (2878 citations). Regulatory approval demands robust evidence.

Essential Papers

1.

Biological insights from 108 schizophrenia-associated genetic loci

Stephan Ripke, Benjamin M. Neale, Aiden Corvin et al. · 2014 · Nature · 8.0K citations

2.

Reading Mendelian randomisation studies: a guide, glossary, and checklist for clinicians

Neil M Davies, Michael V. Holmes, George Davey Smith · 2018 · BMJ · 4.5K citations

Mendelian randomisation uses genetic variation as a natural experiment to investigate the causal relations between potentially modifiable risk factors and health outcomes in observational data. As ...

3.

FinnGen provides genetic insights from a well-phenotyped isolated population

Mitja Kurki, Juha Karjalainen, Priit Palta et al. · 2023 · Nature · 3.7K citations

4.

Strengthening the Reporting of Observational Studies in Epidemiology Using Mendelian Randomization

Veronika Skrivankova, Rebecca C. Richmond, Benjamin Woolf et al. · 2021 · JAMA · 3.6K citations

STROBE-MR provides guidelines for reporting MR studies. Improved reporting of these studies could facilitate their evaluation by editors, peer reviewers, researchers, clinicians, and other readers,...

5.

Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression

Naomi R. Wray, Stephan Ripke, Manuel Mattheisen et al. · 2018 · Nature Genetics · 3.2K citations

6.

Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations

Amit V. Khera, Mark Chaffin, Krishna G. Aragam et al. · 2018 · Nature Genetics · 2.9K citations

7.

Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals

James J. Lee, Robbee Wedow, Aysu Okbay et al. · 2018 · Nature Genetics · 2.8K citations

Reading Guide

Foundational Papers

Start with Dudbridge (2013, 1608 citations) for PRS power theory, Ripke et al. (2014, 7954 citations) for schizophrenia application, and Manolio (2010, 1512 citations) for GWAS context.

Recent Advances

Study Khera et al. (2018, 2878 citations) for clinical utility, Jansen et al. (2019, 2403 citations) for Alzheimer's PRS, and Kurki et al. (2023, 3679 citations) for population-specific insights.

Core Methods

Core techniques: SNP pruning/clumping (Dudbridge, 2013), meta-GWAS aggregation (Ripke et al., 2014), and risk equivalence testing (Khera et al., 2018).

How PapersFlow Helps You Research Polygenic Risk Scores

Discover & Search

Research Agent uses searchPapers for 'polygenic risk scores schizophrenia' to retrieve Ripke et al. (2014), then citationGraph maps 7954 citing papers, and findSimilarPapers uncovers Khera et al. (2018) for clinical parallels.

Analyze & Verify

Analysis Agent applies readPaperContent on Dudbridge (2013) to extract PRS power formulas, verifyResponse with CoVe checks ancestry bias claims against FinnGen data (Kurki et al., 2023), and runPythonAnalysis simulates PRS accuracy via NumPy-weighted sums with GRADE grading for evidence strength.

Synthesize & Write

Synthesis Agent detects gaps in non-European PRS via contradiction flagging across Wray et al. (2018) and Jansen et al. (2019); Writing Agent uses latexEditText for PRS equation revisions, latexSyncCitations for Ripke et al. (2014), and latexCompile for publication-ready reports with exportMermaid for variant effect diagrams.

Use Cases

"Compute PRS predictive power from Dudbridge formulas using schizophrenia GWAS data"

Research Agent → searchPapers(Dudbridge 2013) → Analysis Agent → readPaperContent → runPythonAnalysis(pandas GWAS simulation, matplotlib R2 plot) → researcher gets accuracy curves and p-value stats.

"Draft LaTeX review on PRS for Alzheimer's with citations from Jansen 2019"

Research Agent → citationGraph(Jansen 2019) → Synthesis Agent → gap detection → Writing Agent → latexEditText(intro), latexSyncCitations(Ripke 2014), latexCompile → researcher gets compiled PDF manuscript.

"Find GitHub repos implementing PRS methods from Khera 2018"

Research Agent → searchPapers(Khera 2018) → Code Discovery → paperExtractUrls → paperFindGithubRepo → githubRepoInspect → researcher gets repo code, PRS scripts, and validation notebooks.

Automated Workflows

Deep Research workflow scans 50+ PRS papers via searchPapers on 'polygenic risk scores transferability', structures report with GRADE-graded sections on Ripke (2014) and Khera (2018). DeepScan applies 7-step CoVe chain to verify Dudbridge (2013) power claims against FinnGen (2023). Theorizer generates hypotheses on non-linear PRS from Wray (2018) depression loci.

Frequently Asked Questions

What defines a Polygenic Risk Score?

PRS sums weighted effects of thousands of GWAS-identified SNPs to estimate individual trait risk (Dudbridge, 2013).

What are core PRS construction methods?

Methods include LD clumping + p-value thresholding or Bayesian approaches like SBayesR; validated in Khera et al. (2018).

What are key PRS papers?

Ripke et al. (2014, 7954 citations) for schizophrenia loci; Khera et al. (2018, 2878 citations) for clinical equivalence; Dudbridge (2013, 1608 citations) for power assessment.

What open problems exist in PRS?

Challenges include ancestry bias, non-linear effects, and clinical translation; addressed in Kurki et al. (2023) and Jansen et al. (2019).

Research Genetic Associations and Epidemiology with AI

PapersFlow provides specialized AI tools for Biochemistry, Genetics and Molecular Biology researchers. Here are the most relevant for this topic:

See how researchers in Life Sciences use PapersFlow

Field-specific workflows, example queries, and use cases.

Life Sciences Guide

Start Researching Polygenic Risk Scores with AI

Search 474M+ papers, run AI-powered literature reviews, and write with integrated citations — all in one workspace.

See how PapersFlow works for Biochemistry, Genetics and Molecular Biology researchers