Subtopic Deep Dive

Breast Cancer Gene Expression Profiling
Research Guide

What is Breast Cancer Gene Expression Profiling?

Breast Cancer Gene Expression Profiling uses microarray and RNA-seq data to classify tumor subtypes and predict clinical outcomes in breast cancer patients.

This subtopic analyzes gene expression patterns from cDNA microarrays and RNA-seq to identify subtypes like luminal A, luminal B, HER2-enriched, and basal-like (Sørlie et al., 2001; 10,815 citations). Key studies developed prognostic signatures such as the 70-gene signature (van ‘t Veer et al., 2002; 9,540 citations) and Oncotype DX recurrence score (Paik et al., 2004; 6,240 citations). Over 50 high-impact papers since 2001 establish molecular classifications linked to treatment response.

15
Curated Papers
3
Key Challenges

Why It Matters

Gene expression profiling enables precision medicine by stratifying patients into subtypes with targeted therapies, such as endocrine treatment for luminal cancers (Sørlie et al., 2001). Oncotype DX predicts recurrence risk in node-negative, ER-positive cases, guiding chemotherapy decisions and reducing overtreatment (Paik et al., 2004). Subtype signatures correlate with survival, influencing guidelines like St Gallen consensus (Goldhirsch et al., 2011). Tools like Győrffy’s survival analyzer assess 22,277 genes across 1,809 patients (Győrffy et al., 2009).

Key Research Challenges

Subtype Classification Consistency

Replicating breast tumor subtypes across independent datasets remains challenging due to technical variations in microarrays (Sørlie et al., 2003). Studies refined hierarchical clustering but noted batch effects (5,376 citations). Validation requires large cohorts to confirm clinical relevance.

Prognostic Signature Robustness

Signatures like the 70-gene profile outperform clinical predictors but face overfitting risks in young patients (van de Vijver et al., 2002; 6,482 citations). External validation is essential yet limited by cohort diversity. Integration with IHC markers adds complexity (Hammond et al., 2010).

Triple-Negative Profiling Gaps

Triple-negative breast cancer lacks targeted therapies, with gene expression revealing aggressive patterns but poor recurrence prediction (Dent et al., 2007; 4,976 citations). Microarray data struggles with heterogeneity. Linking profiles to novel vulnerabilities requires multi-omics integration.

Essential Papers

1.

Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications

Thérese Sørlie, Charles M. Perou, Robert Tibshirani et al. · 2001 · Proceedings of the National Academy of Sciences · 10.8K citations

The purpose of this study was to classify breast carcinomas based on variations in gene expression patterns derived from cDNA microarrays and to correlate tumor characteristics to clinical outcome....

2.

Gene expression profiling predicts clinical outcome of breast cancer

Laura van ‘t Veer, Hongyue Dai, Marc J. van de Vijver et al. · 2002 · Nature · 9.5K citations

3.

A Gene-Expression Signature as a Predictor of Survival in Breast Cancer

Marc J. van de Vijver, Yudong D. He, Laura van ‘t Veer et al. · 2002 · New England Journal of Medicine · 6.5K citations

The gene-expression profile we studied is a more powerful predictor of the outcome of disease in young patients with breast cancer than standard systems based on clinical and histologic criteria.

4.

A Multigene Assay to Predict Recurrence of Tamoxifen-Treated, Node-Negative Breast Cancer

Soonmyung Paik, Steven Shak, Gong Tang et al. · 2004 · New England Journal of Medicine · 6.2K citations

The recurrence score has been validated as quantifying the likelihood of distant recurrence in tamoxifen-treated patients with node-negative, estrogen-receptor-positive breast cancer.

5.

Repeated observation of breast tumor subtypes in independent gene expression data sets

Thérese Sørlie, Robert Tibshirani, Joel S. Parker et al. · 2003 · Proceedings of the National Academy of Sciences · 5.4K citations

Characteristic patterns of gene expression measured by DNA microarrays have been used to classify tumors into clinically relevant subgroups. In this study, we have refined the previously defined su...

6.

Triple-Negative Breast Cancer: Clinical Features and Patterns of Recurrence

Rebecca Dent, Maureen Trudeau, Kathleen I. Pritchard et al. · 2007 · Clinical Cancer Research · 5.0K citations

Abstract Purpose: To compare the clinical features, natural history, and outcomes for women with “triple-negative” breast cancer with women with other types of breast cancer. Experimental Design: W...

7.

Breast Cancer Treatment

Adrienne G. Waks, Eric P. Winer · 2019 · JAMA · 4.6K citations

Breast cancer consists of 3 major tumor subtypes categorized according to estrogen or progesterone receptor expression and ERBB2 gene amplification. The 3 subtypes have distinct risk profiles and t...

Reading Guide

Foundational Papers

Start with Sørlie et al. (2001) for subtype discovery via microarrays, then van ‘t Veer (2002) and van de Vijver (2002) for 70-gene prognosis, Paik (2004) for Oncotype DX validation.

Recent Advances

Győrffy et al. (2009) for survival analysis tools; Waks and Winer (2019) for subtype treatment strategies; Goldhirsch et al. (2011) for consensus guidelines.

Core Methods

Hierarchical clustering and PAM for subtypes (Sørlie et al., 2001, 2003); supervised learning for signatures (van ‘t Veer et al., 2002); recurrence score from RT-PCR (Paik et al., 2004).

How PapersFlow Helps You Research Breast Cancer Gene Expression Profiling

Discover & Search

Research Agent uses searchPapers and citationGraph to map foundational works like Sørlie et al. (2001) with 10,815 citations, revealing subtype classification clusters. exaSearch finds RNA-seq extensions, while findSimilarPapers links van ‘t Veer (2002) to Paik (2004) for prognostic signatures.

Analyze & Verify

Analysis Agent applies readPaperContent to extract microarray methods from Sørlie (2001), then runPythonAnalysis with pandas/NumPy to reanalyze expression data for subtype clustering. verifyResponse (CoVe) and GRADE grading confirm signature performance against van de Vijver (2002), with statistical verification of survival predictions.

Synthesize & Write

Synthesis Agent detects gaps in triple-negative profiling (Dent et al., 2007) and flags contradictions between signatures. Writing Agent uses latexEditText, latexSyncCitations for Oncotype DX reviews, and latexCompile for subtype diagrams via exportMermaid.

Use Cases

"Reanalyze gene expression data from Sørlie 2001 for modern subtype validation"

Research Agent → searchPapers('Sørlie 2001') → Analysis Agent → readPaperContent + runPythonAnalysis (pandas clustering on GEO data) → hierarchical cluster plot and p-values.

"Draft a review on Oncotype DX vs 70-gene signature with citations"

Synthesis Agent → gap detection (Paik 2004, van ‘t Veer 2002) → Writing Agent → latexEditText + latexSyncCitations + latexCompile → LaTeX PDF with formatted comparison table.

"Find GitHub repos with breast cancer RNA-seq analysis code"

Research Agent → citationGraph(Paik 2004) → Code Discovery → paperExtractUrls → paperFindGithubRepo → githubRepoInspect → R scripts for recurrence score computation.

Automated Workflows

Deep Research workflow conducts systematic reviews of 50+ profiling papers, chaining searchPapers → citationGraph → DeepScan for 7-step verification of subtype consensus (Sørlie 2001-2003). Theorizer generates hypotheses on luminal-basal transitions from expression patterns. DeepScan applies CoVe checkpoints to validate Oncotype DX against van ‘t Veer signatures.

Frequently Asked Questions

What is breast cancer gene expression profiling?

It classifies tumors using microarray/RNA-seq data into subtypes like luminal and basal-like, correlating patterns to outcomes (Sørlie et al., 2001).

What are key methods?

cDNA microarrays with hierarchical clustering (Sørlie et al., 2001), supervised signatures like 70-gene (van ‘t Veer et al., 2002), and multigene assays (Paik et al., 2004).

What are seminal papers?

Sørlie et al. (2001; 10,815 citations) defined subtypes; van ‘t Veer (2002; 9,540 citations) and Paik (2004; 6,240 citations) developed prognostic tools.

What open problems exist?

Robustness across datasets, triple-negative predictions, and multi-omics integration challenge current signatures (Dent et al., 2007; Sørlie et al., 2003).

Research Breast Cancer Treatment Studies with AI

PapersFlow provides specialized AI tools for your field researchers. Here are the most relevant for this topic:

Start Researching Breast Cancer Gene Expression Profiling with AI

Search 474M+ papers, run AI-powered literature reviews, and write with integrated citations — all in one workspace.