Subtopic Deep Dive
Breast Cancer Gene Expression Profiling
Research Guide
What is Breast Cancer Gene Expression Profiling?
Breast Cancer Gene Expression Profiling uses microarray and RNA-seq data to classify tumor subtypes and predict clinical outcomes in breast cancer patients.
This subtopic analyzes gene expression patterns from cDNA microarrays and RNA-seq to identify subtypes like luminal A, luminal B, HER2-enriched, and basal-like (Sørlie et al., 2001; 10,815 citations). Key studies developed prognostic signatures such as the 70-gene signature (van ‘t Veer et al., 2002; 9,540 citations) and Oncotype DX recurrence score (Paik et al., 2004; 6,240 citations). Over 50 high-impact papers since 2001 establish molecular classifications linked to treatment response.
Why It Matters
Gene expression profiling enables precision medicine by stratifying patients into subtypes with targeted therapies, such as endocrine treatment for luminal cancers (Sørlie et al., 2001). Oncotype DX predicts recurrence risk in node-negative, ER-positive cases, guiding chemotherapy decisions and reducing overtreatment (Paik et al., 2004). Subtype signatures correlate with survival, influencing guidelines like St Gallen consensus (Goldhirsch et al., 2011). Tools like Győrffy’s survival analyzer assess 22,277 genes across 1,809 patients (Győrffy et al., 2009).
Key Research Challenges
Subtype Classification Consistency
Replicating breast tumor subtypes across independent datasets remains challenging due to technical variations in microarrays (Sørlie et al., 2003). Studies refined hierarchical clustering but noted batch effects (5,376 citations). Validation requires large cohorts to confirm clinical relevance.
Prognostic Signature Robustness
Signatures like the 70-gene profile outperform clinical predictors but face overfitting risks in young patients (van de Vijver et al., 2002; 6,482 citations). External validation is essential yet limited by cohort diversity. Integration with IHC markers adds complexity (Hammond et al., 2010).
Triple-Negative Profiling Gaps
Triple-negative breast cancer lacks targeted therapies, with gene expression revealing aggressive patterns but poor recurrence prediction (Dent et al., 2007; 4,976 citations). Microarray data struggles with heterogeneity. Linking profiles to novel vulnerabilities requires multi-omics integration.
Essential Papers
Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications
Thérese Sørlie, Charles M. Perou, Robert Tibshirani et al. · 2001 · Proceedings of the National Academy of Sciences · 10.8K citations
The purpose of this study was to classify breast carcinomas based on variations in gene expression patterns derived from cDNA microarrays and to correlate tumor characteristics to clinical outcome....
Gene expression profiling predicts clinical outcome of breast cancer
Laura van ‘t Veer, Hongyue Dai, Marc J. van de Vijver et al. · 2002 · Nature · 9.5K citations
A Gene-Expression Signature as a Predictor of Survival in Breast Cancer
Marc J. van de Vijver, Yudong D. He, Laura van ‘t Veer et al. · 2002 · New England Journal of Medicine · 6.5K citations
The gene-expression profile we studied is a more powerful predictor of the outcome of disease in young patients with breast cancer than standard systems based on clinical and histologic criteria.
A Multigene Assay to Predict Recurrence of Tamoxifen-Treated, Node-Negative Breast Cancer
Soonmyung Paik, Steven Shak, Gong Tang et al. · 2004 · New England Journal of Medicine · 6.2K citations
The recurrence score has been validated as quantifying the likelihood of distant recurrence in tamoxifen-treated patients with node-negative, estrogen-receptor-positive breast cancer.
Repeated observation of breast tumor subtypes in independent gene expression data sets
Thérese Sørlie, Robert Tibshirani, Joel S. Parker et al. · 2003 · Proceedings of the National Academy of Sciences · 5.4K citations
Characteristic patterns of gene expression measured by DNA microarrays have been used to classify tumors into clinically relevant subgroups. In this study, we have refined the previously defined su...
Triple-Negative Breast Cancer: Clinical Features and Patterns of Recurrence
Rebecca Dent, Maureen Trudeau, Kathleen I. Pritchard et al. · 2007 · Clinical Cancer Research · 5.0K citations
Abstract Purpose: To compare the clinical features, natural history, and outcomes for women with “triple-negative” breast cancer with women with other types of breast cancer. Experimental Design: W...
Breast Cancer Treatment
Adrienne G. Waks, Eric P. Winer · 2019 · JAMA · 4.6K citations
Breast cancer consists of 3 major tumor subtypes categorized according to estrogen or progesterone receptor expression and ERBB2 gene amplification. The 3 subtypes have distinct risk profiles and t...
Reading Guide
Foundational Papers
Start with Sørlie et al. (2001) for subtype discovery via microarrays, then van ‘t Veer (2002) and van de Vijver (2002) for 70-gene prognosis, Paik (2004) for Oncotype DX validation.
Recent Advances
Győrffy et al. (2009) for survival analysis tools; Waks and Winer (2019) for subtype treatment strategies; Goldhirsch et al. (2011) for consensus guidelines.
Core Methods
Hierarchical clustering and PAM for subtypes (Sørlie et al., 2001, 2003); supervised learning for signatures (van ‘t Veer et al., 2002); recurrence score from RT-PCR (Paik et al., 2004).
How PapersFlow Helps You Research Breast Cancer Gene Expression Profiling
Discover & Search
Research Agent uses searchPapers and citationGraph to map foundational works like Sørlie et al. (2001) with 10,815 citations, revealing subtype classification clusters. exaSearch finds RNA-seq extensions, while findSimilarPapers links van ‘t Veer (2002) to Paik (2004) for prognostic signatures.
Analyze & Verify
Analysis Agent applies readPaperContent to extract microarray methods from Sørlie (2001), then runPythonAnalysis with pandas/NumPy to reanalyze expression data for subtype clustering. verifyResponse (CoVe) and GRADE grading confirm signature performance against van de Vijver (2002), with statistical verification of survival predictions.
Synthesize & Write
Synthesis Agent detects gaps in triple-negative profiling (Dent et al., 2007) and flags contradictions between signatures. Writing Agent uses latexEditText, latexSyncCitations for Oncotype DX reviews, and latexCompile for subtype diagrams via exportMermaid.
Use Cases
"Reanalyze gene expression data from Sørlie 2001 for modern subtype validation"
Research Agent → searchPapers('Sørlie 2001') → Analysis Agent → readPaperContent + runPythonAnalysis (pandas clustering on GEO data) → hierarchical cluster plot and p-values.
"Draft a review on Oncotype DX vs 70-gene signature with citations"
Synthesis Agent → gap detection (Paik 2004, van ‘t Veer 2002) → Writing Agent → latexEditText + latexSyncCitations + latexCompile → LaTeX PDF with formatted comparison table.
"Find GitHub repos with breast cancer RNA-seq analysis code"
Research Agent → citationGraph(Paik 2004) → Code Discovery → paperExtractUrls → paperFindGithubRepo → githubRepoInspect → R scripts for recurrence score computation.
Automated Workflows
Deep Research workflow conducts systematic reviews of 50+ profiling papers, chaining searchPapers → citationGraph → DeepScan for 7-step verification of subtype consensus (Sørlie 2001-2003). Theorizer generates hypotheses on luminal-basal transitions from expression patterns. DeepScan applies CoVe checkpoints to validate Oncotype DX against van ‘t Veer signatures.
Frequently Asked Questions
What is breast cancer gene expression profiling?
It classifies tumors using microarray/RNA-seq data into subtypes like luminal and basal-like, correlating patterns to outcomes (Sørlie et al., 2001).
What are key methods?
cDNA microarrays with hierarchical clustering (Sørlie et al., 2001), supervised signatures like 70-gene (van ‘t Veer et al., 2002), and multigene assays (Paik et al., 2004).
What are seminal papers?
Sørlie et al. (2001; 10,815 citations) defined subtypes; van ‘t Veer (2002; 9,540 citations) and Paik (2004; 6,240 citations) developed prognostic tools.
What open problems exist?
Robustness across datasets, triple-negative predictions, and multi-omics integration challenge current signatures (Dent et al., 2007; Sørlie et al., 2003).
Research Breast Cancer Treatment Studies with AI
PapersFlow provides specialized AI tools for your field researchers. Here are the most relevant for this topic:
AI Literature Review
Automate paper discovery and synthesis across 474M+ papers
Deep Research Reports
Multi-source evidence synthesis with counter-evidence
Paper Summarizer
Get structured summaries of any paper in seconds
AI Academic Writing
Write research papers with AI assistance and LaTeX support
Start Researching Breast Cancer Gene Expression Profiling with AI
Search 474M+ papers, run AI-powered literature reviews, and write with integrated citations — all in one workspace.
Part of the Breast Cancer Treatment Studies Research Guide