Subtopic Deep Dive
Lineage Tracing
Research Guide
What is Lineage Tracing?
Lineage tracing in single-cell and spatial transcriptomics reconstructs cell developmental trajectories using computational inference from transcriptomic data.
This subtopic employs tools like Slingshot for pseudotime and lineage inference (Street et al., 2018, 3077 citations). Scanpy provides scalable analysis pipelines supporting trajectory modeling (Wolf et al., 2018, 8472 citations). Current best practices emphasize robust pseudotime estimation amid technical noise (Luecken and Theis, 2019, 2145 citations).
Why It Matters
Lineage tracing reveals cell fate decisions in embryogenesis and cancer progression. Slingshot enables branching trajectory reconstruction from single-cell data, applied to neural development (Street et al., 2018). Scanpy workflows map dynamic gene expression along lineages in tissue regeneration studies (Wolf et al., 2018). Spatial methods like those in Satija et al. (2015) integrate positional data to validate inferred paths, impacting drug target discovery in heterogeneous tumors.
Key Research Challenges
Technical noise in trajectories
Dropout events distort pseudotime estimates in sparse scRNA-seq data (Luecken and Theis, 2019). Methods like Slingshot mitigate via minimum spanning trees but struggle with multi-branching (Street et al., 2018). Spatial data adds alignment errors (Satija et al., 2015).
Branching fate inference
Distinguishing convergence from branching requires advanced graph models (Street et al., 2018). Scanpy's diffusion pseudotime handles continuous states but needs validation (Wolf et al., 2018). Multimodal integration improves accuracy (Hao et al., 2023).
Scalability to large datasets
High-dimensional data overwhelms memory in tools like deepTools2 (Ramírez et al., 2016). Scanpy optimizes via approximate nearest neighbors (Wolf et al., 2018). Best practices recommend subsampling for initial exploration (Luecken and Theis, 2019).
Essential Papers
SCANPY: large-scale single-cell gene expression data analysis
F. Alexander Wolf, Philipp Angerer, Fabian J. Theis · 2018 · Genome biology · 8.5K citations
deepTools2: a next generation web server for deep-sequencing data analysis
Fidel Ramírez, Devon Ryan, Björn Grüning et al. · 2016 · Nucleic Acids Research · 8.4K citations
We present an update to our Galaxy-based web server for processing and visualizing deeply sequenced data. Its core tool set, deepTools, allows users to perform complete bioinformatic workflows rang...
Massively parallel digital transcriptional profiling of single cells
Grace Zheng, Jessica M. Terry, Phillip Belgrader et al. · 2017 · Nature Communications · 7.3K citations
Spatial reconstruction of single-cell gene expression data
Rahul Satija, Jeffrey A. Farrell, David Gennert et al. · 2015 · Nature Biotechnology · 7.2K citations
SCENIC: single-cell regulatory network inference and clustering
Sara Aibar, Carmen Bravo González‐Blas, Thomas Moerman et al. · 2017 · Nature Methods · 6.3K citations
Estimating the population abundance of tissue-infiltrating immune and stromal cell populations using gene expression
Étienne Becht, Nicolás A. Giraldo, Laetitia Lacroix et al. · 2016 · Genome biology · 3.8K citations
Dictionary learning for integrative, multimodal and scalable single-cell analysis
Yuhan Hao, Tim Stuart, Madeline H. Kowalski et al. · 2023 · Nature Biotechnology · 3.7K citations
Reading Guide
Foundational Papers
Start with Scanpy (Wolf et al., 2018) for scalable pipelines; Slingshot (Street et al., 2018) for lineage specifics; Hawrylycz et al. (2012) for early spatial transcriptomics context.
Recent Advances
Luecken and Theis (2019) for best practices; Hao et al. (2023) for multimodal advances; study alongside Scanpy implementations.
Core Methods
Pseudotime via diffusion maps (Scanpy); graph-based lineages (Slingshot); spatial reconstruction (Satija et al., 2015); regulatory networks (SCENIC, Aibar et al., 2017).
How PapersFlow Helps You Research Lineage Tracing
Discover & Search
Research Agent uses searchPapers('lineage tracing single-cell pseudotime') to retrieve Slingshot (Street et al., 2018), then citationGraph reveals downstream citations like Luecken and Theis (2019), and findSimilarPapers expands to spatial trajectory papers.
Analyze & Verify
Analysis Agent runs readPaperContent on Street et al. (2018) to extract Slingshot pseudocode, verifies trajectory math via verifyResponse (CoVe), and executes runPythonAnalysis with Scanpy code from Wolf et al. (2018) for GRADE-graded pseudotime validation on user datasets.
Synthesize & Write
Synthesis Agent detects gaps in branching inference across Slingshot and Scanpy papers, flags contradictions in pseudotime definitions, while Writing Agent uses latexEditText on trajectory manuscripts, latexSyncCitations for Street et al. (2018), and exportMermaid for lineage graphs.
Use Cases
"Reproduce Slingshot pseudotime on my scRNA-seq dataset"
Research Agent → searchPapers('Slingshot lineage') → Analysis Agent → runPythonAnalysis(Scanpy + Slingshot code from Street et al. 2018) → matplotlib plot of branching trajectories.
"Write LaTeX review of lineage tracing methods"
Synthesis Agent → gap detection (Scanpy vs Slingshot) → Writing Agent → latexEditText(draft) → latexSyncCitations(Street 2018, Wolf 2018) → latexCompile → PDF with lineage diagram.
"Find GitHub code for single-cell trajectory analysis"
Research Agent → paperExtractUrls(Street 2018) → Code Discovery → paperFindGithubRepo → githubRepoInspect(Scanpy repo) → verified Slingshot implementation.
Automated Workflows
Deep Research workflow scans 50+ papers via searchPapers('lineage tracing scRNA-seq'), structures report on Slingshot-Scanpy evolution (Street et al. 2018; Wolf et al. 2018). DeepScan applies 7-step analysis: readPaperContent → runPythonAnalysis(pseudotime) → CoVe verification on user data. Theorizer generates hypotheses on spatial lineage convergence using Satija et al. (2015).
Frequently Asked Questions
What defines lineage tracing in single-cell transcriptomics?
Lineage tracing reconstructs developmental paths via pseudotime and branching inference from scRNA-seq data, as in Slingshot (Street et al., 2018).
What are core methods for lineage tracing?
Slingshot uses minimum spanning trees for lineages (Street et al., 2018); Scanpy employs diffusion pseudotime (Wolf et al., 2018); Seurat integrates via dynamic mode decomposition.
What are key papers on single-cell lineage tracing?
Slingshot (Street et al., 2018, 3077 citations); Scanpy (Wolf et al., 2018, 8472 citations); best practices tutorial (Luecken and Theis, 2019, 2145 citations).
What open problems exist in lineage tracing?
Robust branching detection amid noise (Luecken and Theis, 2019); scalable spatial integration (Satija et al., 2015); multimodal data fusion (Hao et al., 2023).
Research Single-cell and spatial transcriptomics with AI
PapersFlow provides specialized AI tools for Biochemistry, Genetics and Molecular Biology researchers. Here are the most relevant for this topic:
AI Literature Review
Automate paper discovery and synthesis across 474M+ papers
Paper Summarizer
Get structured summaries of any paper in seconds
Deep Research Reports
Multi-source evidence synthesis with counter-evidence
See how researchers in Life Sciences use PapersFlow
Field-specific workflows, example queries, and use cases.
Start Researching Lineage Tracing with AI
Search 474M+ papers, run AI-powered literature reviews, and write with integrated citations — all in one workspace.
See how PapersFlow works for Biochemistry, Genetics and Molecular Biology researchers