Subtopic Deep Dive

Lineage Tracing
Research Guide

What is Lineage Tracing?

Lineage tracing in single-cell and spatial transcriptomics reconstructs cell developmental trajectories using computational inference from transcriptomic data.

This subtopic employs tools like Slingshot for pseudotime and lineage inference (Street et al., 2018, 3077 citations). Scanpy provides scalable analysis pipelines supporting trajectory modeling (Wolf et al., 2018, 8472 citations). Current best practices emphasize robust pseudotime estimation amid technical noise (Luecken and Theis, 2019, 2145 citations).

15
Curated Papers
3
Key Challenges

Why It Matters

Lineage tracing reveals cell fate decisions in embryogenesis and cancer progression. Slingshot enables branching trajectory reconstruction from single-cell data, applied to neural development (Street et al., 2018). Scanpy workflows map dynamic gene expression along lineages in tissue regeneration studies (Wolf et al., 2018). Spatial methods like those in Satija et al. (2015) integrate positional data to validate inferred paths, impacting drug target discovery in heterogeneous tumors.

Key Research Challenges

Technical noise in trajectories

Dropout events distort pseudotime estimates in sparse scRNA-seq data (Luecken and Theis, 2019). Methods like Slingshot mitigate via minimum spanning trees but struggle with multi-branching (Street et al., 2018). Spatial data adds alignment errors (Satija et al., 2015).

Branching fate inference

Distinguishing convergence from branching requires advanced graph models (Street et al., 2018). Scanpy's diffusion pseudotime handles continuous states but needs validation (Wolf et al., 2018). Multimodal integration improves accuracy (Hao et al., 2023).

Scalability to large datasets

High-dimensional data overwhelms memory in tools like deepTools2 (Ramírez et al., 2016). Scanpy optimizes via approximate nearest neighbors (Wolf et al., 2018). Best practices recommend subsampling for initial exploration (Luecken and Theis, 2019).

Essential Papers

1.

SCANPY: large-scale single-cell gene expression data analysis

F. Alexander Wolf, Philipp Angerer, Fabian J. Theis · 2018 · Genome biology · 8.5K citations

2.

deepTools2: a next generation web server for deep-sequencing data analysis

Fidel Ramírez, Devon Ryan, Björn Grüning et al. · 2016 · Nucleic Acids Research · 8.4K citations

We present an update to our Galaxy-based web server for processing and visualizing deeply sequenced data. Its core tool set, deepTools, allows users to perform complete bioinformatic workflows rang...

3.

Massively parallel digital transcriptional profiling of single cells

Grace Zheng, Jessica M. Terry, Phillip Belgrader et al. · 2017 · Nature Communications · 7.3K citations

4.

Spatial reconstruction of single-cell gene expression data

Rahul Satija, Jeffrey A. Farrell, David Gennert et al. · 2015 · Nature Biotechnology · 7.2K citations

5.

SCENIC: single-cell regulatory network inference and clustering

Sara Aibar, Carmen Bravo González‐Blas, Thomas Moerman et al. · 2017 · Nature Methods · 6.3K citations

6.

Estimating the population abundance of tissue-infiltrating immune and stromal cell populations using gene expression

Étienne Becht, Nicolás A. Giraldo, Laetitia Lacroix et al. · 2016 · Genome biology · 3.8K citations

7.

Dictionary learning for integrative, multimodal and scalable single-cell analysis

Yuhan Hao, Tim Stuart, Madeline H. Kowalski et al. · 2023 · Nature Biotechnology · 3.7K citations

Reading Guide

Foundational Papers

Start with Scanpy (Wolf et al., 2018) for scalable pipelines; Slingshot (Street et al., 2018) for lineage specifics; Hawrylycz et al. (2012) for early spatial transcriptomics context.

Recent Advances

Luecken and Theis (2019) for best practices; Hao et al. (2023) for multimodal advances; study alongside Scanpy implementations.

Core Methods

Pseudotime via diffusion maps (Scanpy); graph-based lineages (Slingshot); spatial reconstruction (Satija et al., 2015); regulatory networks (SCENIC, Aibar et al., 2017).

How PapersFlow Helps You Research Lineage Tracing

Discover & Search

Research Agent uses searchPapers('lineage tracing single-cell pseudotime') to retrieve Slingshot (Street et al., 2018), then citationGraph reveals downstream citations like Luecken and Theis (2019), and findSimilarPapers expands to spatial trajectory papers.

Analyze & Verify

Analysis Agent runs readPaperContent on Street et al. (2018) to extract Slingshot pseudocode, verifies trajectory math via verifyResponse (CoVe), and executes runPythonAnalysis with Scanpy code from Wolf et al. (2018) for GRADE-graded pseudotime validation on user datasets.

Synthesize & Write

Synthesis Agent detects gaps in branching inference across Slingshot and Scanpy papers, flags contradictions in pseudotime definitions, while Writing Agent uses latexEditText on trajectory manuscripts, latexSyncCitations for Street et al. (2018), and exportMermaid for lineage graphs.

Use Cases

"Reproduce Slingshot pseudotime on my scRNA-seq dataset"

Research Agent → searchPapers('Slingshot lineage') → Analysis Agent → runPythonAnalysis(Scanpy + Slingshot code from Street et al. 2018) → matplotlib plot of branching trajectories.

"Write LaTeX review of lineage tracing methods"

Synthesis Agent → gap detection (Scanpy vs Slingshot) → Writing Agent → latexEditText(draft) → latexSyncCitations(Street 2018, Wolf 2018) → latexCompile → PDF with lineage diagram.

"Find GitHub code for single-cell trajectory analysis"

Research Agent → paperExtractUrls(Street 2018) → Code Discovery → paperFindGithubRepo → githubRepoInspect(Scanpy repo) → verified Slingshot implementation.

Automated Workflows

Deep Research workflow scans 50+ papers via searchPapers('lineage tracing scRNA-seq'), structures report on Slingshot-Scanpy evolution (Street et al. 2018; Wolf et al. 2018). DeepScan applies 7-step analysis: readPaperContent → runPythonAnalysis(pseudotime) → CoVe verification on user data. Theorizer generates hypotheses on spatial lineage convergence using Satija et al. (2015).

Frequently Asked Questions

What defines lineage tracing in single-cell transcriptomics?

Lineage tracing reconstructs developmental paths via pseudotime and branching inference from scRNA-seq data, as in Slingshot (Street et al., 2018).

What are core methods for lineage tracing?

Slingshot uses minimum spanning trees for lineages (Street et al., 2018); Scanpy employs diffusion pseudotime (Wolf et al., 2018); Seurat integrates via dynamic mode decomposition.

What are key papers on single-cell lineage tracing?

Slingshot (Street et al., 2018, 3077 citations); Scanpy (Wolf et al., 2018, 8472 citations); best practices tutorial (Luecken and Theis, 2019, 2145 citations).

What open problems exist in lineage tracing?

Robust branching detection amid noise (Luecken and Theis, 2019); scalable spatial integration (Satija et al., 2015); multimodal data fusion (Hao et al., 2023).

Research Single-cell and spatial transcriptomics with AI

PapersFlow provides specialized AI tools for Biochemistry, Genetics and Molecular Biology researchers. Here are the most relevant for this topic:

See how researchers in Life Sciences use PapersFlow

Field-specific workflows, example queries, and use cases.

Life Sciences Guide

Start Researching Lineage Tracing with AI

Search 474M+ papers, run AI-powered literature reviews, and write with integrated citations — all in one workspace.

See how PapersFlow works for Biochemistry, Genetics and Molecular Biology researchers