Subtopic Deep Dive
Sorting by Reversals
Research Guide
What is Sorting by Reversals?
Sorting by reversals computes the minimum number of inversion operations needed to transform one genome permutation into the identity permutation, modeling evolutionary genomic inversions.
Hannenhalli and Pevzner introduced a polynomial-time algorithm for signed permutations in 1999 (642 citations). Unsigned permutations remain NP-hard, driving approximation algorithm research. Over 10 key papers since 1999 address exact solutions, heuristics, and extensions to multi-chromosome genomes.
Why It Matters
Sorting by reversals measures genomic inversion distance for reconstructing mammalian evolutionary histories, as shown in human-mouse comparisons (Pevzner and Tesler, 2002, 381 citations). It enables inference of rearrangement scenarios in bacterial assemblies (Kolmogorov et al., 2014, 212 citations) and plant genome evolution (Perumal et al., 2020, 160 citations). Tools like CREx apply reversal sorting to detect events across unichromosomal genomes (Bernt et al., 2007, 386 citations).
Key Research Challenges
Unsigned Permutation NP-hardness
Computing minimum reversals for unsigned permutations lacks polynomial algorithms, unlike signed cases (Hannenhalli and Pevzner, 1999). Approximation ratios exceed 1.375 for general instances. Heuristics like those in CREx provide practical solutions but sacrifice optimality (Bernt et al., 2007).
Multi-operation Rearrangement Distance
Extending reversal sorting to include translocations and block interchanges increases complexity (Yancopoulos et al., 2005, 493 citations). Exact algorithms handle special cases but fail on general genomes. Balancing computational efficiency with biological realism remains open.
Breakpoint Reuse in Evolution
Mammalian genomes show repeated breakpoint reuse, complicating reversal distance models (Bourque et al., 2004, 239 citations). This violates assumptions in Hannenhalli-Pevzner theory. Integrating microrearrangements into sorting algorithms requires new frameworks.
Essential Papers
Transforming cabbage into turnip
Sridhar Hannenhalli, Pavel A. Pevzner · 1999 · Journal of the ACM · 642 citations
Genomes frequently evolve by reversals ρ( i,j ) that transform a gene order π 1 … π i π i +1 … π j -1 π j … π n into π 1 … π i π j -1 … π i +1 π j … π n . Reversal distance between permutations π a...
Efficient sorting of genomic permutations by translocation, inversion and block interchange
Sophia Yancopoulos, Oliver Attie, R. Friedberg · 2005 · Computer applications in the biosciences · 493 citations
Finding genomic distance based on gene order is a classic problem in genome rearrangements. Efficient exact algorithms for genomic distances based on inversions and/or translocations have been foun...
CREx: inferring genomic rearrangements based on common intervals
Matthias Bernt, Daniel Merkle, Kai Ramsch et al. · 2007 · Bioinformatics · 386 citations
Abstract Summary: We present the web-based program CREx for heuristically determining pairwise rearrangement events in unichromosomal genomes. CREx considers transpositions, reverse transpositions,...
Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes
Pavel A. Pevzner, Glenn Tesler · 2002 · Genome Research · 381 citations
Although analysis of genome rearrangements was pioneered by Dobzhansky and Sturtevant 65 years ago, we still know very little about the rearrangement events that produced the existing varieties of ...
A new sequence distance measure for phylogenetic tree construction
Hasan H. Otu, Khalid Sayood · 2003 · Bioinformatics · 347 citations
Abstract Motivation: Most existing approaches for phylogenetic inference use multiple alignment of sequences and assume some sort of an evolutionary model. The multiple alignment strategy does not ...
Enredo and Pecan: Genome-wide mammalian consistency-based multiple alignment with paralogs
Benedict Paten, Javier Herrero, Kathryn Beal et al. · 2008 · Genome Research · 304 citations
Pairwise whole-genome alignment involves the creation of a homology map, capable of performing a near complete transformation of one genome into another. For multiple genomes this problem is genera...
Reconstructing the Genomic Architecture of Ancestral Mammals: Lessons From Human, Mouse, and Rat Genomes
Guillaume Bourque, Pavel A. Pevzner, Glenn Tesler · 2004 · Genome Research · 239 citations
Recent analysis of genome rearrangements in human and mouse genomes revealed evidence for more rearrangements than thought previously and shed light on previously unknown features of mammalian evol...
Reading Guide
Foundational Papers
Start with Hannenhalli-Pevzner (1999, 642 citations) for signed reversal theory and breakpoint graphs; follow with Yancopoulos (2005, 493 citations) for unsigned/multi-operation extensions; Pevzner-Tesler (2002, 381 citations) applies to human-mouse genomes.
Recent Advances
Kolmogorov (2014, 212 citations) uses reversals in bacterial assembly; Perumal (2020, 160 citations) reconstructs Brassica genomes via reversal distances.
Core Methods
Breakpoint graphs transform permutations to overlap graphs for cycle decomposition (Hannenhalli-Pevzner); oriented reversals simplify signed cases; common intervals detect events (CREx, Bernt 2007).
How PapersFlow Helps You Research Sorting by Reversals
Discover & Search
Research Agent uses searchPapers('sorting by reversals unsigned NP-hard') to retrieve Hannenhalli and Pevzner (1999), then citationGraph to map 642 citing papers, and findSimilarPapers to uncover Yancopoulos et al. (2005) extensions to multi-operations.
Analyze & Verify
Analysis Agent applies readPaperContent on Hannenhalli-Pevzner (1999) to extract reversal distance formulas, verifies complexity claims via verifyResponse (CoVe) against original proofs, and runs PythonAnalysis to simulate signed vs unsigned sorting on sample permutations with NumPy, graded by GRADE for algorithmic correctness.
Synthesize & Write
Synthesis Agent detects gaps in unsigned approximation algorithms via contradiction flagging across Pevzner-Tesler (2002) and Bourque et al. (2004), while Writing Agent uses latexEditText for reversal distance proofs, latexSyncCitations for 10+ papers, latexCompile for camera-ready sections, and exportMermaid for permutation transformation diagrams.
Use Cases
"Implement Hannenhalli-Pevzner signed reversal sorting in Python"
Research Agent → searchPapers → Analysis Agent → runPythonAnalysis (NumPy simulation of ρ(i,j) operations on 100-gene permutation) → researcher gets executable code verifying 642-citation algorithm on custom genomes.
"Write LaTeX review of reversal distance in mammalian evolution"
Synthesis Agent → gap detection (Pevzner-Tesler 2002) → Writing Agent → latexEditText (add proofs) → latexSyncCitations (Bourque 2004) → latexCompile → researcher gets PDF with diagrams via exportMermaid.
"Find GitHub repos for CREx reversal inference tool"
Research Agent → exaSearch('CREx Bernt 2007 github') → Code Discovery → paperExtractUrls → paperFindGithubRepo → githubRepoInspect → researcher gets working CREx fork with reversal sorting benchmarks.
Automated Workflows
Deep Research workflow scans 50+ papers via citationGraph from Hannenhalli-Pevzner (1999), producing structured reports on signed/unsigned progress with GRADE-verified claims. DeepScan's 7-step chain analyzes Yancopoulos (2005) via readPaperContent → runPythonAnalysis on translocation+reversal distances → CoVe verification. Theorizer generates hypotheses on breakpoint reuse from Bourque (2004) literature synthesis.
Frequently Asked Questions
What is sorting by reversals?
Sorting by reversals finds the minimum reversals ρ(i,j) transforming permutation π to identity, modeling genome inversions (Hannenhalli and Pevzner, 1999).
What are main methods?
Signed permutations use polynomial-time algorithms via breakpoint graphs (Hannenhalli-Pevzner, 1999); unsigned are NP-hard with 1.375-approximations; CREx heuristically infers reversals+transpositions (Bernt et al., 2007).
What are key papers?
Foundational: Hannenhalli-Pevzner (1999, 642 citations) for signed sorting; Yancopoulos (2005, 493 citations) for multi-operations; Pevzner-Tesler (2002, 381 citations) for mammalian applications.
What are open problems?
Exact unsigned reversal distance (NP-hard); integrating breakpoint reuse into models (Bourque et al., 2004); scalable multi-chromosome sorting beyond heuristics.
Research Genome Rearrangement Algorithms with AI
PapersFlow provides specialized AI tools for Biochemistry, Genetics and Molecular Biology researchers. Here are the most relevant for this topic:
AI Literature Review
Automate paper discovery and synthesis across 474M+ papers
Paper Summarizer
Get structured summaries of any paper in seconds
Deep Research Reports
Multi-source evidence synthesis with counter-evidence
See how researchers in Life Sciences use PapersFlow
Field-specific workflows, example queries, and use cases.
Start Researching Sorting by Reversals with AI
Search 474M+ papers, run AI-powered literature reviews, and write with integrated citations — all in one workspace.
See how PapersFlow works for Biochemistry, Genetics and Molecular Biology researchers
Part of the Genome Rearrangement Algorithms Research Guide