PapersFlow Research Brief
Genomics and Phylogenetic Studies
Research Guide
What is Genomics and Phylogenetic Studies?
Genomics and Phylogenetic Studies is the interdisciplinary field encompassing statistical analysis, alignment, annotation of RNA sequencing data, transcript quantification, phylogenetic tree reconstruction, genome assembly, sequence variation analysis, and metagenomics assembly in functional genomics.
This field includes 464,010 works with tools like BLAST for sequence similarity searches and Trimmomatic for preprocessing Illumina sequence data. Key methods cover progressive multiple sequence alignment via CLUSTAL W and fast short read alignment using Burrows-Wheeler transform as in BWA. Applications span quality assessment in functional genomics and phylogenetic tree building with neighbor-joining.
Topic Hierarchy
Research Sub-Topics
RNA-seq Transcript Quantification
Researchers develop and benchmark methods like Salmon and Kallisto for estimating transcript abundances from short-read data. Studies address biases in fragment length and sequencing depth.
Phylogenetic Tree Reconstruction
This area covers distance-based methods like neighbor-joining and maximum likelihood inference from molecular sequences. Researchers evaluate tree accuracy under long-branch attraction and incomplete lineage sorting.
Multiple Sequence Alignment Algorithms
Studies improve progressive alignment strategies such as CLUSTAL and MUSCLE for protein and nucleotide datasets. Researchers incorporate structural and secondary structure information for accuracy.
Short Read Alignment Methods
This sub-topic examines Burrows-Wheeler transform tools like BWA and Bowtie for mapping Illumina reads to references. Researchers optimize for speed, sensitivity, and variant calling accuracy.
Metagenomics Assembly Pipelines
Researchers develop de Bruijn graph assemblers like MEGAHIT for recovering microbial genomes from environmental samples. Studies bin contigs and assess assembly completeness.
Why It Matters
Genomics and Phylogenetic Studies enable accurate genome assembly and sequence variation analysis essential for functional genomics and metagenomics. For example, 'Basic local alignment search tool' by Altschul et al. (1990) with 92,387 citations powers protein and DNA database searches across biology, while 'Fast and accurate short read alignment with Burrows–Wheeler transform' by Li and Durbin (2009) with 60,292 citations handles massive short read data from next-generation sequencing in projects like the 1000 Genomes. These tools support real-world applications in evolutionary biology, epidemic tracking as in 'Assessing phylogenetic confidence at pandemic scales' (2025), and eukaryotic evolution studies like 'Dated gene duplications elucidate the evolutionary assembly of eukaryotes' (2025), advancing fields from cancer research to gut microbiota analysis.
Reading Guide
Where to Start
'Basic local alignment search tool' by Altschul et al. (1990) first, as it introduces foundational sequence alignment essential for all downstream genomics and phylogenetic tasks with 92,387 citations.
Key Papers Explained
'Basic local alignment search tool' by Altschul et al. (1990) establishes sequence similarity search, extended by 'Gapped BLAST and PSI-BLAST: a new generation of protein database search programs' by Altschul (1997) for faster protein searches; 'CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice' by Thompson et al. (1994) builds alignments for phylogenetics; 'The neighbor-joining method: a new method for reconstructing phylogenetic trees' by Saitou and Nei (1987) provides tree reconstruction from distances; 'Fast and accurate short read alignment with Burrows–Wheeler transform' by Li and Durbin (2009) adapts alignment for high-throughput sequencing data.
Paper Timeline
Most-cited paper highlighted in red. Papers ordered chronologically.
Advanced Directions
Recent preprints focus on syntrophy in eukaryotic evolution (John M. Archibald, 2025), phylogenetic confidence at pandemic scales (2025), dated gene duplications in eukaryote assembly (2025), and convergent genome evolution in terrestrial animals (2025), with tools like PhyloTune accelerating updates using DNA language models.
Papers at a Glance
| # | Paper | Year | Venue | Citations | Open Access |
|---|---|---|---|---|---|
| 1 | Cleavage of Structural Proteins during the Assembly of the Hea... | 1970 | Nature | 250.9K | ✕ |
| 2 | Basic local alignment search tool | 1990 | Journal of Molecular B... | 92.4K | ✕ |
| 3 | Gapped BLAST and PSI-BLAST: a new generation of protein databa... | 1997 | Nucleic Acids Research | 73.6K | ✓ |
| 4 | Fiji: an open-source platform for biological-image analysis | 2012 | Nature Methods | 66.8K | ✓ |
| 5 | Trimmomatic: a flexible trimmer for Illumina sequence data | 2014 | Bioinformatics | 65.4K | ✓ |
| 6 | CLUSTAL W: improving the sensitivity of progressive multiple s... | 1994 | Nucleic Acids Research | 64.3K | ✓ |
| 7 | The Sequence Alignment/Map format and SAMtools | 2009 | Bioinformatics | 64.1K | ✓ |
| 8 | Fast and accurate short read alignment with Burrows–Wheeler tr... | 2009 | Bioinformatics | 60.3K | ✓ |
| 9 | The neighbor-joining method: a new method for reconstructing p... | 1987 | Molecular Biology and ... | 60.1K | ✓ |
| 10 | Fast gapped-read alignment with Bowtie 2 | 2012 | Nature Methods | 57.6K | ✓ |
In the News
PhyloTune: An efficient method to accelerate phylogenetic updates using a pretrained DNA language model
challenging for current analytical methods. In this study, we introduce a new solution to accelerate the integration of novel taxa into an existing phylogenetic tree using a pretrained DNA language...
SPRTA: a smarter way to measure evolution uncertainty
This work was supported by EMBL core funds and the Medical Research Council (MRC). Australian collaborators received support from the Chan-Zuckerberg Initiative. #### Source article(s)
Semantic design of functional de novo genes from a genomic language model
language models can extend beyond natural sequences.
New Genome BC Funded Project Aims to Bring BC Closer ...
# New Genome BC Funded Project Aims to Bring BC Closer to Ending HIV Transmission Share This Facebook LinkedIn ### Share on Mastodon Enter your Mastodon instance URL (optional)Share
Great news for KPU's science students! GenomeBC is ...
designation focused on applied genomics for agriculture, opening more doors for students and industry. Learn more: https://bit.ly/42qyT7o | Kwantlen Polytechnic University
Code & Tools
Delphy is a fast, scalable, accurate and accessible tool for Bayesian phylogenetics based on Explicit Mutation-Annotated Trees (EMATs). EMATs are a...
`piqtree`(pronounced 'pie-cue-tree') is a library which allows you to use IQ-TREE directly from Python! The interface with Python is through cogent...
NovelTree is a highly parallelized and computationally efficient phylogenomic workflow that infers gene families, gene family trees, species trees,...
The purpose of CAFE is to analyze changes in gene family size in a way that accounts for phylogenetic history and provides a statistical foundation...
Phylociraptor is a computational framework calculating phylogenomic trees for a specified set of species using different alignment, trimming and tr...
Recent Preprints
Bioinformatics and Genomics section
## Section Highlights View all Bioinformatics and Genomics articles
Phylogenetics - Latest research and news
A comparative genomic investigation of metabolism across the tree of life supports the hypothesis that syntrophy — metabolic exchange between symbiotic partners — had a key role in the evolution of...
Assessing phylogenetic confidence at pandemic scales
Phylogenetics has a central role in evolutionary biology and genomic epidemiology 1 . Assessing phylogenetic confidence and reliability is therefore crucial and the methods that do this, such as th...
Dated gene duplications elucidate the evolutionary assembly of eukaryotes
The origin of eukaryotes was a formative event in the history of life, in which a new kind of cell with distinct functional, morphological and ecological modalities evolved through an evolutionary ...
Convergent genome evolution shaped the emergence of terrestrial animals
* Evolutionary genetics * Phylogenetics * Zoology ## Abstract
Latest Developments
Recent developments in genomics and phylogenetic studies include the creation of comprehensive single-cell molecular atlases across the eukaryotic tree of life, utilizing high-quality genomes to inform phylogenetics (Nature), advancements in reconstructing evolutionary relationships through high-resolution fossil analysis (Nature), and progress in genome sequencing such as complete ape genomes (Nature) and microbial genome search methods via phylogenetic compression (Nature), with ongoing discussions and research presented at upcoming conferences like the Biology of Genomes 2026 (CSHL). These reflect significant strides in understanding evolutionary relationships and genomic diversity as of early 2026.
Sources
Frequently Asked Questions
What is the role of BLAST in genomics?
BLAST, introduced in 'Basic local alignment search tool' by Altschul et al. (1990), is a tool for searching protein and DNA databases for sequence similarities. 'Gapped BLAST and PSI-BLAST: a new generation of protein database search programs' by Altschul (1997) refines it with algorithmic improvements for faster execution on protein comparisons.
How does Trimmomatic preprocess sequencing data?
Trimmomatic, described in 'Trimmomatic: a flexible trimmer for Illumina sequence data' by Bolger et al. (2014), handles paired-end data with flexibility and high performance for next-generation sequencing read preprocessing. It trims adapters and low-quality bases to meet requirements unmet by prior tools.
What is the neighbor-joining method in phylogenetics?
The neighbor-joining method in 'The neighbor-joining method: a new method for reconstructing phylogenetic trees' by Saitou and Nei (1987) reconstructs trees from evolutionary distance data by minimizing total branch length through clustering operational taxonomic units. It improves accuracy for phylogenetic analysis.
What is the SAM format used for?
The Sequence Alignment/Map (SAM) format in 'The Sequence Alignment/Map format and SAMtools' by Li et al. (2009) stores read alignments against reference sequences, supporting short and long reads up to 128 Mbp from various platforms. SAMtools enables efficient random access and compression.
How does Bowtie 2 align reads?
Bowtie 2 in 'Fast gapped-read alignment with Bowtie 2' by Langmead and Salzberg (2012) provides fast gapped-read alignment for short reads. It supports diverse sequencing technologies in genomic studies.
Open Research Questions
- ? How can phylogenetic confidence be accurately assessed at pandemic scales without enormous computational demands, as highlighted in recent preprints?
- ? What role did syntrophy play in eukaryotic cell evolution, per comparative genomic metabolism studies across the tree of life?
- ? How do dated gene duplications reveal the evolutionary assembly of eukaryotes from prokaryotic mergers?
- ? In what ways has convergent genome evolution influenced the emergence of terrestrial animals?
- ? Can pretrained DNA language models efficiently accelerate phylogenetic tree updates for novel taxa integration?
Recent Trends
Field spans 464,010 works; recent preprints emphasize eukaryotic evolution via syntrophy and gene duplications (Archibald, 2025; 2025-12-03), pandemic-scale phylogenetics , and tools like PhyloTune (2025-07-26) and SPRTA (2025-11-05) for uncertainty measurement, alongside workflows like Delphy, piqtree, and NovelTree on GitHub.
2025-11-05Research Genomics and Phylogenetic Studies with AI
PapersFlow provides specialized AI tools for Biochemistry, Genetics and Molecular Biology researchers. Here are the most relevant for this topic:
AI Literature Review
Automate paper discovery and synthesis across 474M+ papers
Paper Summarizer
Get structured summaries of any paper in seconds
Deep Research Reports
Multi-source evidence synthesis with counter-evidence
See how researchers in Life Sciences use PapersFlow
Field-specific workflows, example queries, and use cases.
Start Researching Genomics and Phylogenetic Studies with AI
Search 474M+ papers, run AI-powered literature reviews, and write with integrated citations — all in one workspace.
See how PapersFlow works for Biochemistry, Genetics and Molecular Biology researchers