Skip to main content
Search
Main content
bioRxiv : the preprint server for biology
Published

Population-scale interpretation of RNA isoform diversity enabled by Isopedia

Authors

Xinchang Zheng, Zev Kronenberg, Sonia Garcia-Ruiz, Ryan M Layer, Emil K Gustavsson, Mina Ryten, Fritz J Sedlazeck

Abstract

bioRxiv [Preprint]. 2026 Mar 25:2026.03.23.713667. doi: 10.64898/2026.03.23.713667.

ABSTRACT

Alternative splicing generates extensive transcriptomic complexity, yet "novelty" is often inflated because of incomplete reference annotations, with 20-70% of transcripts in RNA-Seq studies labeled as novel. Isopedia provides an expandable data structure for reference-agnostic isoform annotation, which we demonstrate here through a population-scale catalog of 1,007 long-read datasets spanning 37 diverse biological contexts. By transitioning from reference-dependent to evidence-weighted annotation, Isopedia provides the frequency-based context necessary to distinguish stochastic noise from biologically active isoforms. In HG002 benchmarks, Isopedia reduced apparent isoform novelty by up to 26-fold, achieving a >95% annotation rate even for low-abundance isoforms typically missed by standard catalogs. The framework further supports systematic exploration of challenging loci such as pseudogenes and gene fusions. Isopedia transforms isoform discovery into a systematic interpretation of the human transcriptome, providing a critical foundation for clinical and functional RNA research. Isopedia is open source and freely available: https://github.com/zhengxinchang/isopedia.

PMID:41929139 | PMC:PMC13041807 | DOI:10.64898/2026.03.23.713667

UK DRI Authors

Prof Mina Ryten

Centre Director & Research Division Lead

Leveraging brain transcriptomics to understand the pathophysiology of Lewy body disease

Prof Mina Ryten