OpenScientist: evaluating an open agentic AI co-scientist to

Authors

Kaleigh F Roberts, Zachary B Abrams, Luca Cappelletti, Mahdi Moqri, Nicholas Heugel, J Harry Caufield, Mathieu Bourdenx, Yan Li, Jineta Banerjee, Luca Foschini, Diego Galeano, Nomi L Harris, Melody Li, Kejun Ying, Justin A Melendez, Nicolas R Barthélemy, James G Bollinger, Yingxin He, Vitaliy Ovod, Tammie L S Benzinger, Shaney Flores, Brian A Gordon, Adegoke A Ojewole, Mukta Phatak, Donald L Elbert, Sarah Biber, Eric C Landsness, Christopher J Mungall, Randall J Bateman, Justin T Reese

Abstract

medRxiv [Preprint]. 2026 Mar 18:2026.03.15.26348338. doi: 10.64898/2026.03.15.26348338.

ABSTRACT

BACKGROUND: Advances in medicine depend on analyzing large and complex data sources, but discovery is partly constrained by the limited time and domain expertise of human researchers. Agentic artificial intelligence (agentic AI) can accelerate discovery by automating components of the scientific workflow, including information retrieval, data analysis, and knowledge synthesis.

AIM: OpenScientist, an open-source agentic AI co-scientist, aims to accelerate biomedical discovery by semi-autonomously investigating scientist-defined queries and generating clinically relevant, verifiable scientific insights.

METHODS: Domain experts evaluated OpenScientist for novel discoveries in four clinical case studies: (1) a prespecified analysis in a community-based Alzheimer's disease biomarker cohort, (2) unsupervised modeling for plasma proteomic survival prediction, (3) hypothesis investigation in single-cell transcriptomic data from neurons with neurofibrillary tangles, and (4) hypothesis generation with validation in a multiple myeloma dataset with a randomized negative control.

RESULTS: OpenScientist completed analyses in minutes that otherwise would take weeks to months of human time and expertise. It identified %ptau217 as the best predictor of amyloid PET status, generated a plasma proteomic survival model with performance comparable to published models, proposed a mechanism linking tau pathology to altered lysosomal acidification, and generated multiple myeloma hypotheses that were validated in an external cohort while distinguishing true signal from randomized controls.

CONCLUSION: OpenScientist demonstrates that open, auditable, agentic AI can support real-world clinical research by generating hypotheses, executing analyses, and discovering insights from complex datasets.

PMID:41891004 | PMC:PMC13015679 | DOI:10.64898/2026.03.15.26348338

OpenScientist: evaluating an open agentic AI co-scientist to accelerate biomedical discovery

Authors

Abstract