TLDR: This research introduces “hypothesis hunting,” a new paradigm for scientific discovery in large datasets, unconstrained by specific questions. It presents AScience, a framework, and ASCollab, an AI system of LLM-based agents that self-organize into evolving networks. These agents collaborate and peer-review findings, leading to the accumulation of novel, high-quality, and diverse discoveries, as demonstrated in cancer genomics experiments. The study highlights the critical role of social dynamics in AI agent networks for sustained scientific exploration.
Imagine a future where scientific breakthroughs aren’t just the result of individual human brilliance, but also the tireless, collaborative efforts of autonomous AI agents. This is the vision explored in a groundbreaking new research paper titled “Hypothesis Hunting with Evolving Networks of Autonomous Scientific Agents” by Tennison Liu, Silas Ruhrberg Estévez, David L. Bentley, and Mihaela van der Schaar. This work introduces a novel approach called “hypothesis hunting,” which aims to unlock hidden insights from vast scientific datasets without being constrained by predefined research questions.
Traditional scientific methods often start with a specific question. However, with the explosion of large-scale datasets—like health biobanks, cell atlases, and Earth re-analyses—there’s an immense opportunity for exploratory discovery. Hypothesis hunting is defined as the continuous and diverse exploration of these datasets to surface promising findings that can then guide human investigation and experimental validation.
The challenges of this open-ended discovery are immense: the sheer scale of data creates a combinatorial explosion of possible analyses, and meaningful progress often requires knowledge from diverse disciplines. The researchers propose AScience, a framework that models scientific discovery as an interaction between agents, networks, and evaluation standards. They then implement this framework as ASCollab, a distributed system of AI-powered research agents.
What makes ASCollab unique is its emphasis on social dynamics. Instead of isolated AI scientists, ASCollab features a community of LLM-based agents with diverse behaviors and expertise. These agents self-organize into evolving networks, constantly producing new findings and peer-reviewing each other’s work under shared standards. This mirrors how human scientific communities operate, with collaboration, critique, and cross-pollination driving progress.
The ASCollab system is built on two core shared-memory structures: an agent registry, which tracks agent profiles, expertise, and reputation, and an internal archive, which stores all accepted research outputs. Agents use query tools to access these resources, allowing them to find collaborators, retrieve prior findings, and update their understanding of the field. This retrieval-augmented generation (RAG) mechanism ensures that agents learn and build upon collective knowledge.
To foster sustained exploration, the agents are designed with heterogeneity in mind. They have distinct “epistemic behaviors” (e.g., explorer vs. exploiter, independent vs. collaborative) and adaptive expertise profiles. Each agent conducts research sessions, acting as a primary investigator, with access to datasets and a suite of computational tools for analysis, literature search, and communication. Collaboration is a key feature, allowing agents to brainstorm, share findings, and provide feedback.
A crucial part of ASCollab is its structured peer-review process. Each research output undergoes a two-stage evaluation: first by a panel of K reviewers (selected based on expertise), and then by a meta-reviewer who assesses submissions thematically and assigns a relative judgment of merit. Only the top fraction of outputs are accepted into the internal archive, ensuring quality control and continuous knowledge accumulation. This feedback loop ensures that the network’s collective behavior is constantly shaped by cumulative findings.
The effectiveness of ASCollab was tested on three cancer cohorts from The Cancer Genome Atlas (TCGA): kidney renal clear cell carcinoma (KIRC), diffuse large B-cell lymphoma (DLBC), and pancreatic adenocarcinoma (PAAD). The system integrated multi-omic data (RNA-sequencing, protein expression, clinical survival, pathway annotations, and drug-target information) to discover novel, supported, and significant findings.
Compared to a baseline of independent agents, ASCollab produced findings that were rated by human experts as more novel, higher quality, and more diverse. Independent agents tended to converge on narrow areas, while ASCollab agents explored a broader hypothesis space. The research highlighted several compelling case studies, including the identification of a ferroptosis module in kidney cancer, the proposal of SLC5A2 and ABCC8 as therapeutic targets in pancreatic adenocarcinoma (anticipating a later independent publication), and the validation of BIRC5 with an extension to PRKD1 in KIRC.
The study also observed heterogeneous agent behaviors and dynamic network evolution. Some agents preferred deep exploitation, while others adopted broad exploration. Collaborations were shown to yield systematically higher meta-review scores, underscoring the value of social interaction. The network structures reorganized over time, adapting to emerging areas of inquiry.
Also Read:
- DualResearch: Enhancing AI Scientific Reasoning with Dual-Graph Information Fusion
- AI’s Role in Peer Review: A Framework for Systematic Evaluation
While experimental validation remains essential, this research demonstrates the immense potential of socially structured, agentic networks to sustain exploratory hypothesis hunting at scale. It paves the way for accelerating and broadening the frontier of scientific inquiry, generating diverse and high-quality hypotheses for human scientists to pursue. You can read the full paper here.


