TLDR: The National Institutes of Health (NIH) has unveiled GeneAgent, an AI system powered by a large language model (LLM), designed to significantly improve the accuracy and informativeness of gene set analysis. This system cross-references its predictions with expert-curated databases to mitigate AI hallucinations and provide more precise biological insights.
Researchers at the National Institutes of Health (NIH) have made a significant stride in genetic analysis with the development of GeneAgent, an innovative artificial intelligence (AI) system. This new tool, driven by a large language model (LLM), is engineered to produce more accurate and comprehensive descriptions of biological processes and their functions within gene set analysis, surpassing the capabilities of existing systems.
GeneAgent’s core innovation lies in its ability to self-verify its initial predictions, or ‘claims,’ against information sourced from established, expert-curated databases. This rigorous cross-referencing mechanism generates a detailed verification report, highlighting both successes and failures in its analysis. This feature is crucial in combating ‘AI hallucinations’—the phenomenon where LLMs generate false, misleading, or fabricated content due to their training on vast amounts of internet data without inherent truth verification. Furthermore, it addresses the issue of circular reasoning, where LLMs might fact-check their own generated results against their internal data, leading to a false sense of confidence in erroneous outputs.
This AI agent is poised to revolutionize how researchers interpret high-throughput molecular data. By accurately identifying relevant biological pathways and functional modules, GeneAgent can lead to a deeper understanding of how various diseases and conditions impact gene groups, both individually and collectively. Previous studies that utilized LLMs for genomic questions or summarizing biological processes in gene sets often overlooked the critical issue of hallucinations in the generated content. GeneAgent directly tackles this by independently comparing its claims to external, authoritative knowledge bases.
Also Read:
- AI’s Forthcoming Role in Auditing Scientific Research: Implications for Public Trust
- Mayo Clinic Pioneers Medical AI with NVIDIA Blackwell Supercomputing Deployment
The research team initiated testing of GeneAgent on 1,106 gene sets, demonstrating its potential to provide more reliable and actionable insights for biological research.


