TLDR: TriAgent is an AI-powered multi-agent system designed to improve emergency department triage by automatically discovering new biomarkers and validating them against medical literature. It uses specialized AI agents to analyze patient data, conduct deep research, and provide transparent justifications for both known and novel biomarker candidates, outperforming existing LLM-based methods in accuracy and factual consistency.
Emergency departments around the world are facing significant challenges, including increasing patient numbers, staff shortages, and inconsistencies in how patients are prioritized, known as triage. Current methods for triage primarily rely on vital signs, standard lab tests, and doctors’ judgment. While these are effective, they often miss subtle biological signals that could significantly improve predictions for infections or guide antibiotic use in urgent situations.
To tackle these issues, researchers have introduced TriAgent, an innovative framework that uses large language models (LLMs) in a multi-agent system. TriAgent automates the discovery of new biomarkers – biological indicators of disease – and rigorously validates them against existing medical literature, also assessing their novelty.
How TriAgent Works
Imagine a team of highly specialized AI assistants working together. That’s essentially TriAgent. It employs a ‘supervisor research agent’ that generates specific research topics and then delegates these tasks to various ‘sub-agents’. These sub-agents are experts at retrieving evidence from diverse data sources, including biomedical databases and web sources.
The process unfolds in several stages:
- Scoping Agent: This agent refines the user’s initial medical query, ensuring the research is focused and aligned with clinical objectives.
- Data Analysis Agent: It analyzes patient data from routine tests to identify potential biomarkers not typically used in triage. This involves exploring data patterns and using advanced AI models to find predictive features.
- Research Supervisor Agent: This central coordinator takes the refined query and identified biomarkers, then formulates a research plan. It decides if multiple sub-agents are needed and assigns them specific literature research topics.
- Research Sub-agents: These agents perform targeted searches across vast biomedical literature using a technique called Retrieval Augmented Generation (RAG). They gather evidence for each candidate biomarker.
- Reporting Agent: Finally, all findings are synthesized into a detailed, auditable report. This report classifies biomarkers as either ‘grounded’ (meaning they are supported by existing knowledge) or ‘novel candidates’ (meaning they are new discoveries). It also provides transparent justifications and highlights areas where more research is needed.
Unlike previous systems that focused only on known clinical biomarkers, TriAgent offers an end-to-end solution, from data analysis to literature grounding, aiming to enhance transparency, explainability, and expand the range of potentially actionable clinical biomarkers. For more details, you can read the full research paper here.
Key Achievements and Performance
TriAgent has demonstrated impressive performance. In experiments, it achieved a topic adherence F1 score of 55.7 ± 5.0%, which is more than 10% higher than the CoT-ReAct agent. Its faithfulness score, which measures how factually consistent its responses are with retrieved context, was 0.42 ± 0.39, exceeding all baselines by over 50%. This indicates that TriAgent consistently outperforms state-of-the-art LLM-based agentic frameworks in justifying biomarkers and assessing their novelty based on literature.
The research also explored the optimal number of sub-agents for deep research, finding that around 4-6 sub-agents provided the best balance between performance and computational cost, with 5 being the chosen optimal configuration.
Also Read:
- Evaluating PHI De-identification Models with Multi-Agent AI
- ReclAIm: An Autonomous Framework for Sustaining Medical Imaging AI Performance
Limitations and Future Directions
While TriAgent shows great promise, the authors acknowledge certain limitations. The current work does not perform full clinical validation of biomarkers, which is a rigorous process required for regulatory approval. The data used for discovery came from a single healthcare facility, which might introduce biases and limit generalizability. Additionally, the performance of the deep research pipeline depends on the availability and quality of medical literature, with access restrictions potentially limiting its scope.
Future work will focus on integrating quantitative biomarker validation for clinical use, improving semantic understanding to reduce errors in evidence retrieval, and balancing the breadth and specificity of information retrieval. Ultimately, TriAgent represents a significant step towards using AI to enhance acute care triage, potentially leading to faster response times, lower costs, and more precise diagnoses.


