TLDR: SOLVE-Med is a novel multi-agent AI architecture designed for medical question answering. It combines a Router Agent for dynamic specialist selection, ten domain-specialized small language models (SLMs) for specific medical expertise, and an Orchestrator Agent to synthesize coherent responses. This system outperforms larger standalone models, offers local deployment for enhanced privacy and efficiency, and addresses challenges like hallucinations and high computational costs in healthcare AI.
In the rapidly evolving landscape of artificial intelligence in healthcare, a new multi-agent system called SOLVE-Med has emerged, designed to tackle the complexities of medical question answering. Developed by researchers from the University of Naples Federico II and Northwestern University, SOLVE-Med offers a promising solution to common challenges faced by traditional large language models (LLMs) in clinical settings, such as hallucinations, bias, high computational demands, and privacy concerns.
SOLVE-Med stands for Specialized Orchestration for Leading Vertical Experts across Medical Specialties. It’s an innovative architecture that combines the strengths of domain-specialized small language models (SLMs) to process and respond to intricate medical queries. Unlike large, monolithic LLMs that often require significant computational resources and cloud-based services, SOLVE-Med leverages smaller, more efficient models that can be deployed locally, enhancing privacy and reducing energy consumption.
How SOLVE-Med Works
The system is built around three core components that work in harmony:
A Router Agent acts as the initial point of contact for a user’s medical question. This agent functions as a multi-label classifier, dynamically selecting the most appropriate medical specialists from a pool of experts. It mimics the consultative nature of clinical workflows, ensuring that queries are directed to the relevant domains. The Router Agent uses a fine-tuned DistilBERT model, known for its rapid inference and low memory footprint.
A Pool of Medical Specialists consists of ten specialized small language models, each with 1 billion parameters. These SLMs are fine-tuned on distinct medical domains, such as Cardiology, Dermatology, Neurology, and more, using data from Italian healthcare forums. When selected by the Router Agent, these specialists generate responses grounded in their specific areas of expertise. To maintain efficiency, a quantized version of the LLaMA-3.2-1B-Instruct model is used for these specialists.
An Orchestrator Agent is the final component, responsible for synthesizing the individual outputs from the selected medical specialists into a single, coherent, and comprehensive answer. This agent is implemented using a quantized version of the Gemma-2-9B-IT model. Its larger parameter count compared to the individual specialists allows it to effectively integrate diverse contributions, mitigating issues like omissions or oversimplification. The Orchestrator Agent operates with a structured prompting strategy, framing itself as a professional medical assistant to deliver medically sound and contextually appropriate responses.
Key Advantages and Performance
One of SOLVE-Med’s significant advantages is its ability to enable local deployment. By using compact, specialized models, it drastically improves computational efficiency and safeguards data privacy by eliminating reliance on external cloud infrastructure. This makes it particularly suitable for healthcare applications where resource constraints and data sensitivity are critical.
The system was rigorously evaluated on Italian medical forum data across ten specialties. SOLVE-Med demonstrated superior performance, achieving a ROUGE-1 score of 0.301 and a BERTScore F1 of 0.697. These results indicate that it outperforms standalone models, including those up to 14 billion parameters, in generating high-quality, relevant responses. The evaluation also showed that strategies involving a greater number of selected specialists tend to yield better outcomes, suggesting that diverse expert contributions enhance the completeness and informativeness of the final response.
Also Read:
- STRMAC: A State-Aware Framework for Smarter Multi-Agent AI Collaboration
- Optimizing LLM Collaboration: A Graph-Based Approach to Test-Time Scaling
Future Outlook
SOLVE-Med represents a significant step forward in developing reliable and interpretable medical AI systems. The researchers envision future work including human evaluations and improved context handling to further refine the system. The ultimate goal is to establish SOLVE-Med as a dependable support tool in clinical practice, complementing rather than replacing human medical judgment. For more technical details, you can refer to the full research paper available here.


