TLDR: Researchers developed FairSkillMARL, an AI framework that defines fairness in multi-agent systems by balancing workload and aligning tasks with agent skills, specifically for healthcare settings. They also created MARLHospital, a new simulator to test these concepts, showing that considering both skill and workload leads to more effective and equitable task distribution among healthcare workers, preventing burnout and improving efficiency.
In the demanding environment of healthcare, especially in emergency departments, ensuring fairness in task allocation among workers is crucial. Traditionally, fairness in multi-agent systems has often focused solely on balancing workloads. However, this approach frequently overlooks the unique skills and expertise of individual healthcare workers, leading to potential burnout for highly skilled individuals and inefficient task completion when tasks are mismatched with an agent’s capabilities.
A recent research paper, titled “SKILL-ALIGNED FAIRNESS IN MULTI-AGENT LEARNING FOR COLLABORATION IN HEALTHCARE,” addresses this critical gap. Authored by Promise Osaine Ekpo, Brian La, Thomas Wiener, Saesha Agarwal, Arshia Agrawal, Gonzalo Gonzalez-Pumariega, and Angelique Taylor from Cornell Tech, along with Lekan P. Molu from Microsoft Research NYC, the paper introduces a novel framework and a specialized simulation environment to tackle this complex problem.
Introducing FairSkillMARL and MARLHospital
The core contribution of this research is FairSkillMARL, a groundbreaking framework that redefines fairness as a dual objective: not just workload balance, but also skill-task alignment. This means that tasks are distributed not only to ensure an equitable amount of work but also to match tasks with the most appropriate skill sets of the healthcare agents. This approach aims to prevent situations where skilled agents are overused or where less skilled agents are assigned tasks beyond their immediate expertise, which can lead to delays and errors.
To rigorously test and validate FairSkillMARL, the researchers developed MARLHospital, a customizable, healthcare-inspired simulation environment. Existing simulators were found to be inadequate for modeling the intricate dynamics of healthcare teams, including varying energy levels, diverse skill sets, and the structured coordination required in real-world medical procedures like Cardiopulmonary Resuscitation (CPR). MARLHospital fills this void by allowing researchers to model different team compositions and the impact of energy-constrained scheduling on fairness.
How MARLHospital Works
MARLHospital simulates a multi-agent environment where healthcare workers (agents) must collaborate to complete medical procedures. It incorporates realistic elements such as agent energy levels, where performing strenuous tasks like chest compressions incurs an energy cost, necessitating task-switching among agents to prevent fatigue. The environment models common medical procedures like CPR and AED tasks, based on standard protocols from organizations like the American Red Cross.
The simulation allows for various team compositions: uniform teams (all agents have identical skills), specialized teams (agents are more efficient in specific tasks but can perform others), and interdependent teams (agents can only perform a subset of tasks, forcing cooperation).
FairSkillMARL in Action
The FairSkillMARL framework modifies the traditional reward function in multi-agent reinforcement learning to penalize both workload imbalance and skill-task misalignment. It uses a composite disparity metric (L3) that combines the Gini Index for workload imbalance (L1) and a skill-task alignment measure (L2). A tunable parameter, alpha (α), allows for adjusting the trade-off between prioritizing workload balance and skill alignment, while a scaling factor (λ) controls the strength of the fairness penalty.
Also Read:
- New AI System Reduces Bias in Information Retrieval
- Optimizing Federated Learning in Competitive Multi-Service Environments
Key Findings from Experiments
The research involved extensive experiments comparing FairSkillMARL with standard multi-agent reinforcement learning algorithms and other state-of-the-art fairness metrics. Here are some of the significant findings:
- Task Difficulty: Centralized Training with Decentralized Execution (CTDE) algorithms, such as VDN and QMIX, generally outperformed Independent Learning (IL) methods, especially as task complexity increased. VDN, in particular, showed strong performance.
- Team Composition: VDN consistently demonstrated superior performance across all team compositions, including uniform, specialized, and forced cooperation teams. However, forced cooperation scenarios presented the greatest coordination challenges, resulting in lower success rates across all algorithms.
- Energy Constraints: Surprisingly, the introduction of energy costs and the necessity for task-switching (e.g., during CPR) sometimes led to faster convergence for agents. This suggests that structured turn-taking, enforced by energy levels, can simplify coordination rather than hinder it, potentially leading to clearer role specialization.
- Fairness Metrics: FairSkillMARL, particularly with an alpha (α) value of 0.7 (balancing workload and skill alignment), showed statistical improvements in success rates compared to methods focusing solely on workload balancing. However, under very strong fairness penalties, simpler fairness shaping methods like FEN (Fair Efficient Network) sometimes achieved better overall success and workload balance, indicating a delicate balance in applying fairness constraints.
This work provides valuable tools and a foundational understanding for studying fairness in heterogeneous multi-agent systems, especially where aligning effort with expertise is critical, such as in healthcare. The MARLHospital environment and the FairSkillMARL framework pave the way for future research into multi-objective optimization and larger-scale applications in complex, safety-critical domains. You can read the full paper here.


