TLDR: A new research paper introduces “class-feature bias,” a type of bias where AI models in medical diagnosis rely on features strongly correlated with only a subset of classes, leading to poor generalization. The authors propose a “class-unbiased model” (Cls-unbias) that uses a novel inequality loss and group distributionally robust optimization to simultaneously mitigate this new bias and traditional class imbalance. Experiments on synthetic and real-world medical datasets demonstrate that Cls-unbias significantly improves model performance and generalization, making AI diagnoses more reliable and fair.
Artificial intelligence (AI) is rapidly transforming various fields, and medical diagnosis is no exception. However, the effectiveness and fairness of AI models in healthcare can be severely compromised by inherent biases within the data they are trained on. A recent research paper, titled “Class Unbiasing for Generalization in Medical Diagnosis,” by Lishi Zuo, Man-Wai Mak, Lu Yi, and Youzhi Tu, delves into a critical, often overlooked, form of bias and proposes an innovative solution to enhance the reliability and generalization of medical AI.
Traditionally, a common challenge in medical datasets is ‘class imbalance,’ where certain conditions or diagnoses are far more prevalent than others. This can lead models to prioritize the majority class, performing poorly on rarer, but often critical, minority classes. While methods like oversampling or class weighting have been developed to address this, the authors identify a more subtle yet pervasive issue: ‘class-feature bias.’
Class-feature bias occurs when AI models mistakenly rely on features that are strongly correlated with only a subset of classes, rather than features that are truly indicative across all classes. For instance, a model might learn to associate a high Body Mass Index (BMI) with diabetes because many diabetic patients in its training data have high BMI. However, BMI is not a definitive biomarker for diabetes, as many non-diabetic individuals also have high BMI. If the model heavily relies on BMI, it could misclassify non-diabetic patients as diabetic, leading to inaccurate diagnoses and potentially reinforcing harmful stereotypes.
The researchers propose a novel approach called the ‘class-unbiased model’ (Cls-unbias) designed to tackle both class imbalance and class-feature bias simultaneously. The core of their method involves two key components:
Class-wise Inequality Loss
This component aims to equalize the classification losses contributed by samples from both positive and negative classes. In simpler terms, it encourages the model to pay equal attention to learning from all classes, preventing it from becoming overly confident or reliant on features specific to just one class. If a model’s loss is very low for one class but high for another, it signals that it might be overfitting to features unique to the low-loss class, which is a hallmark of class-feature bias.
Also Read:
- Unmasking Vulnerabilities: Adversarial Attacks Threaten AI Medical Questionnaire Systems
- Unveiling AI’s Logic: New Neural Networks Offer Clearer Insights
Class-wise Group Distributionally Robust Optimization (G-DRO)
To enhance the effectiveness of the inequality loss, especially in severely imbalanced datasets, the Cls-unbias model incorporates G-DRO. This mechanism adaptively upweights the ‘underperforming’ or minority classes during training. This ensures that the model doesn’t neglect these crucial, less represented classes, providing a more robust foundation for the inequality loss to work effectively.
The paper demonstrates the impact of class-feature bias through a simple synthetic experiment. A standard model (trained with Empirical Risk Minimization, ERM) heavily relied on a ‘class-specific’ or ‘spurious’ feature, leading to poor performance on unseen data. In contrast, the Cls-unbias model learned to ignore this biased feature, focusing instead on truly ‘class-shared’ features, resulting in significantly improved generalization.
Beyond synthetic tests, the Cls-unbias method was rigorously evaluated on five real-world medical datasets, including speech-based datasets for depression and dementia detection (DAIC-WOZ, MODMA, ADReSS) and imaging datasets for breast cancer and diabetic retinopathy detection (BreastMNIST, RetinaMNIST). Across all these diverse datasets, the proposed Cls-unbias model consistently outperformed traditional methods like standard ERM and class-weighted ERM. Notably, even on datasets with nearly balanced classes, where class-weighting offered no benefit or even degraded performance, Cls-unbias still showed improvements, suggesting the presence of subtle, unseen class-feature bias that it effectively mitigated.
The authors highlight that the improvements were particularly pronounced in speech-based tasks, which are inherently more complex and prone to spurious correlations. This underscores the model’s ability to identify and reduce these hidden biases, leading to more reliable diagnostic outcomes.
The implications of this research extend beyond mere performance metrics. Class-feature bias, if left unaddressed, can lead to serious ethical concerns. Misdiagnoses, delayed treatments, and the reinforcement of healthcare disparities are potential consequences when AI models rely on biased features. By promoting the learning of features that are truly informative across all patient groups, the Cls-unbias model contributes to more stable, fair, and ethically responsible AI systems in medical diagnosis. For more technical details, you can refer to the full research paper available at arXiv.


