TLDR: A study by Duong Mai and Lawrence Hall demonstrates that injecting various types of noise (Gaussian, Speckle, Poisson, Salt and Pepper) during the training of deep learning models significantly improves their ability to generalize to new, unseen data sources (out-of-distribution data). This technique helps models avoid learning ‘shortcuts’ specific to their training data, making them more robust for critical applications like COVID-19 detection from Chest X-rays, especially when training datasets are small. The research also highlights that while noise injection is effective, the diversity and composition of the initial training data sources remain crucial factors influencing overall generalization.
Deep learning models have shown remarkable capabilities in image recognition, but they often struggle when faced with data from new sources, such as different medical devices or patient populations. This challenge, known as out-of-distribution (OOD) generalization, is particularly critical in healthcare applications like detecting COVID-19 from Chest X-rays (CXRs). Models tend to learn ‘shortcuts’ – specific patterns or artifacts unique to their training data – instead of focusing on true biological markers, leading to poor performance on unseen data.
A recent study by Duong Mai and Lawrence Hall investigates a straightforward yet effective technique: injecting fundamental types of noise during the training process of deep learning models. Their research explores how adding Gaussian, Speckle, Poisson, and Salt and Pepper noise can make these models more robust to shifts in data distribution.
The Problem with Shortcuts and Limited Data
In safety-critical fields like healthcare, the trustworthiness of AI models hinges on their ability to generalize reliably to new data. For COVID-19 detection from CXRs, models frequently exploit source-specific artifacts, which are not relevant to new clinical environments. This ‘shortcut learning’ means a model might perform well on data it was trained on (in-distribution or ID data) but fail dramatically on data from a different hospital or region.
While noise-based data augmentation is a known strategy to improve model robustness against various perturbations, its impact on generalizing to entirely new data sources, especially when training data is scarce, has been largely unexplored. This paper addresses that gap.
How the Study Was Conducted
The researchers trained a deep learning model to classify COVID-19 versus non-COVID-19 pneumonia from CXR images. To mimic real-world healthcare scenarios where data is often limited due to privacy concerns, they trained their model on a small subset of data from a single medical network. This ‘in-distribution’ (ID) data came from the BIMCV-COVID-19+ and Padchest datasets in Spain.
For evaluating generalization, they used ‘out-of-distribution’ (OOD) data from multiple external medical institutions, including COVID-19-AR (USA), V2-COV19-NII (Germany), NIH (USA), and Chexpert (USA). Before training, all images underwent a two-step preprocessing pipeline: isolating the chest area using a pre-trained neural network and then normalizing and resizing the images.
Noise Injection and Model Training
To enhance robustness, the study employed a noise-based data augmentation strategy during training. Four types of noise—Gaussian, Speckle, Poisson, and Salt and Pepper—were applied randomly to images in each training epoch. These noise types simulate common artifacts that can occur during image acquisition, transmission, or storage.
The model used was a ResNet-50 architecture, a common choice for image recognition. Given the limited training data, transfer learning was utilized, where a pre-trained feature extractor was frozen, and only the classification head was fine-tuned. The models were trained using standard deep learning techniques, including a binary cross-entropy loss function and an Adam optimizer, with early stopping to prevent overfitting.
The researchers compared two conditions: a ‘Baseline model’ trained without any noise augmentation and a ‘Noise-based Augmentation’ model trained with the described noise injection. Both models were evaluated on ID and OOD test sets using key metrics such as AUC, F1, accuracy, recall, and specificity.
Key Findings
The empirical results demonstrated that noise-based data augmentation significantly improved the model’s generalization to external data sources. When trained on BIMCV-COVID-19+ and Padchest, noise injection reduced the performance gap between ID and OOD evaluations across the five key metrics from a range of 0.03–0.18 down to 0.01–0.08. This highlights the technique’s ability to help models generalize even with limited training data size and source diversity.
The study also included ablation studies, where the composition of the ID training sources was varied. While noise injection still improved generalization in these scenarios, the researchers noted that a dramatic gap between ID and OOD evaluation could persist. This suggests that the inherent composition and dissimilarity of the training data sources play a pivotal role in guiding the model to learn generalizable biomarkers. When training sources are highly dissimilar, models might still rely on shortcuts that don’t translate well to new distributions, even with noise injection.
Also Read:
- Generative Hints: A Novel Method for Enhancing Model Invariance in Deep Learning
- Enhancing Deep Learning for Medical Imaging with Composite Input Images
Conclusion
In summary, for deep learning models trained on limited datasets with restricted source diversity, noise-based data augmentation can significantly enhance their ability to generalize to new, unseen data from different sources. This is crucial for applications like medical image analysis where OOD generalization is a major hurdle. However, the study also underscores the importance of carefully considering the composition of training data, as it profoundly influences a model’s capacity to learn robust, generalizable features that remain valid under distributional shifts. You can find the full research paper here: Noise Injection: Improving Out-of-Distribution Generalization for Limited Size Datasets.


