Enhancing Mitotic Figure Detection in Cancer Diagnostics with Attention-Guided AI

TLDR: Researchers have developed a novel AI model for the MIDOG 2025 challenge, aiming to improve mitotic figure detection in cancer pathology. Their approach extends the FCOS object detector by integrating a Feedback Attention Ladder CNN (FAL-CNN) and a fusion network. This composite system classifies mitotic figures and refines bounding box predictions to reduce false positives, enhance accuracy, and improve generalizability across diverse datasets. While initial F1 scores were lower than the baseline, the model is expected to perform better on unseen data and offers explainability through its attention mechanism.

Accurate detection of mitotic figures, which are cells undergoing division, is crucial for grading cancer and predicting its progression in histopathological images. However, this task is challenging for Artificial Intelligence (AI) due to variations in tissue appearance, staining, and how mitotic figures look across different datasets. The MIDOG 2025 challenge aims to find solutions that can generalize well across these variations for mitotic segmentation and grading.

Researchers Andrew Broad, Jason Keighley, Lucy Godson, and Alex Wright from the National Pathology Imaging Cooperative, UK, have introduced a new method to improve mitotic figure detection. Their approach, detailed in their paper MIDOG 2025: Mitotic Figure Detection with Attention-Guided False Positive Correction, extends an existing object detection system called FCOS (Fully Convolutional One-Stage Object Detector).

A Novel Approach to Mitosis Detection

The core of their innovation is a composite model that integrates a Feedback Attention Ladder CNN (FAL-CNN) with the FCOS detector. The FAL-CNN is specifically designed to classify whether a detected figure is a normal or abnormal mitotic figure. This classification information then feeds into a separate ‘fusion network’. This fusion network is trained to make precise adjustments to the bounding boxes (the predicted outlines around detected objects) initially suggested by the FCOS model.

The primary goal of this sophisticated network is to significantly reduce the number of ‘false positives’ – instances where the system incorrectly identifies something as a mitotic figure. By doing so, the model aims to enhance the overall accuracy of object detection and, crucially, improve its ability to perform well on new, unseen datasets, making it more generalizable.

How the System Works

The image processing pipeline consists of four main components working in harmony:

FCOS Object Detector: This is the initial stage, a fully convolutional network that identifies potential mitotic figures and draws bounding boxes around them, along with a confidence score.
FAL-CNN Classifier: This component takes small sections (mini-patches) centered on the FCOS-predicted bounding boxes. It uses a hierarchical feedback attention mechanism to classify these patches as either mitotic or non-mitotic. It also generates a ‘spatial attention map’ that highlights important cellular features, guiding the system’s focus. The FAL-CNN is built upon the VGG19 architecture and includes a feedback pathway that enhances its response to salient image regions.
Fusion Network: This is where the magic of refinement happens. It takes the spatial attention map and the probability of mitosis (pmitosis) from the FAL-CNN, along with the original FCOS scores. Using a series of connected layers, it calculates precise adjustments – coordinate offsets and score multipliers – to fine-tune the position and confidence score of the FCOS-predicted bounding boxes. The fusion network is trained while the FCOS and FAL-CNN parts remain fixed.
Mini-patch Sampling: This step involves extracting 56×56 pixel samples from the input image, centered around the bounding box locations predicted by FCOS. These mini-patches are then fed into the FAL-CNN for classification.

Training and Performance

The FAL-CNN and fusion network were trained using the MIDOG++ dataset, which is publicly available. The FAL-CNN was specifically trained on a balanced set of nearly 12,000 mitotic and 12,000 non-mitotic figure patches. For the MIDOG 2025 Track 1 Evaluation phase, the composite model achieved an F1 score of 0.655. Interestingly, the baseline FCOS model, when trained on the same MIDOG++ data, achieved a higher F1 score of 0.767 in this initial test phase.

Also Read:

Looking Ahead

While the initial performance was lower than the baseline, the researchers are optimistic about their model’s potential. They hypothesize that their attention-guided approach will be more robust and less affected by variations in datasets and different imaging domains. The true test of this generalizability will come with final analysis on unseen test sets. Furthermore, the FAL-CNN’s feedback mechanism offers a valuable benefit: it can highlight the specific features that lead to its classifications, providing a degree of explainability for its decisions, which is a significant advantage in AI applications for pathology.

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Enhancing Mitotic Figure Detection in Cancer Diagnostics with Attention-Guided AI

A Novel Approach to Mitosis Detection

How the System Works

Training and Performance

Looking Ahead

Gen AI News and Updates

HKU Spearheads AI Integration in Hong Kong’s Digital Education Future

UNESCO’s 43rd General Conference Concludes with New Leadership and Landmark Ethics Frameworks for Technology

BRYGE AI Secures Silver Stevie® Award for Groundbreaking Health Tech Product for Women

Boosting Business Efficiency: A New AI and Big Data Model for Process Optimization

AlphaCast: A New Approach to Time Series Prediction Through Human-AI Collaboration

New Graph Neural Networks Improve Reasoning in Assumption-Based Argumentation

Enhancing AI Reasoning: How Recursive Refinement and Multi-Agent Systems Improve Language Model Performance

ARGUS: A Proactive Framework for Enhancing Autonomous Driving Safety

Generative AI Powers Next-Gen Autonomous Emergency Response

OR-R1: Advancing Automated Optimization with Smart, Data-Efficient AI

Enhancing GUI Agents with Memory: A New Framework for History-Aware Reasoning

ProBench: A Deeper Look into How We Evaluate AI Agents for Mobile Apps

Enhancing Large Language Model Reasoning with Concise Outputs

Ensuring Trust in Autonomous AI: A Two-Layered Monitoring Approach for Agentic Systems

MedFuse: A Multiplicative Approach to Understanding Irregular Clinical Time Series Data

HyperD: A New Framework for More Accurate and Robust Traffic Predictions

Beyond Training: Researchers Propose ‘Model Raising’ for AI with Intrinsic Values

Bridging the Divide: Why AI Needs a Qualitative Revolution

Language Models Enhance Safety Certificate Synthesis for Dynamic Systems

Subscribe to get the latest news and updates