TLDR: A new AI method, Hierarchical Generalized Category Discovery for Brain Tumor Classification (HGCD-BT), has been developed to improve brain tumor classification. It integrates hierarchical clustering with contrastive learning to categorize both known and previously unseen tumor types. Evaluated on SRH and H&E stained images, HGCD-BT significantly outperforms existing methods, achieving a +28% accuracy improvement for patch-level classification and demonstrating strong performance on slide-level classification, especially in identifying novel tumor categories. This approach has the potential to enhance intra-operative decision-making and support precision oncology by discovering unrecognized tumor subgroups.
Accurate and timely classification of brain tumors is a critical step during neuro-oncological surgery, directly influencing treatment decisions and patient outcomes. However, current machine learning methods often face significant limitations. They are typically restricted to a predefined set of tumor types they were trained on, making them unable to identify or categorize novel tumor patterns not seen before. Furthermore, while unsupervised learning can extract general features, it struggles to incorporate existing medical knowledge from labeled data, and many semi-supervised methods assume all potential tumor classes are already represented in the labeled samples.
Understanding the Challenge in Brain Tumor Classification
The complexity and vast diversity of brain tumor types make it incredibly difficult to create comprehensive datasets that cover every possible pathology. This often leads to models that are blind to rare or newly identified tumor subtypes. The field of Generalized Category Discovery (GCD) aims to bridge this gap by enabling models to categorize both known and previously unknown classes within unlabeled data. However, brain tumor classification is inherently hierarchical, following structured taxonomies like the WHO classification system. Existing GCD methods often don’t fully leverage this crucial hierarchical information.
Introducing HGCD-BT: A Hierarchical Approach
To address these challenges, researchers have introduced Hierarchical Generalized Category Discovery for Brain Tumor Classification (HGCD-BT). This novel approach is specifically designed to integrate the hierarchical structure of brain tumor taxonomies directly into the learning process. HGCD-BT extends existing contrastive learning-based GCD methods by incorporating a new semi-supervised hierarchical clustering loss, allowing it to better capture the underlying pathological concepts.
How HGCD-BT Works
At its core, HGCD-BT uses a feature encoder to process image data. It then employs two main branches: a contrastive learning branch and a hierarchical clustering branch. The contrastive learning branch helps to organize the feature space so that similar data points (e.g., images of the same tumor type) are grouped closely together. The hierarchical clustering branch, powered by a novel semi-supervised loss, enforces a hierarchical structure, allowing the model to make a sequence of decisions to classify tumor types, much like how pathologists use a decision tree. This combination allows HGCD-BT to learn from both labeled and unlabeled data, and crucially, to discover new categories while respecting the natural hierarchy of brain tumor types.
Impressive Results Across Different Imaging Types
The effectiveness of HGCD-BT was evaluated on two distinct datasets. On OpenSRH, a dataset of stimulated Raman histology (SRH) brain tumor images, HGCD-BT achieved a remarkable +28% improvement in accuracy over state-of-the-art GCD methods for patch-level classification. This improvement was particularly significant in identifying previously unseen, or ‘novel,’ tumor categories. Furthermore, the method demonstrated its versatility and generalizability on slide-level classification using hematoxylin and eosin (H&E) stained whole-slide images from the Digital Brain Tumor Atlas (DBTA), confirming its utility across different imaging modalities and levels of detail.
Also Read:
- Enhancing Medical Object Detection Across Diverse Imaging Modalities
- New Benchmark and Dataset Enhance AI Diagnosis for Spine Disorders
The Impact on Clinical Practice and Future Directions
The development of HGCD-BT holds significant promise for neuro-oncology. By enabling more accurate and rapid identification of brain tumor types, it could shorten surgical intervention times and potentially reduce complications. Its ability to discover novel subtypes without extensive, costly, and labor-intensive annotations could greatly support diagnostic workflows and contribute to a more refined understanding of tumor biology, paving the way for precision oncology. The hierarchical nature of HGCD-BT also offers improved interpretability of the model’s decisions. While currently focused on common tumor types, future work aims to address rare tumor types, dynamically estimate the optimal number of hierarchical levels, and integrate multimodal data such as genomic or radiological information for an even more comprehensive disease understanding.
For more in-depth information, you can read the full research paper here.


