TLDR: This research paper introduces an enhanced classification method for multispectral point clouds (MPCs) that tackles common challenges in outdoor datasets: sparse labeling, varying land-cover scales, and long-tailed class distributions. The proposed method features a grid-balanced sampling strategy for robust training data generation, a multi-scale feature fusion module to preserve fine details across different scales, and an adaptive hybrid loss module to balance learning for both common and rare classes. Experimental results on three real-world datasets demonstrate significant improvements in classification accuracy, especially for underrepresented categories, compared to existing methods.
Multispectral point clouds (MPCs) are a powerful tool for understanding our environment, capturing both 3D spatial information and spectral data from observed scenes. This rich data can be used in many applications, from autonomous driving to target detection and scene analysis. However, classifying these complex datasets, especially those from outdoor environments, presents significant challenges.
Traditional classification methods often struggle with outdoor MPCs due to several key issues. Firstly, obtaining detailed ground truth labels for outdoor scenes is difficult, leading to sparsely labeled datasets. Secondly, land-covers in remote sensing datasets can vary greatly in size, making it hard for a single approach to learn features effectively across all scales. Lastly, the number of instances for different land-cover classes can be highly imbalanced, a problem known as a long-tailed distribution, which often results in poor classification accuracy for less common categories.
To address these persistent challenges, researchers have proposed an enhanced classification method based on adaptive multi-scale fusion for MPCs with long-tailed distributions. This novel approach integrates three main components designed to improve the entire classification process, from preparing the data to learning features and making final predictions.
A Smarter Way to Sample Data
One of the core innovations is a grid-balanced sampling strategy. In outdoor datasets, where manual labeling is expensive and ground truth data is sparse, traditional random sampling can lead to training sets with insufficient labeled points or overlaps between training and test data. The new grid-balanced sampling strategy adaptively identifies labeled areas and selects sample centroids based on the labeled ground truth rate. This ensures that training samples contain enough labeled points, improving the robustness and reliability of the classification model, and clearly separating training and testing data.
Fusing Features Across Scales
Land-covers in remote sensing datasets can differ by as much as 50 times in scale, from large buildings to small street lights. This variation makes it difficult for a network to extract fine features at a single scale. To overcome this, the method introduces a multi-scale feature fusion module. This module acts as a ‘skip connection’ within the network, aggregating detailed features from different layers of the encoder (which extracts initial features) and feeding them into the decoder (which reconstructs the classification). This process helps the network retain fine details of land-covers across various scales, significantly improving classification accuracy.
Also Read:
- MoSAiC: Enhancing Land Cover Classification in Remote Sensing with Hybrid Contrastive Learning
- RAPNet: A New Approach to Sharpening Satellite Images with Adaptive AI
Balancing Learning for All Classes
The problem of long-tailed distributions means that common classes are learned well, while rare classes are often overlooked, leading to poor classification. To combat this, an adaptive hybrid loss module has been developed. This module combines two types of loss functions: multi-scale loss and long-tailed loss. It uses multiple classification heads, each with adaptive weights, to learn different classes separately. This intelligent weighting system balances the learning ability across all categories, significantly boosting the classification performance for small and rare classes, which are typically challenging to identify accurately.
The effectiveness of this enhanced method has been demonstrated through extensive experiments on three real-world multispectral point cloud datasets: HIT, ZJK, and SK. The results show significant improvements in overall accuracy, average accuracy, kappa coefficient, and mean intersection over union compared to existing state-of-the-art methods. Notably, the proposed method achieved superior performance in classifying the ‘tail’ categories, which are often problematic for other approaches. While some ‘head’ (common) categories might see slightly lower individual scores compared to methods specifically optimized for them, the overall balance and accuracy across all classes are significantly better, leading to a higher overall performance.
This research represents a significant step forward in multispectral point cloud classification, offering a robust and effective solution for handling the complexities of real-world outdoor datasets. For more technical details, you can refer to the full research paper: An Enhanced Classification Method Based on Adaptive Multi-Scale Fusion for Long-tailed Multispectral Point Clouds.


