TLDR: This research introduces a novel framework using the One-Class Support Vector Machine (OC-SVM) algorithm and crowdsourced data to accurately map mobile network coverage and identify weak spots. Unlike traditional methods, OC-SVM effectively captures complex, non-convex coverage boundaries, leading to more precise identification of service deficiencies, especially in urban environments, and enabling better network planning and enhanced user Quality of Experience.
Understanding where mobile network coverage is strong and where it falters is crucial for network operators aiming to provide the best possible experience for their users. Traditionally, assessing mobile coverage has relied on methods like “drive testing,” where specialized equipment is used to collect signal data along predefined routes. While precise, this method is expensive, time-consuming, and limited in its reach, often missing indoor coverage details or areas not along major roads.
A more modern approach involves using “crowdsourced data,” which comes from measurements collected by mobile apps on countless consumer smartphones. This method offers vast geographic and temporal coverage at a much lower cost. It can even capture instances of “no signal” indoors, revealing complete coverage holes that traditional methods might miss. However, analyzing this crowdsourced data to accurately pinpoint coverage gaps presents its own challenges due to variations in device quality, GPS inaccuracies, and sampling biases.
This paper introduces a new framework for analyzing mobile coverage and identifying weak spots using this crowdsourced data. The core of their methodology focuses on analyzing coverage at the individual cell (antenna) level, which is then combined to assess entire sites. A significant innovation in this research is the application of the One-Class Support Vector Machine (OC-SVM) algorithm to calculate mobile network coverage. Unlike simpler geometric methods, such as creating a convex hull around service points, the OC-SVM can model complex, non-convex coverage boundaries. This means it can accurately represent irregular shapes, internal holes, and concavities in coverage that are often caused by terrain, buildings, or other obstructions, especially in busy urban areas.
The OC-SVM algorithm treats coverage estimation as a one-class classification problem. Given data points where usable service was observed, it learns the “support” of that distribution, effectively drawing a boundary around the areas with good service. Points outside this boundary are then identified as likely weak or no-coverage zones. The use of a Radial Basis Function (RBF) kernel helps create smooth, locality-aware boundaries that wrap tightly around dense regions of positive signal evidence without being distorted by isolated data points.
The researchers also highlight the importance of tuning the OC-SVM’s hyperparameters, specifically ‘ν’ (nu) and ‘γ’ (gamma). The ‘ν’ parameter controls the trade-off between overfitting and underfitting, influencing how tight or loose the coverage boundary is. The ‘γ’ parameter governs the width of the RBF kernel, affecting the smoothness and detail of the decision boundary. They used a temporal cross-validation strategy, training the model on past data (e.g., January) and validating it on future data (e.g., February), to select the best hyperparameters that balance precision and recall.
A key aspect of their approach is partitioning the crowdsourced data by signal levels and training a separate OC-SVM boundary for each level. This allows for layered, nested boundaries that differentiate between zones of stronger and weaker coverage. When comparing their OC-SVM method against the traditional convex hull approach, the OC-SVM consistently showed better performance, particularly in areas with variable or poor signal levels where coverage boundaries are more intricate. This was quantified using the F1 score, which balances both precision (avoiding false positives) and recall (identifying true coverage areas).
The findings suggest that this OC-SVM model is better equipped to handle the complexities of real-world mobile network coverage. By accurately identifying coverage gaps and weak spots, network operators can make more informed decisions about where to deploy additional resources, such as new cell sites or signal boosters. This targeted approach leads to more efficient resource use and ultimately enhances the Quality of Experience for end users. This novel application of machine learning to crowdsourced data for coverage estimation represents a significant advancement in wireless communications.
Also Read:
- Understanding Urban Dynamics: A Unified AI Model for Simulating Mobility and Mobile Traffic
- Locaris: A New AI Model for Highly Accurate and Robust Indoor Wi-Fi Positioning
For more detailed information, you can read the full research paper available here.


