TLDR: A new method, RFF-GP-HSMM, is proposed to make unsupervised time-series segmentation faster and more scalable. It addresses the high computational cost of the existing GP-HSMM by using Random Fourier Features to approximate Gaussian processes, eliminating the need for slow matrix inversions. Experiments show it achieves similar accuracy to GP-HSMM but is significantly faster (up to 278 times), especially for large datasets, enabling real-time applications in fields like motion analysis and robotics.
Unsupervised segmentation is a powerful technique that automatically identifies recurring patterns within continuous time-series data. This capability is crucial for analyzing complex information that would be difficult to process manually, playing a vital role in various real-world applications, from industrial processes to understanding human and animal behaviors. For instance, humans naturally segment continuous sensory input, like speech and motion, into meaningful units to learn and recognize patterns. Reproducing this ability in artificial intelligence is key for advancements in areas like robot learning and language comprehension.
One highly accurate method for unsupervised time-series segmentation is the Gaussian Process Hidden Semi-Markov Model (GP-HSMM). This model excels at segmenting data without requiring pre-labeled examples, inferring segmentation points and categories through Bayesian inference. GP-HSMM has found success in diverse fields, including analyzing worker behavior in manufacturing, studying animal movements, and even in robotics for learning manipulation tasks.
However, despite its precision, GP-HSMM faces a significant hurdle: high computational cost. Its core mechanism involves Gaussian processes, which necessitate the inversion of a large kernel matrix during training. As the amount of data grows, this matrix inversion becomes extremely time-consuming, limiting GP-HSMM’s practical deployment, especially in industrial settings where real-time processing of large-scale data is essential.
To overcome this computational bottleneck, researchers have proposed a new method called RFF-GP-HSMM, which stands for Random Fourier Feature-based Gaussian Process Hidden Semi-Markov Model. The innovation lies in incorporating Random Fourier Features (RFF) to approximate the Gaussian process. This approximation allows the model to use linear regression instead of the more computationally intensive kernel matrix inversion, thereby preserving the model’s expressive power while drastically reducing computation time.
How RFF-GP-HSMM Works
In essence, Random Fourier Features provide a way to approximate complex kernel functions, like those used in Gaussian processes, with simpler basis functions. This transformation allows the problem to be treated as a linear regression, which is much faster to compute, especially when dealing with large datasets. Instead of inverting a matrix whose size depends on the number of data points (which can be huge), RFF-GP-HSMM only needs to invert a matrix whose size is fixed by a hyperparameter, regardless of the data scale.
Also Read:
- Unlocking the Secrets of Human Movement: A New AI Approach to Submovement Analysis
- Boosting Time Series Forecasts with Foundation Models and Conformal Prediction
Experimental Validation and Impact
The effectiveness of RFF-GP-HSMM was rigorously tested using the Carnegie Mellon University (CMU) motion-capture dataset, which includes various activities like running and jumping. The experiments compared the proposed method against the conventional GP-HSMM in terms of both segmentation accuracy and computational time.
The results were compelling. RFF-GP-HSMM achieved segmentation performance comparable to that of the traditional GP-HSMM, with a normalized Hamming distance (a measure of accuracy where lower is better) of approximately 0.35 for both methods. This demonstrates that the approximation using Random Fourier Features does not significantly degrade accuracy.
Where RFF-GP-HSMM truly shines is in computational efficiency. For a large-scale dataset comprising 39,200 frames (equivalent to about 2 hours and 43 minutes of time-series data), the conventional GP-HSMM took approximately 9 hours to process. In stark contrast, RFF-GP-HSMM completed the same task in just about 2 minutes, achieving an astonishing speed-up of approximately 278 times. This dramatic reduction in processing time makes real-time and scalable segmentation of long time-series data feasible, which was previously a major challenge.
This advancement has broad implications for various domains requiring time-series data segmentation, including motion analysis, speech processing, robotic control, industrial task analysis, and understanding animal behavior. The ability to perform fast and accurate unsupervised segmentation opens doors for more practical and widespread applications of AI in analyzing dynamic, continuous data. For more technical details, you can refer to the full research paper here.


