TLDR: FoMEMO introduces a novel paradigm for expensive multi-objective optimization using a pre-trained foundation model. Instead of relying on costly real-world data or rebuilding models for each new problem, FoMEMO trains a Transformer-based model on hundreds of millions of synthetic data points. This enables fast, in-context optimization for diverse, unknown problems without further model training, demonstrating superior adaptability, competitive performance, and significantly faster query times compared to existing methods.
Optimizing multiple conflicting objectives simultaneously is a common and critical challenge in many real-world scenarios, from designing new engineering components to discovering new drugs. These problems are often “expensive,” meaning that evaluating a potential solution can be costly and time-consuming. In such situations, finding the best solutions with a limited number of evaluations is paramount.
Traditional approaches to this problem, known as multi-objective Bayesian optimization, typically involve building complex statistical models called Gaussian processes for each objective. While effective, these models need to be rebuilt from scratch for every new problem, and their training and inference can be quite slow. Another set of methods relies on pre-training deep learning models using vast amounts of past real-world experimental data. However, acquiring such extensive datasets is often impractical, especially for new or emerging applications, and these models also struggle to generalize to unseen scenarios without dedicated retraining.
Addressing these limitations, researchers have introduced a groundbreaking new approach called FoMEMO, which stands for Foundation Models for Expensive Multi-objective Optimization. This innovative paradigm establishes a “foundation model” that can adapt to a wide range of optimization problems without needing to be retrained for each new task. Instead, it learns from a diverse set of hundreds of millions of synthetic data points, rather than relying on expensive real-world experiments.
How FoMEMO Works
The FoMEMO framework operates in two main stages: synthetic pre-training and in-context optimization.
During the synthetic pre-training stage, the foundation model is trained once using a massive, diverse dataset generated synthetically. This data simulates a vast array of potential real-world scenarios, including different problem characteristics and user preferences. The model learns to predict the likely outcomes of various solutions, conditioned on the history of evaluations (the “domain trajectory”) and specific user preferences for how objectives should be balanced.
Once pre-training is complete, the model enters the in-context optimization stage. When a user faces a new, expensive multi-objective problem, they simply provide the foundation model with the current evaluated solutions and their preferences. The pre-trained model then uses this contextual information to quickly predict the aggregated outcomes and their uncertainties. Based on these predictions, it suggests the next best solutions to evaluate, all without any further model training or updates during the optimization process. This “in-context” learning allows for rapid adaptation and efficient problem-solving.
Key Advantages and Performance
A significant advantage of FoMEMO is its ability to achieve superior adaptability and generalization to unknown problems. By pre-training on a diverse synthetic dataset, the model learns fundamental patterns that apply across many different optimization landscapes. This eliminates the need for costly real-world data collection for training and makes the approach highly practical for emerging applications.
The researchers developed various “acquisition functions” – strategies to guide the search for new solutions – which can be quickly optimized using the foundation model’s predictions. These include preference-based methods like Expected Improvement (EI) and Upper Confidence Bound (UCB), and a preference-free method called Uncertainty Hypervolume Improvement (UHVI).
Extensive experiments on both synthetic benchmarks and real-world engineering design problems demonstrated FoMEMO’s effectiveness. The methods, particularly FoMEMO-UCB, showed competitive or superior optimization performance compared to existing state-of-the-art techniques. Furthermore, FoMEMO exhibited a remarkable runtime advantage, generating candidate solutions orders of magnitude faster than other model-based algorithms. This efficiency is due to the foundation model’s fast inference capabilities, as it avoids the time-consuming model updates required by traditional methods.
Also Read:
- Large Language Models Transform Chemical Experiment Optimization
- LLM-Driven Policy Diffusion: A New Path to Generalization in Offline Reinforcement Learning
Future Directions
While FoMEMO represents a significant step forward, the researchers acknowledge areas for future exploration. These include enhancing the model’s scalability to even higher-dimensional problems, developing more advanced acquisition functions for parallel optimization, exploring alternative methods for synthetic data generation, and investigating whether fine-tuning with small amounts of domain-specific data could further boost performance.
This work offers a promising new direction for tackling complex, expensive optimization challenges, providing a more general, efficient, and adaptable tool for decision-makers across various fields. You can read the full research paper here.


