AI Discovers New Low-Discrepancy Point Sets and Improves Sobol' Sequences for QMC Methods

TLDR: A new study demonstrates how Large Language Models (LLMs) can be used in an evolutionary program synthesis framework to significantly improve Quasi-Monte Carlo (QMC) methods. The LLM-guided system discovered new 2D and 3D low-discrepancy point sets, setting new benchmarks, and optimized Sobol’ direction numbers, leading to reduced integration error for high-dimensional financial option pricing. This approach offers advantages like extensibility and generalizability, showcasing LLMs’ potential in automated scientific discovery.

A groundbreaking study introduces a novel approach to enhancing quasi-Monte Carlo (QMC) methods, which are crucial for high-dimensional numerical integration in various scientific and engineering fields. Traditionally, QMC methods rely on highly uniform point sets to achieve faster convergence rates than standard Monte Carlo methods. This research leverages Large Language Models (LLMs) within an evolutionary program synthesis framework to discover improved QMC designs.

The paper, titled “LLM-Guided Evolutionary Program Synthesis for Quasi-Monte Carlo Design” by Amir Sadikov, tackles two significant challenges in QMC: constructing finite low-discrepancy point sets and optimizing Sobol’ direction numbers to minimize integration error. The core idea is to treat these complex mathematical problems as program synthesis tasks, where an LLM iteratively mutates and refines code based on a task-specific fitness function.

The Evolutionary Approach

The methodology is built upon the OpenEvolve framework, an open-source implementation inspired by AlphaEvolve. This framework operates through an evolutionary loop:

Initialization: It starts with a population of Python programs that generate candidate solutions.
Evaluation: Each program’s output is assessed by a fitness function, which quantifies the quality of the solution (e.g., lower discrepancy or mean squared error).
Selection and Prompting: High-performing programs are chosen as ‘parents.’ An LLM is then prompted with the parent code, its fitness score, and instructions to generate a variation that improves the score.
Generation (Mutation): The LLM, acting as an intelligent mutation operator, creates new, modified programs.
Loop: The new programs are evaluated, scored, and added to the population, continuously refining solutions.

Discovering Low-Discrepancy Point Sets

For constructing low-discrepancy point sets, the research employed a two-phase strategy. Phase 1 focused on direct code construction, where the LLM generated Python code to build point sets. This phase started with simple heuristics like a shifted Fibonacci lattice. Phase 2 then shifted to iterative optimization, prompting the LLM to use numerical optimization routines (like scipy.optimize.minimize) to refine the point coordinates.

The results were remarkable. In 2D, the method rediscovered known optimal point sets for smaller numbers of points (N ≤ 10) and, more importantly, established new best-known benchmarks for larger N (e.g., N > 30). For instance, for N=100, the LLM-evolved method found a point set with a star discrepancy of 0.0150, significantly improving upon the previous best of 0.0188. In 3D, it matched known optimal sets for N ≤ 8 and provided new best-known benchmarks beyond this range.

Optimizing Sobol’ Direction Numbers

The framework was also applied to optimize Sobol’ direction numbers, which are critical parameters influencing the quality of Sobol’ sequences. The LLM focused its modifications on early dimensions, which are most impactful for applications like Asian option pricing. The fitness function here aimed to minimize the mean squared error (MSE) of randomized QMC (rQMC) estimates for a 32-dimensional Asian option price.

The LLM-discovered direction numbers led to a significant reduction in rQMC MSE for larger sample sizes (N ≥ 512) compared to the widely used Joe–Kuo parameters. Crucially, these improved parameters demonstrated strong generalizability across a diverse suite of high-dimensional exotic options, including Lookback, Basket, and Bermudan options, suggesting a more robust and broadly applicable Sobol’ sequence.

Also Read:

Implications and Future Directions

This LLM-driven evolutionary approach offers several advantages over fixed-point set generation methods. The discovered direction numbers are extensible to any sample size N, support progressive integration (adding more points for increased accuracy), allow for easy randomization to obtain unbiased error estimates, and are highly applicable to high-dimensional problems where direct coordinate optimization is intractable.

However, the performance gains did not extend to Barrier options, which have discontinuous payoffs. This suggests that the notion of an “optimal” Sobol’ sequence might be problem-dependent, with the current fitness function specializing in smoother integrands. Future work could explore multi-objective optimization to create more universally applicable direction numbers.

This research strongly supports the use of LLMs as core components in automated scientific discovery, capable of generating novel and valuable mathematical knowledge. The data and code for this research are openly available for further exploration and development. You can find more details in the full research paper: LLM-Guided Evolutionary Program Synthesis for Quasi-Monte Carlo Design.

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

AI Discovers New Low-Discrepancy Point Sets and Improves Sobol’ Sequences for QMC Methods

The Evolutionary Approach

Discovering Low-Discrepancy Point Sets

Optimizing Sobol’ Direction Numbers

Implications and Future Directions

Gen AI News and Updates

Google DeepMind Unveils SIMA 2: An Advanced AI Agent for Virtual 3D Worlds

Oracle Unveils ‘Ask Oracle’ Chatbot for Personalized Redwood Experience, Powered by Advanced Select AI

AlphaCast: A New Approach to Time Series Prediction Through Human-AI Collaboration

Boosting Business Efficiency: A New AI and Big Data Model for Process Optimization

AlphaCast: A New Approach to Time Series Prediction Through Human-AI Collaboration

New Graph Neural Networks Improve Reasoning in Assumption-Based Argumentation

Enhancing AI Reasoning: How Recursive Refinement and Multi-Agent Systems Improve Language Model Performance

ARGUS: A Proactive Framework for Enhancing Autonomous Driving Safety

Generative AI Powers Next-Gen Autonomous Emergency Response

OR-R1: Advancing Automated Optimization with Smart, Data-Efficient AI

Enhancing GUI Agents with Memory: A New Framework for History-Aware Reasoning

ProBench: A Deeper Look into How We Evaluate AI Agents for Mobile Apps

Enhancing Large Language Model Reasoning with Concise Outputs

Ensuring Trust in Autonomous AI: A Two-Layered Monitoring Approach for Agentic Systems

MedFuse: A Multiplicative Approach to Understanding Irregular Clinical Time Series Data

HyperD: A New Framework for More Accurate and Robust Traffic Predictions

Beyond Training: Researchers Propose ‘Model Raising’ for AI with Intrinsic Values

Bridging the Divide: Why AI Needs a Qualitative Revolution

Language Models Enhance Safety Certificate Synthesis for Dynamic Systems

Subscribe to get the latest news and updates