Unlocking Program Paths: AUTO STUB's AI-Powered Stubs

TLDR: AUTO STUB is a novel approach that uses Genetic Programming to automatically generate symbolic stubs for external functions encountered during symbolic execution. This addresses a major limitation in software testing where external functions act as ‘black boxes,’ hindering analysis. By generating training data from random inputs and outputs, AUTO STUB’s AI derives expressions that approximate function behavior, allowing symbolic execution to continue without manual intervention. The system achieves over 90% accuracy for 55% of evaluated functions, enabling the exploration of previously intractable program paths and revealing language-specific edge cases crucial for software testing.

Software testing is a critical process for ensuring the reliability and security of applications. One powerful technique used in this domain is symbolic execution, which explores all possible program paths by representing inputs as symbols rather than concrete values. This method can uncover bugs and vulnerabilities that might be missed by traditional testing. However, symbolic execution faces a significant hurdle when it encounters ‘external functions’ – these are parts of a program that rely on native methods, third-party libraries, or uninstrumented code. Such functions act like black boxes, making it impossible for symbolic execution to understand their internal behavior and thus halting the analysis of any code dependent on them.

The Challenge of External Functions in Symbolic Execution

Imagine a program that checks a user’s input using a function called verify_input. If this function is external, symbolic execution cannot determine the relationship between the user’s input and the function’s output. This means it can’t explore different scenarios, like what happens if verify_input returns true or false, effectively blocking the analysis of subsequent code. Current solutions often involve manual intervention, where developers write ‘symbolic stubs’ – simplified models that approximate the external function’s behavior. This process is time-consuming, prone to errors, and requires deep understanding of the external code, making it a bottleneck in comprehensive software testing.

Introducing AUTO STUB: An Automated Solution

To overcome this limitation, researchers have developed AUTO STUB, a novel approach that automates the creation of these symbolic stubs using Genetic Programming. Genetic Programming is a type of machine learning inspired by biological evolution, where computer programs ‘evolve’ over generations to solve a specific task. AUTO STUB integrates seamlessly into the symbolic execution process. When an external function is encountered, AUTO STUB first generates a diverse set of random inputs and observes the corresponding outputs from the actual external function. This input-output data then serves as training material for Genetic Programming.

How AUTO STUB Leverages Genetic Programming

Genetic Programming in AUTO STUB works by searching for mathematical or logical expressions that accurately mimic the relationship between the observed inputs and outputs. These expressions, once found, become the symbolic stubs. The system uses Grammar-Guided Genetic Programming (G3P) to ensure that the generated expressions are syntactically correct and maintain type consistency across different data types (like integers, floating-point numbers, and strings). A wide range of operators, including mathematical, logical, and string manipulation functions, are used as building blocks for these expressions. The ‘fitness’ of an expression is measured by how well its predicted outputs match the actual outputs, using metrics like Normalized Root Mean Squared Error for numbers, classification accuracy for Booleans, and Levenshtein distance for strings.

Behind the Scenes: Data Generation and Evaluation

To ensure the generated stubs are robust, AUTO STUB employs a sophisticated input generation strategy. For numerical types, it uses stratified sampling to cover a wide range of magnitudes, including special values like NaN (Not-a-Number), Infinity, and min/max values. For strings, it creates random sequences of varying lengths. This comprehensive data generation is crucial for training the Genetic Programming algorithm effectively. The system was evaluated on a benchmark dataset of 273 Java methods from internal libraries, focusing on primitives and mathematical operations, ensuring they had no side effects and returned primitive or string types.

Real-World Impact and Accuracy

The results of AUTO STUB are promising. The system demonstrated that it could automatically approximate external functions with over 90% accuracy for 55% of the functions evaluated. This significantly outperforms a random baseline, proving the effectiveness of its targeted search strategy. For instance, AUTO STUB successfully inferred complex, language-specific behaviors, such as how Java handles Double.isNaN(double), by generating an expression that captures the unique properties of NaN. These insights are invaluable for identifying edge cases and potential bugs in software. While most generated stubs worked flawlessly with symbolic execution engines, some challenges arose when the underlying SMT (Satisfiability Modulo Theories) solver interpreted language-specific semantics (like NaN or Infinity) differently than Java, highlighting an area for future refinement.

Also Read:

Navigating Limitations and Future Directions

Currently, AUTO STUB is limited to stateless functions, meaning it cannot handle objects that retain internal state, such as StringBuilder. Extending its capabilities to stateful objects remains an open challenge, potentially by approximating sequences of calls rather than single functions. Additionally, the generated symbolic stubs are intentionally kept computationally simple to ensure fast solving, meaning they approximate functions with regular complexity rather than Turing-complete behaviors like loops or recursion. Despite these limitations, AUTO STUB represents a significant step forward in automating software testing, making symbolic execution more practical and less reliant on manual effort. For more technical details, you can refer to the full research paper: AUTO STUB : Genetic Programming-Based Stub Creation for Symbolic Execution.

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Unlocking Program Paths: AUTO STUB’s AI-Powered Stubs

The Challenge of External Functions in Symbolic Execution

Introducing AUTO STUB: An Automated Solution

How AUTO STUB Leverages Genetic Programming

Behind the Scenes: Data Generation and Evaluation

Real-World Impact and Accuracy

Navigating Limitations and Future Directions

Gen AI News and Updates

Amazon Bedrock’s A2A Protocol: The Catalyst for Next-Gen Cross-Framework Multi-Agent AI Systems

AZTECH Introduces Comprehensive AI Training Series to Propel Regional Digital Transformation

Financial Sector Fortifies Against Surging AI-Powered Scams

Boosting Business Efficiency: A New AI and Big Data Model for Process Optimization

AlphaCast: A New Approach to Time Series Prediction Through Human-AI Collaboration

New Graph Neural Networks Improve Reasoning in Assumption-Based Argumentation

Enhancing AI Reasoning: How Recursive Refinement and Multi-Agent Systems Improve Language Model Performance

ARGUS: A Proactive Framework for Enhancing Autonomous Driving Safety

Generative AI Powers Next-Gen Autonomous Emergency Response

OR-R1: Advancing Automated Optimization with Smart, Data-Efficient AI

Enhancing GUI Agents with Memory: A New Framework for History-Aware Reasoning

ProBench: A Deeper Look into How We Evaluate AI Agents for Mobile Apps

Enhancing Large Language Model Reasoning with Concise Outputs

Ensuring Trust in Autonomous AI: A Two-Layered Monitoring Approach for Agentic Systems

MedFuse: A Multiplicative Approach to Understanding Irregular Clinical Time Series Data

HyperD: A New Framework for More Accurate and Robust Traffic Predictions

Beyond Training: Researchers Propose ‘Model Raising’ for AI with Intrinsic Values

Bridging the Divide: Why AI Needs a Qualitative Revolution

Language Models Enhance Safety Certificate Synthesis for Dynamic Systems

Subscribe to get the latest news and updates