I2I-STRADA: A New Approach to Structured Data Analysis with AI Agents

TLDR: I2I-STRADA is a novel AI agent architecture for data analysis that formalizes the reasoning process. Unlike general-purpose LLMs, it uses a structured, modular workflow with distinct sub-tasks for analytical thinking, including goal construction, contextual grounding, and a two-stage adaptive planning and execution. It dynamically creates tools and handles execution state, leading to superior performance on benchmarks like DABstep and DABench by improving planning coherence and insight alignment in complex, real-world data scenarios.

In today’s fast-paced enterprise environments, dealing with vast amounts of diverse and often messy data for real-time analysis is a significant challenge. Traditional methods struggle with data in multiple formats, missing information, and evolving business needs. While advanced AI models, particularly large language models (LLMs), have shown promise in understanding unstructured data and adapting to changing information, they often fall short in providing a consistent, structured approach to analytical thinking.

This is where a new agentic architecture called I2I-STRADA, which stands for Information-to-Insight via Structured Reasoning Agent for Data Analysis, steps in. Developed by Sai Barath Sundar, Pranav Satheesan, and Udayaadithya Avadhanam from Mphasis Limited, I2I-STRADA aims to formalize the complex reasoning process involved in data analysis. Instead of treating reasoning as a ‘black box,’ it models how analysis unfolds through a series of modular sub-tasks that mirror the cognitive steps of human analytical reasoning.

How I2I-STRADA Works: A Structured Approach

The core of I2I-STRADA lies in its structured and modular design, built on two key principles: progressive abstraction, which means filtering out noise while keeping crucial information at each stage, and multi-step refinement, using a two-stage planning process to continuously improve reasoning quality.

The workflow begins with Goal Construction. Here, the agent interprets the user’s query to understand the main intent, identify key data points, outline a preliminary strategy, and note any specific conditions. This initial understanding is crucial for guiding subsequent steps.

Next, the Contextual Reasoner acts as a bridge, refining the initial goal by incorporating contextual information. This includes referencing metadata about data systems and standard operating procedures (SOPs) to ensure the plan aligns with available data structures and specific domain rules.

The system then moves into a two-stage planning process. First, Workflow Scaffolding generates a high-level, global plan before the agent even interacts with the actual data. This foundational ‘scaffold’ guides the entire analysis. Following this, the Adaptive Planning and Executor takes over. This is an iterative module that generates detailed, execution-level plans. Crucially, it dynamically adjusts subsequent steps based on the results of prior actions, including actual data exploration and intermediate outcomes. This adaptability is vital for complex tasks, as real-world data interaction often informs the best path forward. The execution involves writing and running Python code snippets in a secure environment.

Supporting these core reasoning steps are other vital components: a Context-Aware Tool Creation module that dynamically builds data processing tools and scripts on the fly, essential for handling diverse data sources; a Dynamic State Handler that acts as the agent’s working memory, maintaining execution context and enabling debugging; and a Communication Handler that ensures the final results are presented clearly, address user goals, and conform to required formats.

Also Read:

Performance and Impact

I2I-STRADA’s effectiveness and generalizability have been rigorously tested on two prominent benchmark datasets: DABstep and DABench. The DABstep dataset, which focuses on financial and operational data with procedural constraints, saw I2I-STRADA outperform several state-of-the-art data science agents. It achieved an impressive 80.56% accuracy on easy tasks and 28.04% on hard tasks, demonstrating superior planning and error handling, especially when adhering to specific rules.

On the DABench benchmark, which covers a wide array of end-to-end data science tasks across various domains like marketing, finance, and energy, I2I-STRADA also showed strong performance with 90.27% accuracy. This highlights its robustness across different types of data analysis tasks, whether domain-specific or purely statistical.

While the system shows remarkable strengths, the authors note areas for improvement, such as inconsistent handling of “Null” values in certain scenarios and the impact of hyperparameter choices in machine learning algorithms. Nevertheless, I2I-STRADA significantly advances the field by addressing the limitations of general LLMs in complex analytical scenarios, offering a more reliable and interpretable approach to data analysis.

This innovative architecture promises to further the development of sophisticated AI agents capable of comprehensive data analysis in real-world settings. For more details, you can read the full research paper here.

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

I2I-STRADA: A New Approach to Structured Data Analysis with AI Agents

How I2I-STRADA Works: A Structured Approach

Performance and Impact

Gen AI News and Updates

Amazon Bedrock’s A2A Protocol: The Catalyst for Next-Gen Cross-Framework Multi-Agent AI Systems

Astreya Unveils New Wave of Enterprise AI Agents to Boost Business Efficiency and Automation

Vida Secures $4 Million Series A Funding to Advance AI Voice Technology and Expand Leadership

Boosting Business Efficiency: A New AI and Big Data Model for Process Optimization

AlphaCast: A New Approach to Time Series Prediction Through Human-AI Collaboration

New Graph Neural Networks Improve Reasoning in Assumption-Based Argumentation

Enhancing AI Reasoning: How Recursive Refinement and Multi-Agent Systems Improve Language Model Performance

ARGUS: A Proactive Framework for Enhancing Autonomous Driving Safety

Generative AI Powers Next-Gen Autonomous Emergency Response

OR-R1: Advancing Automated Optimization with Smart, Data-Efficient AI

Enhancing GUI Agents with Memory: A New Framework for History-Aware Reasoning

ProBench: A Deeper Look into How We Evaluate AI Agents for Mobile Apps

Enhancing Large Language Model Reasoning with Concise Outputs

Ensuring Trust in Autonomous AI: A Two-Layered Monitoring Approach for Agentic Systems

MedFuse: A Multiplicative Approach to Understanding Irregular Clinical Time Series Data

HyperD: A New Framework for More Accurate and Robust Traffic Predictions

Beyond Training: Researchers Propose ‘Model Raising’ for AI with Intrinsic Values

Bridging the Divide: Why AI Needs a Qualitative Revolution

Language Models Enhance Safety Certificate Synthesis for Dynamic Systems

Subscribe to get the latest news and updates