TLDR: A new research paper introduces Self-Grounded Verification (SGV), a two-step method to mitigate ‘agreement bias’ in Multimodal Large Language Models (MLLMs) when they act as verifiers for AI agent behavior. Agreement bias causes MLLMs to incorrectly validate flawed agent actions. SGV first prompts the MLLM to generate unbiased ideal task completion steps, then uses these self-generated priors to accurately evaluate agent trajectories. This approach significantly improves MLLM verification accuracy and enables effective real-time supervision for agents in web, computer, and robotic environments, setting new performance benchmarks.
Multimodal Large Language Models, or MLLMs, are powerful AI systems that can understand and process information from various sources, like text and images. They are increasingly being explored for their potential to act as ‘verifiers’ – functions that evaluate the behavior of other AI agents. Imagine an AI agent trying to complete a task, like buying a specific item online or performing actions on a computer. An MLLM verifier would assess if the agent’s steps were correct and if the task was successfully completed.
However, a significant challenge has emerged in this area: a phenomenon called ‘agreement bias’. This bias causes MLLMs to strongly favor information already present in their context window, even if that information describes flawed or incomplete behavior. For example, if an agent’s trajectory (sequence of actions) is provided to an MLLM for evaluation, the MLLM might generate reasoning to rationalize the agent’s mistakes, leading to an incorrect judgment of ‘success’ when the agent actually failed. This bias is widespread across different MLLM models and remains persistent even with advanced testing techniques.
Introducing Self-Grounded Verification (SGV)
To tackle this critical limitation, researchers have proposed a new, lightweight method called Self-Grounded Verification (SGV). SGV aims to make MLLMs more effective at leveraging their vast knowledge and reasoning abilities by using a two-step process.
First, the MLLM is prompted to generate a broad set of ‘priors’ or ideal steps for successfully completing a given task. Crucially, this initial generation happens without the MLLM seeing the specific agent’s trajectory that it will later evaluate. This ensures that the MLLM’s initial understanding of what success looks like is unbiased and based purely on its general knowledge.
In the second step, the MLLM then evaluates the candidate agent trajectory, but this time, it does so while being ‘grounded’ by the priors it generated in the first step. This means the MLLM compares the agent’s actual behavior against its own independently generated ideal steps, leading to a more objective and accurate assessment.
Also Read:
- Unlocking Reliable AI Reasoning Through Hidden Cognitive Signals
- Bridging Neural Networks and Symbolic AI: A New Approach to Language Model Reasoning
Impact and Applications
The implementation of SGV has shown remarkable improvements. MLLM verifiers enhanced with SGV have demonstrated gains of up to 20 percentage points in their ability to detect failures and up to 11 percentage points in overall accuracy. This method introduces minimal computational overhead and can be easily integrated into existing systems.
SGV’s effectiveness extends to various real-world applications. It significantly improves the automatic evaluation of AI agent trajectories in diverse environments, including web navigation tasks (VisualWebArena), computer system interactions (OSWorld), and even robotic manipulation (robomimic). Furthermore, SGV enables MLLMs to provide real-time supervision and feedback to guide agents during task execution, helping them correct mistakes and achieve better outcomes. For instance, a ReAct agent, when paired with an SGV-enhanced verifier, achieved a new state-of-the-art performance on the VisualWebArena benchmark, surpassing previous bests by a substantial margin.
This research, detailed in the paper “Let’s Think in Two Steps: Mitigating Agreement Bias in MLLMs with Self-Grounded Verification”, highlights a crucial step forward in making MLLMs more reliable and trustworthy evaluators for complex AI agent behaviors, paving the way for more robust and capable AI systems.


