
FloodVision: AI and Knowledge Graphs Combine for Precise Urban Flood Depth Estimation

TLDR: FloodVision is a zero-shot AI framework that accurately estimates urban flood depth. It integrates GPT-4o’s semantic reasoning with a domain knowledge graph (FloodKG) containing verified object dimensions. This approach identifies reference objects in images, retrieves their heights from FloodKG to prevent AI hallucination, and calculates submergence ratios. Evaluated on crowdsourced images, FloodVision achieved an 8.17 cm mean absolute error, a 20.5% improvement over a GPT-4o-only baseline, demonstrating enhanced accuracy and generalization for real-time flood response.

Urban flooding is a growing concern, causing significant damage and disrupting daily life. Accurate and timely information about floodwater depth is crucial for emergency services, road accessibility, and overall urban resilience. Traditional methods for estimating flood depth often fall short, being either too slow, spatially limited, or computationally intensive.

Recent advancements in computer vision have offered new ways to detect floods, but estimating precise water depth remains a challenge. Many existing computer vision methods struggle with accuracy and generalization because they rely on fixed object detectors and require extensive, task-specific training data. This often means they can’t adapt well to diverse flood scenarios or when specific reference objects aren’t clearly visible.

A significant hurdle for advanced AI models, particularly vision-language models (VLMs), in this domain is their tendency for “quantitative hallucination.” This means they might generate plausible but incorrect estimations for real-world object dimensions, undermining their reliability in critical applications like flood depth measurement.

To address these limitations, researchers have developed a novel framework called FloodVision. This innovative system combines the powerful semantic reasoning capabilities of a foundation vision-language model, specifically GPT-4o, with a carefully structured domain knowledge graph. The core idea is to ground the AI’s reasoning in physical reality by providing it with verified real-world dimensions of common urban objects.

FloodVision works by dynamically identifying visible reference objects in standard RGB images, such as vehicles, people, or infrastructure elements. Once identified, it retrieves their canonical heights from a specialized “FloodKG” (Flood Knowledge Graph). This knowledge graph acts as a reliable source of truth, preventing the VLM from hallucinating object dimensions. The system then estimates how much of each object is submerged and applies statistical filtering to ensure the final depth values are accurate and reliable, removing any anomalous readings.
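The per-object arithmetic described above is straightforward: each reference object implies a depth equal to its known height times its submerged fraction, and anomalous readings are then filtered out. The sketch below illustrates this under assumptions of ours, not the paper's: the function names, the example heights and fractions, and the use of a median-absolute-deviation filter as a stand-in for the paper's unspecified statistical filtering are all hypothetical.

```python
import statistics

def estimate_depth(canonical_height_cm: float, submerged_fraction: float) -> float:
    # Depth implied by one reference object: its verified height
    # times the fraction of it judged to be under water.
    return canonical_height_cm * submerged_fraction

def robust_depth(readings: list[float], k: float = 3.0) -> float:
    # Stand-in for the paper's statistical filtering step: drop
    # readings far from the median (using the scaled median absolute
    # deviation) and average the survivors.
    med = statistics.median(readings)
    mad = statistics.median(abs(r - med) for r in readings)
    if mad == 0:
        return med
    kept = [r for r in readings if abs(r - med) <= k * 1.4826 * mad]
    return statistics.mean(kept)

# Hypothetical scene: a sedan roof (~146 cm) judged 30% submerged,
# an adult (~170 cm) judged 25% submerged, two more consistent
# readings, and one anomalous one that the filter should reject.
readings = [estimate_depth(146, 0.30), estimate_depth(170, 0.25), 45.0, 44.2, 120.0]
print(round(robust_depth(readings), 1))  # ≈ 43.9 cm; the 120.0 reading is dropped
```

The filtering step matters because a single misjudged submergence fraction (here, 120.0 cm) would otherwise drag the scene-level average far from the consensus of the other objects.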

The FloodKG is a meticulously constructed repository of physical dimensions. It includes a hierarchical ontology covering vehicles (like sedans and SUVs), humans (adults, children), and infrastructure (curbs, fire hydrants). Each entry in the graph provides a mean height and standard deviation, sourced from authoritative data like vehicle specifications, anthropometric surveys, and design manuals. This ensures that the AI has access to accurate physical context.
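A minimal sketch of what such a knowledge-graph lookup could look like is below. The class name, the dictionary keys, and all the numbers are illustrative assumptions; the real FloodKG draws its values from vehicle specifications, anthropometric surveys, and design manuals, and is a richer structure than a flat dictionary.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class ReferenceObject:
    # One FloodKG-style entry: a canonical height (cm) with its
    # spread, placed under a coarse ontology category.
    category: str          # e.g. "vehicle", "human", "infrastructure"
    mean_height_cm: float
    std_cm: float

# Illustrative entries only (heights are plausible round numbers,
# not the verified values used by the actual FloodKG).
FLOOD_KG = {
    "sedan":        ReferenceObject("vehicle", 146.0, 5.0),
    "suv":          ReferenceObject("vehicle", 178.0, 8.0),
    "adult":        ReferenceObject("human", 170.0, 10.0),
    "curb":         ReferenceObject("infrastructure", 15.0, 2.5),
    "fire_hydrant": ReferenceObject("infrastructure", 75.0, 7.0),
}

def lookup_height(label: str) -> float:
    # The grounding step: a label detected by the VLM is resolved
    # against the graph instead of letting the model guess a dimension.
    return FLOOD_KG[label].mean_height_cm

print(lookup_height("sedan"))  # 146.0
```

The design point is that the VLM never supplies a number itself; it only supplies a label, and the dimension comes from the curated graph.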

In experiments, FloodVision was evaluated using 110 crowdsourced images from the MyCoast New York platform, where residents submit geotagged photos and flood depth estimates. The results were impressive: FloodVision achieved a mean absolute error (MAE) of 8.17 cm. This represents a significant 20.5% reduction in error compared to a GPT-4o-only baseline, which scored 10.28 cm MAE. It also outperformed earlier methods based on Convolutional Neural Networks (CNNs).
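The reported 20.5% figure follows directly from the two MAE values. A quick check, using a generic MAE helper of our own naming:

```python
def mean_absolute_error(predictions: list[float], truths: list[float]) -> float:
    # MAE over paired depth estimates and ground-truth depths (cm).
    return sum(abs(p - t) for p, t in zip(predictions, truths)) / len(predictions)

# Relative error reduction from the paper's reported numbers.
baseline_mae = 10.28      # GPT-4o-only baseline (cm)
floodvision_mae = 8.17    # FloodVision (cm)
reduction = (baseline_mae - floodvision_mae) / baseline_mae
print(f"{reduction:.1%}")  # → 20.5%
```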

The framework’s ability to generalize across varying scenes and operate in near real-time makes it highly suitable for practical applications. It could be integrated into digital twin platforms for dynamic visualization of flood conditions or citizen-reporting apps, significantly enhancing smart city flood resilience efforts. This research marks an important step towards more accurate, generalizable, and real-time urban flood depth estimation for emergency response and urban planning.


While FloodVision offers substantial improvements, the researchers acknowledge areas for future development. These include incorporating additional visual cues beyond just reference objects, such as water surface texture or reflections, and exploring few-shot or reinforcement learning to further enhance accuracy and adaptability. The full research paper can be found here.

Karthik Mehta
Karthik Mehta is a data journalist known for his data-rich, insightful coverage of AI news and developments. Armed with a degree in Data Science from IIT Bombay and years of newsroom experience, Karthik merges storytelling with metrics to surface deeper narratives in AI-related events. His writing cuts through hype, revealing the real-world impact of Generative AI on industries, policy, and society. You can reach him at: [email protected]
