ATOM: A New Approach for Dynamic Knowledge Graph Construction with LLMs

TLDR: ATOM is a new, scalable method that uses LLMs to build and update Temporal Knowledge Graphs (TKGs) from text. It improves accuracy and stability by breaking text into “atomic facts” and uses a parallel, dual-time modeling approach to process information efficiently, significantly reducing processing time compared to existing methods.

In today’s fast-paced digital world, the sheer volume of unstructured data, from news articles to social media posts, presents a significant challenge. Extracting meaningful, structured knowledge from this deluge is crucial for everything from real-time analytics to advanced AI systems. While Knowledge Graphs (KGs) have emerged as powerful tools for organizing information, traditional static KGs often fall short when dealing with the dynamic, ever-changing nature of real-world events.

This is where Temporal Knowledge Graphs (TKGs) come into play, integrating time dimensions to capture how facts evolve. However, existing methods for building TKGs, especially those leveraging Large Language Models (LLMs) with zero- or few-shot learning, face hurdles. They can be unstable, producing different results from the same input, non-exhaustive, missing key information, and struggle with scalability as data grows.

A new research paper introduces ATOM (AdapTive and OptiMized), a novel approach designed to overcome these limitations. Developed by Yassir LAIRGI, Ludovic MONCLA, Khalid BENABDESLEM, Rémy CAZABET, and Pierre CLÉAU, ATOM offers a few-shot and scalable method for constructing and continuously updating TKGs from unstructured text. You can read the full research paper here: ATOM: AdapTive and OptiMized dynamic temporal knowledge graph construction using LLMs.

How ATOM Works: A Dual-Time, Parallel Approach

ATOM’s core innovation lies in its multi-module architecture, which prioritizes exhaustivity, stability, and scalability. Instead of processing entire documents at once, ATOM first breaks down input texts into “atomic facts.” These are minimal, self-contained snippets that convey a single piece of information. This decomposition is critical because LLMs often suffer from a “forgetting effect” when dealing with longer contexts, leading to incomplete knowledge extraction. By providing smaller, unambiguous contexts, ATOM ensures more thorough and consistent extraction.

A key feature of ATOM is its “dual-time modeling.” It distinguishes between the “observation time” (when information is observed or ingested) and the “validity period” (when the fact itself is true, characterized by start and end times). This separation is vital for accurately reflecting real-world data and inferring relative times, preventing common errors where observation time is mistakenly treated as the validity start time.

To tackle the computational challenge of processing potentially thousands of atomic facts from a single document, ATOM employs a highly efficient parallel architecture. After atomic facts are extracted, 5-tuples (subject, relation, object, validity start time, validity end time) are constructed in parallel. Crucially, ATOM avoids relying on LLM calls during the merging phase of these atomic TKGs. Instead, it uses a distance-metric-based algorithm for entity and relation resolution and a clever preprocessing step during extraction to handle temporal conflicts. This parallel, LLM-independent merging significantly boosts scalability and reduces latency.

Impressive Results and Performance

Empirical evaluations demonstrate ATOM’s superior performance compared to baseline methods like Graphiti and iText2KG. ATOM achieved approximately 18% higher temporal exhaustivity and 31% higher factual exhaustivity, meaning it captures more relevant information. It also showed about 17% better stability, ensuring more consistent TKG construction across multiple runs.

Perhaps one of ATOM’s most significant advantages is its speed. It achieved over 90% latency reduction compared to baseline methods. This dramatic improvement is attributed to its parallel processing capabilities and its LLM-independent merging strategy, which avoids the computational overhead of repeated LLM calls as the graph expands.

Furthermore, ATOM demonstrated strong consistency in Dynamic Temporal Knowledge Graph (DTKG) construction, particularly in temporal resolution. By correctly distinguishing observation time from validity periods, it avoids misattributing temporal information, a common pitfall in other systems.

Also Read:

Looking Ahead

While ATOM represents a significant leap forward, the researchers acknowledge some limitations. The atomic fact decomposition, while beneficial, can sometimes lead to the LLM generating “inferred” facts not explicitly in the source text, potentially increasing hallucination rates. Also, temporal information might occasionally be misassigned during decomposition, leading to omissions. Future work could involve fine-tuning LLMs specifically for this decomposition task or using supervised classifiers for entity/relation resolution to further refine accuracy.

In conclusion, ATOM provides a robust, scalable, and efficient framework for building and continuously updating Temporal Knowledge Graphs from unstructured text, paving the way for more dynamic and accurate knowledge representation in AI systems.

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

ATOM: A New Approach for Dynamic Knowledge Graph Construction with LLMs

How ATOM Works: A Dual-Time, Parallel Approach

Impressive Results and Performance

Looking Ahead

Gen AI News and Updates

PASA Unveils New ‘Data for AI’ Guidance to Foster Responsible Innovation in Pensions Administration

AZTECH Introduces Comprehensive AI Training Series to Propel Regional Digital Transformation

HKU Spearheads AI Integration in Hong Kong’s Digital Education Future

Boosting Business Efficiency: A New AI and Big Data Model for Process Optimization

AlphaCast: A New Approach to Time Series Prediction Through Human-AI Collaboration

New Graph Neural Networks Improve Reasoning in Assumption-Based Argumentation

Enhancing AI Reasoning: How Recursive Refinement and Multi-Agent Systems Improve Language Model Performance

ARGUS: A Proactive Framework for Enhancing Autonomous Driving Safety

Generative AI Powers Next-Gen Autonomous Emergency Response

OR-R1: Advancing Automated Optimization with Smart, Data-Efficient AI

Enhancing GUI Agents with Memory: A New Framework for History-Aware Reasoning

ProBench: A Deeper Look into How We Evaluate AI Agents for Mobile Apps

Enhancing Large Language Model Reasoning with Concise Outputs

Ensuring Trust in Autonomous AI: A Two-Layered Monitoring Approach for Agentic Systems

MedFuse: A Multiplicative Approach to Understanding Irregular Clinical Time Series Data

HyperD: A New Framework for More Accurate and Robust Traffic Predictions

Beyond Training: Researchers Propose ‘Model Raising’ for AI with Intrinsic Values

Bridging the Divide: Why AI Needs a Qualitative Revolution

Language Models Enhance Safety Certificate Synthesis for Dynamic Systems

Subscribe to get the latest news and updates