AI Agent OntoLogX Structures Cybersecurity Logs for Threat Intelligence

TLDR: OntoLogX is an autonomous AI agent that uses Large Language Models (LLMs) to convert raw cybersecurity logs into structured, ontology-grounded Knowledge Graphs (KGs). It incorporates Retrieval Augmented Generation (RAG) and iterative correction to ensure semantic and syntactic validity. The system groups KGs into sessions to predict MITRE ATT&CK tactics, linking low-level log data to higher-level adversarial objectives. Evaluated on benchmark and real-world honeypot data, OntoLogX significantly improves the extraction of actionable Cyber Threat Intelligence, demonstrating the value of code-oriented LLMs and ontology-grounded representations.

Cybersecurity threats are constantly evolving, becoming more sophisticated and harder to detect. Traditional defense systems often struggle to keep pace, leading to a critical need for more proactive strategies. A key source of information for understanding these threats lies within system logs, which record attacker behaviors, exploited vulnerabilities, and malicious activities. However, these logs are typically unstructured, inconsistent, and fragmented, making it incredibly difficult to extract meaningful insights.

This is where Cyber Threat Intelligence (CTI) comes in. CTI involves collecting, processing, and analyzing information about threat actors to enable faster and better-informed decisions in cybersecurity operations. Among the most valuable sources for CTI are logs from honeypots, which are systems designed to attract and record malicious interactions, providing a rich dataset of adversarial behavior.

To address the challenges of processing these complex logs, researchers have explored using Knowledge Graphs (KGs). KGs represent concepts, entities, and events, along with the relationships between them, in a way that is closer to human understanding. This structured representation facilitates semantic reasoning and integration with automated workflows.

Recent advancements in Large Language Models (LLMs) have shown remarkable potential in extracting structured information from natural language. However, applying LLMs effectively in specialized domains like cybersecurity, where precise terminology and contextual interpretation are crucial, remains a challenge. Existing approaches often require heavily pre-processed inputs or significant user interaction.

A new autonomous AI agent called OntoLogX has been introduced to tackle these issues. OntoLogX leverages LLMs to transform raw logs directly into ontology-grounded Knowledge Graphs without requiring human intervention. It integrates a lightweight, domain-specific log ontology with Retrieval Augmented Generation (RAG) and iterative correction steps. This ensures that the generated KGs are both syntactically correct and semantically valid.

Beyond just analyzing individual events, OntoLogX aggregates these KGs into sessions. It then uses an LLM to predict MITRE ATT&CK tactics, which are high-level adversarial objectives. This crucial step links low-level log evidence to broader attack strategies, providing a more comprehensive understanding of threat activities.

The methodology of OntoLogX involves several key steps. When a log event arrives, the system first retrieves semantically related KGs from a database to serve as examples for the LLM. The LLM then generates a candidate KG, combining the new log event, optional context, and the domain ontology. This candidate is rigorously validated against ontology constraints. If errors are found, the model is prompted to apply targeted corrections iteratively until a valid representation is achieved. Once validated, the KG is stored, and finally, KGs from the same log session are grouped to predict associated MITRE ATT&CK tactics.

The evaluation of OntoLogX demonstrated its effectiveness in generating ontology-compliant KGs. The retrieval and correction mechanisms significantly improved the precision and recall of information extraction. Interestingly, code-oriented LLMs proved particularly well-suited for this structured log analysis task. The system was tested on both public benchmark logs and a real-world honeypot dataset, showing robust KG generation and accurate mapping of adversarial activity to ATT&CK tactics.

While OntoLogX represents a significant step forward in extracting actionable CTI from logs, the reliance on computationally intensive LLMs presents a scalability challenge for high-throughput environments. Future work aims to explore optimization strategies and extend the ontology to cover more log sources and CTI standards. For more details, you can refer to the full research paper: OntoLogX: Ontology-Guided Knowledge Graph Extraction from Cybersecurity Logs with Large Language Models.

Also Read:

Overall, OntoLogX offers a novel and promising approach to transforming unstructured and heterogeneous logs into valuable intelligence, enhancing proactive and explainable cyber defense.

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

AI Agent OntoLogX Structures Cybersecurity Logs for Threat Intelligence

Gen AI News and Updates

Amazon Bedrock’s A2A Protocol: The Catalyst for Next-Gen Cross-Framework Multi-Agent AI Systems

Astreya Unveils New Wave of Enterprise AI Agents to Boost Business Efficiency and Automation

Vesl AI Recognized for AI Infrastructure Innovation with ASOCIO Digital Summit Award

Boosting Business Efficiency: A New AI and Big Data Model for Process Optimization

AlphaCast: A New Approach to Time Series Prediction Through Human-AI Collaboration

New Graph Neural Networks Improve Reasoning in Assumption-Based Argumentation

Enhancing AI Reasoning: How Recursive Refinement and Multi-Agent Systems Improve Language Model Performance

ARGUS: A Proactive Framework for Enhancing Autonomous Driving Safety

Generative AI Powers Next-Gen Autonomous Emergency Response

OR-R1: Advancing Automated Optimization with Smart, Data-Efficient AI

Enhancing GUI Agents with Memory: A New Framework for History-Aware Reasoning

ProBench: A Deeper Look into How We Evaluate AI Agents for Mobile Apps

Enhancing Large Language Model Reasoning with Concise Outputs

Ensuring Trust in Autonomous AI: A Two-Layered Monitoring Approach for Agentic Systems

MedFuse: A Multiplicative Approach to Understanding Irregular Clinical Time Series Data

HyperD: A New Framework for More Accurate and Robust Traffic Predictions

Beyond Training: Researchers Propose ‘Model Raising’ for AI with Intrinsic Values

Bridging the Divide: Why AI Needs a Qualitative Revolution

Language Models Enhance Safety Certificate Synthesis for Dynamic Systems

Subscribe to get the latest news and updates