Securing FPGAs: Real-time AI Detects Malicious Code in Embedded Systems

TLDR: Researchers developed a real-time machine learning method to detect malicious bitstreams in reconfigurable embedded systems like FPGAs. By analyzing binary-level features, their Random Forest model achieved a 97% F1-score and was successfully deployed on a PYNQ-Z1 board, enabling on-device, real-time detection of hardware Trojans without needing source code or netlists.

Field-Programmable Gate Arrays (FPGAs) are becoming increasingly common in modern reconfigurable systems, offering high-performance computation with lower power consumption compared to GPUs or traditional processors. Their ability to be dynamically updated makes them ideal for adaptive systems, real-time acceleration, and cloud environments. However, this flexibility also introduces significant security vulnerabilities, particularly from malicious bitstreams that can reprogram devices with compromised data. These threats can lead to denial-of-service attacks, data leakage, or covert operations, especially in multi-tenant cloud setups where hardware is shared and reconfigured over time.

Traditional methods to combat these threats often involve controlling synthesis pipelines or reverse-engineering bitstreams back into netlists for analysis. These approaches can be time-consuming, resource-intensive, and often rely on the availability of development-stage files, which isn’t always feasible, especially when intellectual property confidentiality is a concern.

A new research paper, “Real-time ML-based Defense Against Malicious Payload in Reconfigurable Embedded Systems,” by Rye Stahle-Smith and Rasha Karakchi (Advisor) from the University of South Carolina, proposes an innovative solution to this growing problem. Their approach diverges significantly from existing methods by analyzing bitstreams directly at the binary level. This allows for real-time detection of malicious payloads without needing access to the original source code or netlists, making it highly practical for resource-constrained embedded systems.

How It Works: A Machine Learning Approach

The researchers developed a supervised machine learning method to identify malicious bitstreams using static byte-level features. They created a dataset of 122 samples, including both benign and malicious configurations, by re-engineering state-of-the-art benchmarks to target the Xilinx PYNQ-Z1 FPGA Development Board. To prepare this data for machine learning, they used byte frequency analysis for vectorization, Truncated Singular Value Decomposition (TSVD) for dimensionality reduction to maintain structural relevance and improve computational efficiency, and the Synthetic Minority Oversampling Technique (SMOTE) to address class imbalance within the dataset.

Multiple machine learning classifiers were evaluated, and the Random Forest model emerged as the top performer. It achieved an impressive macro F1-score of 0.97, along with high accuracy, precision, and recall (all around 0.98 ± 0.02). On a hold-out test set, Random Forest demonstrated a true positive rate of 97.14% and a false positive rate of just 0.8%, confirming its strong ability to distinguish between benign and malicious bitstreams.

Also Read:

Real-time Deployment on Embedded Systems

A key aspect of this research is the successful deployment of the trained Random Forest model. The final model was serialized and integrated into the PYNQ-Z1 platform, enabling on-device inference. This means the system can analyze bitstreams in real-time within a constrained embedded environment, without relying on external computation. The deployment trials showed consistent classification results and an average prediction latency of approximately 3.35 seconds, validating the feasibility of real-time hardware Trojan detection in such settings.

This work represents a significant step forward in securing reconfigurable embedded systems against sophisticated hardware attacks. By focusing on binary-level analysis and leveraging efficient machine learning techniques, the researchers have demonstrated a viable path for proactive, real-time defense against malicious payloads. For more details, you can read the full research paper here.

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Securing FPGAs: Real-time AI Detects Malicious Code in Embedded Systems

How It Works: A Machine Learning Approach

Real-time Deployment on Embedded Systems

Gen AI News and Updates

Rubrik Report Reveals Alarming Decline in Cyber Resilience Amidst AI Agent Proliferation

Anthropic Reveals First AI-Orchestrated Cyber Espionage Campaign by Chinese State-Sponsored Group

TrojAI Unveils Defend for MCP to Bolster Security for AI Agent Workflows

Boosting Business Efficiency: A New AI and Big Data Model for Process Optimization

AlphaCast: A New Approach to Time Series Prediction Through Human-AI Collaboration

New Graph Neural Networks Improve Reasoning in Assumption-Based Argumentation

Enhancing AI Reasoning: How Recursive Refinement and Multi-Agent Systems Improve Language Model Performance

ARGUS: A Proactive Framework for Enhancing Autonomous Driving Safety

Generative AI Powers Next-Gen Autonomous Emergency Response

OR-R1: Advancing Automated Optimization with Smart, Data-Efficient AI

Enhancing GUI Agents with Memory: A New Framework for History-Aware Reasoning

ProBench: A Deeper Look into How We Evaluate AI Agents for Mobile Apps

Enhancing Large Language Model Reasoning with Concise Outputs

Ensuring Trust in Autonomous AI: A Two-Layered Monitoring Approach for Agentic Systems

MedFuse: A Multiplicative Approach to Understanding Irregular Clinical Time Series Data

HyperD: A New Framework for More Accurate and Robust Traffic Predictions

Beyond Training: Researchers Propose ‘Model Raising’ for AI with Intrinsic Values

Bridging the Divide: Why AI Needs a Qualitative Revolution

Language Models Enhance Safety Certificate Synthesis for Dynamic Systems

Subscribe to get the latest news and updates