AI-Driven Investment: A Hierarchical Approach to Portfolio Optimization with Market Sentiment

TLDR: A new research paper introduces HARLF, a hierarchical framework for financial portfolio optimization that integrates lightweight Large Language Models (LLMs) with Deep Reinforcement Learning (DRL). This three-tier architecture uses base RL agents for hybrid data processing, meta-agents for decision aggregation, and a super-agent to synthesize final allocations based on market data and sentiment analysis from financial news. Evaluated on data from 2018-2024, the framework achieved a 26% annualized return and a Sharpe ratio of 1.2, outperforming traditional benchmarks and other RL strategies. The paper highlights scalable cross-modal integration, enhanced stability through hierarchy, and open-source reproducibility.

Financial markets are complex and constantly changing, making it a significant challenge for investors to decide how to allocate their money to maximize returns while managing risk. Traditional methods often struggle with the dynamic nature of these markets and the vast amount of unstructured data, like financial news, that influences investor sentiment.

A new research paper, titled “HARLF: Hierarchical Reinforcement Learning and Lightweight LLM-Driven Sentiment Integration for Financial Portfolio Optimization,” introduces an innovative approach to tackle this problem. Authored by Benjamin CORIAT and Eric BENHAMOU, this paper presents a hierarchical framework that combines the power of Deep Reinforcement Learning (DRL) with lightweight Large Language Models (LLMs) to make smarter investment decisions.

The HARLF Framework: A Three-Tiered Approach

The core of this new framework, called HARLF, is its three-layer architecture designed to process both traditional financial data and sentiment signals from news. This hierarchical structure aims to improve decision-making, making it more stable and easier to understand.

At the lowest level are the base RL agents. These agents are like specialized analysts. Some focus on quantitative financial metrics, such as daily returns, volatility, and various risk-adjusted ratios (Sharpe, Sortino, Calmar, Maximum Drawdown). Others specialize in qualitative data, specifically sentiment scores derived from financial news. They propose initial portfolio weight recommendations based on their specific data inputs.

Above the base agents are the meta-agents. There are two main types: one for data-driven insights and another for NLP-based (sentiment) insights. These meta-agents take the recommendations from their respective base agents and aggregate them, refining the proposed portfolio allocations. This step ensures that decisions are cohesive and specialized based on the type of information being processed.

Finally, at the top, is the super-agent. This is the ultimate decision-maker. It receives the refined recommendations from both the data-driven meta-agent and the NLP-based meta-agent. By synthesizing these two perspectives—quantitative market analysis and qualitative market sentiment—the super-agent makes the final, optimized portfolio allocation decisions. This mimics how experienced traders might balance hard numbers with market mood.

Integrating Sentiment with FinBERT

A key innovation of HARLF is its seamless integration of sentiment analysis. The framework uses FinBERT, a specialized Large Language Model fine-tuned for financial texts, to extract sentiment from financial news articles. For each asset in the portfolio, news articles are scraped monthly, and FinBERT analyzes them to produce a sentiment score. This score, along with volatility, forms the NLP-driven observation vector for the sentiment-focused agents. This allows the system to capture forward-looking signals and investor behavior that traditional price data alone cannot reveal.

Data and Assets Used

The framework was trained on historical financial data from 2003 to 2017 and then evaluated on unseen data from 2018 to 2024. The portfolio included 14 diverse financial instruments, spanning both equities (like S&P 500, NASDAQ, CAC 40, Hang Seng Index) and commodities (Gold, Silver, WTI Crude Oil Futures). This diversification helps the system learn across various market conditions.

To ensure the model’s decisions are practical, several real-world investment constraints were applied: the strategy is ‘long-only’ (only buying assets, no short-selling), uses ‘no leverage’ (investing only available capital), and performs ‘monthly rebalancing’ of asset weights. Each asset started with an equal initial weight, allowing the RL agent to shape the portfolio without inherited biases.

Impressive Performance

The results of the HARLF framework are compelling. During the testing period (2018–2024), the super-agent achieved an impressive 26% annualized return and a Sharpe ratio of 1.2. This significantly outperformed standard benchmarks, including an equal-weighted portfolio (7.5% ROI, 0.57 Sharpe) and the S&P 500 (13.2% ROI, 0.63 Sharpe). The NLP-based meta-agent also showed strong performance, highlighting the value of sentiment-driven decision-making.

Compared to other state-of-the-art RL strategies in academic literature, the HARLF super-agent demonstrated competitive or superior performance, closely matching or surpassing strategies like CNN-RL and various DRL approaches in terms of ROI and Sharpe ratio.

Also Read:

Transparency and Future Outlook

To ensure transparency and allow for further research, the authors have made their work reproducible through Google Colab notebooks, covering data pipelines, sentiment extraction, and the full training workflow. You can find more details about this research paper at this link.

While promising, the current implementation has some limitations, such as assuming synchronously available data and excluding transaction costs. Future research aims to address these by incorporating real-time data, modeling transaction costs, stress testing under extreme market conditions, expanding the text corpus for sentiment analysis, and exploring the efficacy of larger, general-purpose LLMs compared to lightweight models like FinBERT.

This research marks a significant step forward in applying advanced AI techniques to financial portfolio optimization, offering a robust and adaptive solution for navigating the complexities of modern markets.

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

AI-Driven Investment: A Hierarchical Approach to Portfolio Optimization with Market Sentiment

The HARLF Framework: A Three-Tiered Approach

Integrating Sentiment with FinBERT

Data and Assets Used

Impressive Performance

Transparency and Future Outlook

Gen AI News and Updates

HKU Spearheads AI Integration in Hong Kong’s Digital Education Future

UNESCO’s 43rd General Conference Concludes with New Leadership and Landmark Ethics Frameworks for Technology

BRYGE AI Secures Silver Stevie® Award for Groundbreaking Health Tech Product for Women

Boosting Business Efficiency: A New AI and Big Data Model for Process Optimization

AlphaCast: A New Approach to Time Series Prediction Through Human-AI Collaboration

New Graph Neural Networks Improve Reasoning in Assumption-Based Argumentation

Enhancing AI Reasoning: How Recursive Refinement and Multi-Agent Systems Improve Language Model Performance

ARGUS: A Proactive Framework for Enhancing Autonomous Driving Safety

Generative AI Powers Next-Gen Autonomous Emergency Response

OR-R1: Advancing Automated Optimization with Smart, Data-Efficient AI

Enhancing GUI Agents with Memory: A New Framework for History-Aware Reasoning

ProBench: A Deeper Look into How We Evaluate AI Agents for Mobile Apps

Enhancing Large Language Model Reasoning with Concise Outputs

Ensuring Trust in Autonomous AI: A Two-Layered Monitoring Approach for Agentic Systems

MedFuse: A Multiplicative Approach to Understanding Irregular Clinical Time Series Data

HyperD: A New Framework for More Accurate and Robust Traffic Predictions

Beyond Training: Researchers Propose ‘Model Raising’ for AI with Intrinsic Values

Bridging the Divide: Why AI Needs a Qualitative Revolution

Language Models Enhance Safety Certificate Synthesis for Dynamic Systems

Subscribe to get the latest news and updates