research-article

Open access

MetaTrader: An Reinforcement Learning Approach Integrating Diverse Policies for Portfolio Optimization

Authors:

Hui Niu,

Siyuan Li,

Jian LiAuthors Info & Claims

CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management

Pages 1573 - 1583

https://doi.org/10.1145/3511808.3557363

Published: 17 October 2022 Publication History

PDF eReader

Abstract

Portfolio management is a fundamental problem in finance. It involves periodic reallocations of assets to maximize the expected returns within an appropriate level of risk exposure. Deep reinforcement learning (RL) has been considered a promising approach to solving this problem owing to its strong capability in sequential decision making. However, due to the non-stationary nature of financial markets, applying RL techniques to portfolio optimization remains a challenging problem. Extracting trading knowledge from various expert strategies could be helpful for agents to accommodate the changing markets. In this paper, we propose MetaTrader, a novel two-stage RL-based approach for portfolio management, which learns to integrate diverse trading policies to adapt to various market conditions. In the first stage, MetaTrader incorporates an imitation learning objective into the reinforcement learning framework. Through imitating different expert demonstrations, MetaTrader acquires a set of trading policies with great diversity. In the second stage, MetaTrader learns a meta-policy to recognize the market conditions and decide on the most proper learned policy to follow. We evaluate the proposed approach on three real-world index datasets and compare it to state-of-the-art baselines. The empirical results demonstrate that MetaTrader significantly outperforms those baselines in balancing profits and risks. Furthermore, thorough ablation studies validate the effectiveness of the components in the proposed approach.

Supplementary Material

MP4 File (CIKM22-fp0439.mp4)

In this paper, we propose a novel reinforcement learning approach to solve the portfolio optimization problem by integrating diverse trading policies.

Download
36.49 MB

References

[1]

Saud Almahdi and Steve Y Yang. 2017. An adaptive portfolio trading system: A risk-return portfolio optimization using recurrent reinforcement learning with expected maximum drawdown. Expert Systems with Applications 87 (2017), 267--279.

Abstract

Supplementary Material

References

Cited By

Index Terms

Recommendations

Portfolio dynamic trading strategies using deep reinforcement learning

Portfolio management algorithm based on long-term prediction of assets

Soft imitation reinforcement learning with value decomposition for portfolio management

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

PDF

eReader

Get Access

Login options

Full Access

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations