
Q-Learning-Based Financial Trading: Some Results and Comparisons


Part of the book series: Smart Innovation, Systems and Technologies (SIST, volume 184)

Abstract

In this paper, we consider different financial trading systems (FTSs) based on a Reinforcement Learning (RL) methodology known as Q-Learning (QL). QL is a machine learning method that optimizes its behavior in real time according to the responses it receives from the environment as a consequence of its actions. In the paper, we first introduce the essential aspects of RL and QL that are of interest for our purposes; then we present some original, differently configured FTSs based on QL; finally, we apply these FTSs to eight time series of daily closing stock returns from the Italian stock market.
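As context for the abstract, the core of any QL-based FTS is the Q-Learning update rule. The following is a minimal, self-contained sketch of tabular QL with an epsilon-greedy policy; the state and action sizes, the placeholder environment, and all numeric settings are illustrative assumptions, not the configurations used in the chapter.

```python
import numpy as np

# Minimal tabular Q-Learning sketch (illustrative only; the chapter's FTSs
# use their own state variables, action sets, and reward definitions).
n_states, n_actions = 10, 3             # assumed sizes; e.g. actions: short / out / long
alpha, gamma, epsilon = 0.1, 0.95, 0.1  # learning rate, discount factor, exploration rate

rng = np.random.default_rng(0)
Q = np.zeros((n_states, n_actions))     # Q[s, a]: estimated return of action a in state s

def step(state, action):
    """Placeholder environment. In an FTS, the reward would be the net
    trading result and the next state the new market configuration."""
    return rng.normal(), int(rng.integers(n_states))

s = 0
for t in range(10_000):
    # epsilon-greedy: explore with probability epsilon, otherwise exploit
    a = int(rng.integers(n_actions)) if rng.random() < epsilon else int(Q[s].argmax())
    r, s_next = step(s, a)
    # QL update: move Q[s, a] toward the bootstrapped target r + gamma * max_a' Q[s', a']
    Q[s, a] += alpha * (r + gamma * Q[s_next].max() - Q[s, a])
    s = s_next
```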


Notes

1. In Sect. 4, we specify the policy improvement we consider in the applications.

2. For simplicity's sake, in the remainder of the paper we use the term "net" as shorthand for "net-of-transaction-costs".

3. Note that such an approximator is needed because some of the state variables, namely the logarithmic rates of return, are continuous.

4. When \(k=0\), the parameters are randomly initialized by drawing from \(\mathcal{U}(-1, 1)^{N+2}\).

5. Note that, in order to determine the optimal parameters, we minimize a mean square error through a gradient descent-based method (a minimal illustrative sketch is given after these notes).

6. In this context, "annualized" and "monthly" refer to the stock market year and the stock market month, respectively.

7. From here on, by the expression «[…] stocks that contribute most to this result […]», or equivalent, we mean stocks whose percentages of success are greater than or equal to \(60\%\).
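To make notes 3-5 concrete, the sketch below shows one way a parametric Q-function approximator over continuous log-return states can be initialized from \(\mathcal{U}(-1, 1)^{N+2}\) and fitted by gradient descent on a mean square error. The feature map, the learning rate, and the value of N are illustrative assumptions; the chapter's exact approximator is not reproduced here.

```python
import numpy as np

# Illustrative linear Q-function approximator for continuous state variables
# (cf. notes 3-5). N lagged log returns plus the action and an intercept
# give N + 2 parameters.
N = 5                                   # assumed number of lagged log returns
rng = np.random.default_rng(0)
theta = rng.uniform(-1.0, 1.0, N + 2)   # random init from U(-1, 1)^(N+2), cf. note 4

def features(log_returns, action):
    """Assumed feature map: the N log returns, the action, and a constant 1."""
    return np.append(log_returns, [float(action), 1.0])

def q_value(log_returns, action, theta):
    return features(log_returns, action) @ theta

def gd_update(log_returns, action, target, theta, lr=0.01):
    """One gradient-descent step on the squared error (q - target)^2, cf. note 5."""
    x = features(log_returns, action)
    error = x @ theta - target
    return theta - lr * error * x       # gradient of 0.5 * error^2 w.r.t. theta

# Example: one update of the parameters toward a bootstrapped QL target.
returns = rng.normal(0.0, 0.01, N)      # synthetic daily log returns
theta = gd_update(returns, action=1, target=0.02, theta=theta)
```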


Author information

Correspondence to Marco Corazza.


Copyright information

© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this chapter


Cite this chapter

Corazza, M. (2021). Q-Learning-Based Financial Trading: Some Results and Comparisons. In: Esposito, A., Faundez-Zanuy, M., Morabito, F., Pasero, E. (eds) Progresses in Artificial Intelligence and Neural Systems. Smart Innovation, Systems and Technologies, vol 184. Springer, Singapore. https://doi.org/10.1007/978-981-15-5093-5_31
