Abstract
We present an accelerated algorithm for the solution of static Hamilton-Jacobi-Bellman equations related to optimal control problems and differential games. The new scheme combines the advantages of value iteration and policy iteration methods by means of an efficient coupling. The method starts with a value iteration phase on a coarse mesh and then switches to a policy iteration procedure over a finer mesh when a fixed error threshold is reached. We present numerical tests assessing the performance of the scheme.
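The coupling described above — cheap value-iteration sweeps until a coarse error threshold is met, then a switch to the fast Newton-like steps of policy iteration — can be sketched on a small discrete discounted dynamic programming problem. This is only an illustrative analogue of the scheme: the paper works on coarse and fine meshes for static Hamilton-Jacobi-Bellman equations, whereas the sketch below uses a finite state/action problem, and the function name, tolerances, and problem data are assumptions for demonstration.

```python
import numpy as np

def accelerated_vi_pi(P, c, gamma, vi_tol=1e-2, max_iter=1000):
    """Hedged sketch of a value/policy iteration coupling.

    P: (A, S, S) transition matrices, one per action.
    c: (A, S) stage costs.  gamma: discount factor in (0, 1).
    Returns the (approximate) value function and a minimizing policy.
    """
    A, S, _ = P.shape
    v = np.zeros(S)

    # Phase 1: value iteration, stopped at a coarse threshold vi_tol.
    # Each sweep is cheap but convergence is only linear (rate gamma).
    for _ in range(max_iter):
        q = c + gamma * P @ v            # (A, S) action values
        v_new = q.min(axis=0)
        if np.max(np.abs(v_new - v)) < vi_tol:
            v = v_new
            break
        v = v_new

    # Phase 2: policy iteration (Howard's algorithm), warm-started
    # from the value-iteration output; each step solves a linear
    # system exactly and converges fast near the solution.
    policy = (c + gamma * P @ v).argmin(axis=0)
    for _ in range(max_iter):
        # Policy evaluation: solve (I - gamma * P_pi) v = c_pi.
        P_pi = P[policy, np.arange(S), :]
        c_pi = c[policy, np.arange(S)]
        v = np.linalg.solve(np.eye(S) - gamma * P_pi, c_pi)
        # Policy improvement.
        new_policy = (c + gamma * P @ v).argmin(axis=0)
        if np.array_equal(new_policy, policy):
            break
        policy = new_policy
    return v, policy
```

The design point mirrors the one in the abstract: value iteration is robust from an arbitrary initial guess but slow near the fixed point, while policy iteration converges rapidly only once the current policy is close to optimal, so the threshold `vi_tol` governs the hand-off between the two phases.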
© 2015 Springer International Publishing Switzerland
Cite this paper
Alla, A., Falcone, M., Kalise, D. (2015). An Accelerated Value/Policy Iteration Scheme for Optimal Control Problems and Games. In: Abdulle, A., Deparis, S., Kressner, D., Nobile, F., Picasso, M. (eds) Numerical Mathematics and Advanced Applications - ENUMATH 2013. Lecture Notes in Computational Science and Engineering, vol 103. Springer, Cham. https://doi.org/10.1007/978-3-319-10705-9_48
Print ISBN: 978-3-319-10704-2
Online ISBN: 978-3-319-10705-9