A Modified Form of the Iterative Method of Dynamic Programming

Arie Hordijk and Henk Tijms
The Annals of Statistics
Vol. 3, No. 1 (Jan., 1975), pp. 203-208
Stable URL: http://www.jstor.org/stable/2958088
Page Count: 6

Abstract

This paper considers the discrete time finite state Markovian decision problem with the average return criterion. A modified form of the iterative method of dynamic programming is studied. Under the assumption that the maximal average return is independent of the initial state, the asymptotic behaviour of the sequence of functions generated by this modified method is found. It is shown that the modified iterative method supplies both upper and lower bounds on the maximal average return, as well as ε-optimal policies. Moreover, a convergence result is proved for the policies produced by the modified iterative method.
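
The abstract does not spell out the modification itself, so the sketch below illustrates only the classical, unmodified iterative method (value iteration) for the average return criterion, together with the standard way successive differences yield lower and upper bounds on the maximal average return. It is not the authors' algorithm. The function name `value_iteration_bounds`, the array layout, the stopping tolerance `eps`, and the two-state example data are all assumptions made for illustration. The sketch also assumes the maximal average return does not depend on the initial state and that the chains involved are aperiodic, so the bounds close.

```python
import numpy as np


def value_iteration_bounds(P, r, eps=1e-9, max_iter=10_000):
    """Classical value iteration for an average-return finite MDP.

    P has shape (A, S, S): P[a, s, j] is the probability of moving from
    state s to state j under action a. r has shape (S, A): r[s, a] is the
    one-step return. Both layouts are assumptions of this sketch.
    Returns (lower, upper, policy), where lower and upper bracket the
    maximal average return and policy is greedy for the last iterate.
    """
    n_states = r.shape[0]
    v = np.zeros(n_states)
    lower, upper = -np.inf, np.inf
    policy = np.zeros(n_states, dtype=int)
    for _ in range(max_iter):
        # q[s, a] = r[s, a] + sum_j P[a, s, j] * v[j]
        q = r + np.einsum("asj,j->sa", P, v)
        v_new = q.max(axis=1)
        policy = q.argmax(axis=1)
        # Smallest and largest one-step increase bracket the maximal
        # average return.
        diff = v_new - v
        lower, upper = diff.min(), diff.max()
        if upper - lower < eps:
            break
        # Subtract a reference value so the iterates stay bounded; this
        # shifts v by a constant and leaves the differences unchanged.
        v = v_new - v_new[0]
    return lower, upper, policy


# Hypothetical two-state, two-action example: state 1 pays 2, state 0 pays 0.
# Action 0 moves to either state with probability 0.5; action 1 moves to
# state 1 with probability 0.9.
P_example = np.array([
    [[0.5, 0.5], [0.5, 0.5]],
    [[0.1, 0.9], [0.1, 0.9]],
])
r_example = np.array([[0.0, 0.0], [2.0, 2.0]])

lower, upper, policy = value_iteration_bounds(P_example, r_example)
```

In this example the greedy policy picks action 1 in both states, and both bounds settle at 0.9 × 2 = 1.8, the long-run fraction of time spent in state 1 times its return. Any policy that is greedy once `upper - lower < eps` has an average return within `eps` of the maximum, which is the sense in which such schemes produce ε-optimal policies.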
