Optimization procedures for Markovian and semi-Markovian decision processes
| dc.contributor.author | Ren, Zhi-Zhong Oscar | en_US |
| dc.date.accessioned | 2024-08-15T17:36:38Z | |
| dc.date.available | 2024-08-15T17:36:38Z | |
| dc.date.copyright | 1994 | en_US |
| dc.date.issued | 1994 | |
| dc.degree.department | Department of Mathematics and Statistics | |
| dc.degree.level | Master of Science M.Sc. | en |
| dc.description.abstract | In this paper, we investigate both Markovian decision processes (MDP) and semi-Markovian decision processes (Semi-MDP), for either discrete or continuous time, and with or without discounting. Attention is focused primarily on the determination of optimal strategy in MDP or Semi-MDP with finite states and finite action space. The structures of system rewards in terms of yields and bonuses associated with state occupancies, transitions among the states in the process and the action taken in each state are presented and incorporated into the appropriate optimization criteria and algorithms. The existence of optimal stationary strategies for an infinite horizon are noted and used in the algorithms for different cases. The different policy iteration methods involving the appropriate policy improveĀment algorithms (PIA) as well as the value determination operations (VDO) used to obtain the optimal stationary strategies and the total expected return values or average gain value per unit time over infinite time horizons are presented; MoreĀ over, the value iteration procedure (VIP) used to obtain optimal time-dependent strategies and the corresponding optimized total expected return values for discrete time MDP or Semi-MDP over finite horizons are also presented. All the algorithms are fully discussed and a number of examples are presented through this paper. In addition to the discussion of some of the underlying theory and properties of MDP and Semi-MDP, this paper also provides a set of programs written and tested in Maple for implementing the various optimization algorithm. | en |
| dc.format.extent | 105 pages | |
| dc.identifier.uri | https://hdl.handle.net/1828/19402 | |
| dc.rights | Available to the World Wide Web | en_US |
| dc.title | Optimization procedures for Markovian and semi-Markovian decision processes | en_US |
| dc.type | Thesis | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- REN_ZHI_ZHONG_MSc_1994_677221.pdf
- Size:
- 2.31 MB
- Format:
- Adobe Portable Document Format