The increasing complexity of engineering systems has motivated continuing research on computational learning methods toward making autonomous intelligent systems that can learn how to improve their performance over time while interacting with their environment. These systems need not only to sense their environment, but also to integrate information from the environment into all decision-makings. The evolution of such systems is modeled as an unknown controlled Markov chain. In a previous research, the predictive optimal decision-making (POD) model was developed, aiming to learn in real time the unknown transition probabilities and associated costs over a varying finite time horizon. In this paper, the convergence of the POD to the stationary distribution of a Markov chain is proven, thus establishing the POD as a robust model for making autonomous intelligent systems. This paper provides the conditions that the POD can be valid, and be an interpretation of its underlying structure.
Skip Nav Destination
Article navigation
July 2009
Research Papers
Convergence Properties of a Computational Learning Model for Unknown Markov Chains
Andreas A. Malikopoulos
Andreas A. Malikopoulos
Department of Mechanical Engineering,
amaliko@umich.edu
University of Michigan
, Ann Arbor, MI 48109
Search for other works by this author on:
Andreas A. Malikopoulos
J. Dyn. Sys., Meas., Control. Jul 2009, 131(4): 041011 (7 pages)
Published Online: May 20, 2009
Article history
Received:
March 18, 2008
Revised:
February 4, 2009
Published:
May 20, 2009
Citation
Malikopoulos, A. A. (May 20, 2009). "Convergence Properties of a Computational Learning Model for Unknown Markov Chains." ASME. J. Dyn. Sys., Meas., Control. July 2009; 131(4): 041011. https://doi.org/10.1115/1.3117202
Download citation file:
Get Email Alerts
Hybrid Kinematic-dynamic Sideslip and Friction Estimation
J. Dyn. Sys., Meas., Control
Koopman Model Predictive Control of an Integrated Thermal Management System for Electric Vehicles
J. Dyn. Sys., Meas., Control
Electromagnetic Model of Linear Resonant Actuators
J. Dyn. Sys., Meas., Control (May 2023)
Discrete Robust Control of Robot Manipulators Using an Uncertainty and Disturbance Estimator
J. Dyn. Sys., Meas., Control (May 2023)
Related Articles
A Real-Time Computational Learning Model for Sequential Decision-Making Problems Under Uncertainty
J. Dyn. Sys., Meas., Control (July,2009)
An Approach for Testing Methods for Modeling Uncertainty
J. Mech. Des (September,2006)
A Set of Estimation and Decision Preference Experiments for Exploring Risk Assessment Biases in Engineering Students
ASME J. Risk Uncertainty Part B (March,2023)
Nonlinear Parameters and State Estimation for Adaptive Nonlinear Model Predictive Control Design
J. Dyn. Sys., Meas., Control (April,2016)
Related Proceedings Papers
Related Chapters
Model-Building for Robust Reinforcement Learning
Intelligent Engineering Systems through Artificial Neural Networks, Volume 20
An Bayesian Assessment Model for Equipment Techonlogy State
International Conference on Software Technology and Engineering (ICSTE 2012)
An Approach for System Development Using Evolutionary Probabilistic Strategy and Grammar Rules
Intelligent Engineering Systems through Artificial Neural Networks, Volume 16