Localization and a Distributed Local Optimal Solution Algorithm for a Class of Multi-Agent Markov Decision Processes

  • Chang, Hyeong-Soo (Department of Computer Science and Engineering, Sogang University)
  • Published : 2003.09.01

Abstract

We consider discrete-time factorial Markov Decision Processes (MDPs) in a multiple-decision-maker environment under the infinite-horizon average reward criterion, with a general joint reward structure but a factorial joint state transition structure. We introduce the concept of "localization," whereby the global MDP is localized for each agent so that each agent needs to consider only a local MDP defined on its own state and action spaces. Based on this, we present a gradient-ascent-like iterative distributed algorithm that converges to a locally optimal solution of the global MDP. The solution is an autonomous joint policy in that each agent's decision is based only on its local state.
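To make the setting concrete, the following is a minimal sketch, not the paper's actual algorithm: it assumes a toy two-agent problem in which the joint transition factorizes into per-agent kernels while the reward depends on the joint state and action, and it runs a coordinate-ascent loop in which each agent improves its own local policy (a map from its local state to a local action) while the other agent's policy is held fixed, stopping at a joint policy no single agent can unilaterally improve. All names here (P_local, R, average_reward, joint_chain) are illustrative constructs, not notation from the paper.

```python
import itertools
import numpy as np

# Toy problem: 2 agents, each with 2 local states and 2 local actions.
# Local transition kernels P_i[a_i][s_i, s_i'] -- the joint transition
# factorizes as a product of these (the "factorial" transition structure).
n_states, n_actions = 2, 2
rng = np.random.default_rng(0)

def random_kernel():
    P = rng.random((n_actions, n_states, n_states))
    return P / P.sum(axis=2, keepdims=True)

P_local = [random_kernel(), random_kernel()]                 # one kernel per agent
R = rng.random((n_states, n_states, n_actions, n_actions))   # general joint reward r(s1, s2, a1, a2)

def joint_chain(policies):
    """Transition matrix and reward vector over joint states under fixed local policies."""
    joint_states = list(itertools.product(range(n_states), repeat=2))
    P = np.zeros((len(joint_states), len(joint_states)))
    r = np.zeros(len(joint_states))
    for i, (s1, s2) in enumerate(joint_states):
        a1, a2 = policies[0][s1], policies[1][s2]
        r[i] = R[s1, s2, a1, a2]
        for j, (t1, t2) in enumerate(joint_states):
            P[i, j] = P_local[0][a1, s1, t1] * P_local[1][a2, s2, t2]
    return P, r

def average_reward(policies):
    """Long-run average reward: stationary distribution of the induced chain times the reward."""
    P, r = joint_chain(policies)
    evals, evecs = np.linalg.eig(P.T)
    pi = np.real(evecs[:, np.argmax(np.real(evals))])
    pi = np.abs(pi) / np.abs(pi).sum()
    return float(pi @ r)

# Coordinate ascent over agents: each agent searches its own (small) space of
# deterministic local policies with the other agent's policy held fixed.
policies = [np.zeros(n_states, dtype=int), np.zeros(n_states, dtype=int)]
improved = True
while improved:
    improved = False
    for agent in range(2):
        best = average_reward(policies)
        for candidate in itertools.product(range(n_actions), repeat=n_states):
            trial = [p.copy() for p in policies]
            trial[agent] = np.array(candidate)
            val = average_reward(trial)
            if val > best + 1e-12:
                best, policies, improved = val, trial, True

print("locally optimal joint policy:", [p.tolist() for p in policies],
      "average reward:", round(average_reward(policies), 4))
```

Because each accepted change strictly increases the average reward and there are finitely many deterministic joint policies, the loop terminates at a local optimum; the resulting joint policy is autonomous in the sense described above, since each agent's action depends only on its own local state.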
