Hexagon-Based Q-Learning Algorithm and Applications

  • Yang, Hyun-Chang (School of Electrical and Electronics Engineering, Chung-Ang University) ;
  • Kim, Ho-Duck (School of Electrical and Electronics Engineering, Chung-Ang University) ;
  • Yoon, Han-Ul (School of Electrical and Electronics Engineering, University of Illinois at Urbana-Champaign) ;
  • Jang, In-Hun (School of Electrical and Electronics Engineering, Chung-Ang University) ;
  • Sim, Kwee-Bo (School of Electrical and Electronics Engineering, Chung-Ang University)
  • Published : 2007.10.31

Abstract

This paper presents a hexagon-based Q-learning algorithm for finding a hidden target object with multiple robots. An experimental environment was designed with five small mobile robots, obstacles, and a target object. The robots searched for the target object while navigating a hallway where obstacles were strategically placed. The experiment employed two control algorithms: an area-based action making (ABAM) process to determine each robot's next action, and hexagon-based Q-learning to enhance the ABAM process.
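
The abstract indicates tabular Q-learning over a hexagonal action structure. The following is a minimal sketch under that assumption, not the authors' implementation: the six-direction action set, the epsilon-greedy policy, and the constants ALPHA, GAMMA, and EPSILON are illustrative choices, and the state encoding and reward signal are left to the caller.

  # Minimal tabular Q-learning sketch with a six-direction (hexagonal) action set.
  # Assumed details: action indices 0-5 (one per hexagon edge), epsilon-greedy
  # exploration, and the learning-rate/discount constants below.
  import random
  from collections import defaultdict

  HEX_ACTIONS = list(range(6))        # six moves, one per hexagon edge (assumed)
  ALPHA, GAMMA, EPSILON = 0.1, 0.9, 0.2

  Q = defaultdict(float)              # Q[(state, action)] -> estimated value

  def choose_action(state):
      """Epsilon-greedy selection over the six hexagonal directions."""
      if random.random() < EPSILON:
          return random.choice(HEX_ACTIONS)
      return max(HEX_ACTIONS, key=lambda a: Q[(state, a)])

  def update(state, action, reward, next_state):
      """One-step Q-learning backup: Q <- Q + alpha*(r + gamma*max Q' - Q)."""
      best_next = max(Q[(next_state, a)] for a in HEX_ACTIONS)
      Q[(state, action)] += ALPHA * (reward + GAMMA * best_next - Q[(state, action)])

In use, each robot would call choose_action with its current (hexagonal cell) state, execute the move suggested by the ABAM process or the learned policy, observe the reward, and then call update with the resulting transition.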

Keywords

  15. H. U. Yoon, S. H. Hwang, D. W. Kim, D. H. Lee, and K. B. Sim, 'Robotic agent design and application in the ubiquitous intelligent space,' Journal of Control, Automation, and Systems Engineering (Korean), vol. 11, no. 12, pp. 1039-1044, 2005 https://doi.org/10.5302/J.ICROS.2005.11.12.1039