Search | Korea Science

Comparison of value-based Reinforcement Learning Algorithms in Cart-Pole Environment

Byeong-Chan Han;Ho-Chan Kim;Min-Jae Kang
- International Journal of Internet, Broadcasting and Communication
- /
- v.15 no.3
- /
- pp.166-175
- /
- 2023
Reinforcement learning can be applied to a wide variety of problems. However, the fundamental limitation of reinforcement learning is that it is difficult to derive an answer within a given time because the problems in the real world are too complex. Then, with the development of neural network technology, research on deep reinforcement learning that combines deep learning with reinforcement learning is receiving lots of attention. In this paper, two types of neural networks are combined with reinforcement learning and their characteristics were compared and analyzed with existing value-based reinforcement learning algorithms. Two types of neural networks are FNN and CNN, and existing reinforcement learning algorithms are SARSA and Q-learning.
https://doi.org/10.7236/IJIBC.2023.15.3.166 인용 PDF

Predicting bond strength of corroded reinforcement by deep learning

Tanyildizi, Harun
- Computers and Concrete
- /
- v.29 no.3
- /
- pp.145-159
- /
- 2022
In this study, the extreme learning machine and deep learning models were devised to estimate the bond strength of corroded reinforcement in concrete. The six inputs and one output were used in this study. The compressive strength, concrete cover, bond length, steel type, diameter of steel bar, and corrosion level were selected as the input variables. The results of bond strength were used as the output variable. Moreover, the Analysis of variance (Anova) was used to find the effect of input variables on the bond strength of corroded reinforcement in concrete. The prediction results were compared to the experimental results and each other. The extreme learning machine and the deep learning models estimated the bond strength by 99.81% and 99.99% accuracy, respectively. This study found that the deep learning model can be estimated the bond strength of corroded reinforcement with higher accuracy than the extreme learning machine model. The Anova results found that the corrosion level was found to be the input variable that most affects the bond strength of corroded reinforcement in concrete.
https://doi.org/10.12989/cac.2022.29.3.145 인용 KSCI

A Survey on Deep Reinforcement Learning Libraries (심층강화학습 라이브러리 기술동향)

Shin, S.J.;Cho, C.L.;Jeon, H.S.;Yoon, S.H.;Kim, T.Y.
- Electronics and Telecommunications Trends
- /
- v.34 no.6
- /
- pp.87-99
- /
- 2019
Reinforcement learning is a type of machine learning paradigm that forces agents to repeat the observation-action-reward process to assess and predict the values of possible future action sequences. This allows the agents to incrementally reinforce the desired behavior for a given observation. Thanks to the recent advancements of deep learning, reinforcement learning has evolved into deep reinforcement learning that introduces promising results in various control and optimization domains, such as games, robotics, autonomous vehicles, computing, industrial control, and so on. In addition to this trend, a number of programming libraries have been developed for importing deep reinforcement learning into a variety of applications. In this article, we briefly review and summarize 10 representative deep reinforcement learning libraries and compare them from a development project perspective.
https://doi.org/10.22648/ETRI.2019.J.340608 인용 PDF

Enhanced Machine Learning Algorithms: Deep Learning, Reinforcement Learning, and Q-Learning

Park, Ji Su;Park, Jong Hyuk
- Journal of Information Processing Systems
- /
- v.16 no.5
- /
- pp.1001-1007
- /
- 2020
In recent years, machine learning algorithms are continuously being used and expanded in various fields, such as facial recognition, signal processing, personal authentication, and stock prediction. In particular, various algorithms, such as deep learning, reinforcement learning, and Q-learning, are continuously being improved. Among these algorithms, the expansion of deep learning is rapidly changing. Nevertheless, machine learning algorithms have not yet been applied in several fields, such as personal authentication technology. This technology is an essential tool in the digital information era, walking recognition technology as promising biometrics, and technology for solving state-space problems. Therefore, algorithm technologies of deep learning, reinforcement learning, and Q-learning, which are typical machine learning algorithms in various fields, such as agricultural technology, personal authentication, wireless network, game, biometric recognition, and image recognition, are being improved and expanded in this paper.
https://doi.org/10.3745/JIPS.02.0139 인용 PDF KSCI

Improved Deep Q-Network Algorithm Using Self-Imitation Learning (Self-Imitation Learning을 이용한 개선된 Deep Q-Network 알고리즘)

Sunwoo, Yung-Min;Lee, Won-Chang
- Journal of IKEEE
- /
- v.25 no.4
- /
- pp.644-649
- /
- 2021
Self-Imitation Learning is a simple off-policy actor-critic algorithm that makes an agent find an optimal policy by using past good experiences. In case that Self-Imitation Learning is combined with reinforcement learning algorithms that have actor-critic architecture, it shows performance improvement in various game environments. However, its applications are limited to reinforcement learning algorithms that have actor-critic architecture. In this paper, we propose a method of applying Self-Imitation Learning to Deep Q-Network which is a value-based deep reinforcement learning algorithm and train it in various game environments. We also show that Self-Imitation Learning can be applied to Deep Q-Network to improve the performance of Deep Q-Network by comparing the proposed algorithm and ordinary Deep Q-Network training results.
https://doi.org/10.7471/ikeee.2021.25.4.644 인용 PDF KSCI

Digital Twin and Visual Object Tracking using Deep Reinforcement Learning (심층 강화학습을 이용한 디지털트윈 및 시각적 객체 추적)

Park, Jin Hyeok;Farkhodov, Khurshedjon;Choi, Piljoo;Lee, Suk-Hwan;Kwon, Ki-Ryong
- Journal of Korea Multimedia Society
- /
- v.25 no.2
- /
- pp.145-156
- /
- 2022
Nowadays, the complexity of object tracking models among hardware applications has become a more in-demand duty to complete in various indeterminable environment tracking situations with multifunctional algorithm skills. In this paper, we propose a virtual city environment using AirSim (Aerial Informatics and Robotics Simulation - AirSim, CityEnvironment) and use the DQN (Deep Q-Learning) model of deep reinforcement learning model in the virtual environment. The proposed object tracking DQN network observes the environment using a deep reinforcement learning model that receives continuous images taken by a virtual environment simulation system as input to control the operation of a virtual drone. The deep reinforcement learning model is pre-trained using various existing continuous image sets. Since the existing various continuous image sets are image data of real environments and objects, it is implemented in 3D to track virtual environments and moving objects in them.
https://doi.org/10.9717/kmms.2022.25.2.145 인용 PDF KSCI HTML

Controller Learning Method of Self-driving Bicycle Using State-of-the-art Deep Reinforcement Learning Algorithms

Choi, Seung-Yoon;Le, Tuyen Pham;Chung, Tae-Choong
- Journal of the Korea Society of Computer and Information
- /
- v.23 no.10
- /
- pp.23-31
- /
- 2018
Recently, there have been many studies on machine learning. Among them, studies on reinforcement learning are actively worked. In this study, we propose a controller to control bicycle using DDPG (Deep Deterministic Policy Gradient) algorithm which is the latest deep reinforcement learning method. In this paper, we redefine the compensation function of bicycle dynamics and neural network to learn agents. When using the proposed method for data learning and control, it is possible to perform the function of not allowing the bicycle to fall over and reach the further given destination unlike the existing method. For the performance evaluation, we have experimented that the proposed algorithm works in various environments such as fixed speed, random, target point, and not determined. Finally, as a result, it is confirmed that the proposed algorithm shows better performance than the conventional neural network algorithms NAF and PPO.
https://doi.org/10.9708/jksci.2018.23.10.023 인용 PDF KSCI

Deep Reinforcement Learning in ROS-based autonomous robot navigation

Roland, Cubahiro;Choi, Donggyu;Jang, Jongwook
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2022.05a
- /
- pp.47-49
- /
- 2022
Robot navigation has seen a major improvement since the the rediscovery of the potential of Artificial Intelligence (AI) and the attention it has garnered in research circles. A notable achievement in the area was Deep Learning (DL) application in computer vision with outstanding daily life applications such as face-recognition, object detection, and more. However, robotics in general still depend on human inputs in certain areas such as localization, navigation, etc. In this paper, we propose a study case of robot navigation based on deep reinforcement technology. We look into the benefits of switching from traditional ROS-based navigation algorithms towards machine learning approaches and methods. We describe the state-of-the-art technology by introducing the concepts of Reinforcement Learning (RL), Deep Learning (DL) and DRL before before focusing on visual navigation based on DRL. The case study preludes further real life deployment in which mobile navigational agent learns to navigate unbeknownst areas.
PDF

Mapless Navigation with Distributional Reinforcement Learning (분포형 강화학습을 활용한 맵리스 네비게이션)

Van Manh Tran;Gon-Woo Kim
- The Journal of Korea Robotics Society
- /
- v.19 no.1
- /
- pp.92-97
- /
- 2024
This paper provides a study of distributional perspective on reinforcement learning for application in mobile robot navigation. Mapless navigation algorithms based on deep reinforcement learning are proven to promising performance and high applicability. The trial-and-error simulations in virtual environments are encouraged to implement autonomous navigation due to expensive real-life interactions. Nevertheless, applying the deep reinforcement learning model in real tasks is challenging due to dissimilar data collection between virtual simulation and the physical world, leading to high-risk manners and high collision rate. In this paper, we present distributional reinforcement learning architecture for mapless navigation of mobile robot that adapt the uncertainty of environmental change. The experimental results indicate the superior performance of distributional soft actor critic compared to conventional methods.
https://doi.org/10.7746/jkros.2024.19.1.092 인용 PDF

A Comparison of Deep Reinforcement Learning and Deep learning for Complex Image Analysis

Khajuria, Rishi;Quyoom, Abdul;Sarwar, Abid
- Journal of Multimedia Information System
- /
- v.7 no.1
- /
- pp.1-10
- /
- 2020
The image analysis is an important and predominant task for classifying the different parts of the image. The analysis of complex image analysis like histopathological define a crucial factor in oncology due to its ability to help pathologists for interpretation of images and therefore various feature extraction techniques have been evolved from time to time for such analysis. Although deep reinforcement learning is a new and emerging technique but very less effort has been made to compare the deep learning and deep reinforcement learning for image analysis. The paper highlights how both techniques differ in feature extraction from complex images and discusses the potential pros and cons. The use of Convolution Neural Network (CNN) in image segmentation, detection and diagnosis of tumour, feature extraction is important but there are several challenges that need to be overcome before Deep Learning can be applied to digital pathology. The one being is the availability of sufficient training examples for medical image datasets, feature extraction from whole area of the image, ground truth localized annotations, adversarial effects of input representations and extremely large size of the digital pathological slides (in gigabytes).Even though formulating Histopathological Image Analysis (HIA) as Multi Instance Learning (MIL) problem is a remarkable step where histopathological image is divided into high resolution patches to make predictions for the patch and then combining them for overall slide predictions but it suffers from loss of contextual and spatial information. In such cases the deep reinforcement learning techniques can be used to learn feature from the limited data without losing contextual and spatial information.
https://doi.org/10.33851/JMIS.2020.7.1.1 인용 PDF KSCI HTML

Search Result 193, Processing Time 0.041 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)