A METHOD FOR TRAINING AN INTELLIGENT AGENT USING DOUBLE DQN NETWORKS, WAYPOINTS AND REWARD FUNCTION
The problems of increasing the control efficiency of autonomous vehicles are considered. The problem of reducing the required computational resources for the intelligent vehicle control module is highlighted. A neural network training algorithm for Double DQN architecture with modified reward functions is proposed. The basis of the proposed solution is the use of lane segmentation, reward function and the use of additional waypoints in training. A software model has been developed and simulation of the learning process has been performed. The results obtained from a comparative analysis with known solutions show a stable increase in episode duration, and effective training in a realistic urban simulation. The study points to the possibility of reducing the need for high computing power, which will enable the use of central processing units (CPUs) for basic functions of unmanned vehicles instead of graphics processing units (GPUs).
Tikhonov M.K., Nepomnyashchiy D.O., Khaidukova V.N. A method for training an intelligent agent using Double DQN networks, waypoints and reward function // Research result. Information technologies. – Т. 9, №3, 2024. – P. 63-72. DOI: 10.18413/2518-1092-2024-9-3-0-7
While nobody left any comments to this publication.
You can be first.