<?xml version='1.0' encoding='utf-8'?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd">
<article article-type="research-article" dtd-version="1.2" xml:lang="ru" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><front><journal-meta><journal-id journal-id-type="issn">2518-1092</journal-id><journal-title-group><journal-title>Научный результат. Информационные технологии</journal-title></journal-title-group><issn pub-type="epub">2518-1092</issn></journal-meta><article-meta><article-id pub-id-type="doi">10.18413/2518-1092-2024-9-3-0-7</article-id><article-id pub-id-type="publisher-id">3561</article-id><article-categories><subj-group subj-group-type="heading"><subject>ИСКУССТВЕННЫЙ ИНТЕЛЛЕКТ И ПРИНЯТИЕ РЕШЕНИЙ</subject></subj-group></article-categories><title-group><article-title>МЕТОД ОБУЧЕНИЯ ИНТЕЛЛЕКТУАЛЬНОГО АГЕНТА С ПОМОЩЬЮ СЕТЕЙ DOUBLE DQN, ПУТЕВЫХ ТОЧЕК И ФУНКЦИИ ВОЗНАГРАЖДЕНИЯ</article-title><trans-title-group xml:lang="en"><trans-title>A METHOD FOR TRAINING AN INTELLIGENT AGENT USING DOUBLE DQN NETWORKS, WAYPOINTS AND REWARD FUNCTION</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author"><name-alternatives><name xml:lang="ru"><surname>Тихонов</surname><given-names>Максим Константинович</given-names></name><name xml:lang="en"><surname>Tikhonov</surname><given-names>Maksim Konstantinovich</given-names></name></name-alternatives><email>samualgame@gmail.com</email></contrib><contrib contrib-type="author"><name-alternatives><name xml:lang="ru"><surname>Непомнящий</surname><given-names>Дмитрий Олегович</given-names></name><name xml:lang="en"><surname>Nepomnyashchiy</surname><given-names>Dmitry Olegovich</given-names></name></name-alternatives></contrib><contrib contrib-type="author"><name-alternatives><name xml:lang="ru"><surname>Хайдукова</surname><given-names>Валерия Николаевна</given-names></name><name xml:lang="en"><surname>Khaidukova</surname><given-names>Valeria Nikolaevna</given-names></name></name-alternatives></contrib></contrib-group><pub-date pub-type="epub"><year>2024</year></pub-date><volume>9</volume><issue>3</issue><fpage>0</fpage><lpage>0</lpage><self-uri content-type="pdf" xlink:href="/media/information/2024/3/НР_ИТ_9_3_7.pdf" /><abstract xml:lang="ru"><p>Рассмотрены задачи повышения эффективности управления автономными транспортными средствами. Выделена проблема снижения требуемых вычислительных ресурсов для интеллектуального модуля управления автомобилем. Предложен алгоритм обучения нейронной сети для архитектуры Double DQN с модифицированной функцией вознаграждения. Основой предлагаемого решения являются сегментация полосы движения, функция вознаграждения и использование дополнительных путевых точек при обучении. Разработана программная модель и выполнено моделирование процесса обучения. Полученные результаты сравнительного анализа с известными решениями показывают стабильное повышение длительности эпизода и эффективное обучение в реалистичной городской симуляции. Исследование указывает на возможность уменьшения необходимости в высокой вычислительной мощности, что позволит использовать центральные процессоры (CPU) вместо графических процессоров (GPU) для основных функций беспилотных автомобилей.</p></abstract><trans-abstract xml:lang="en"><p>The problems of increasing the control efficiency of autonomous vehicles are considered. The problem of reducing the computational resources required by the intelligent vehicle control module is highlighted. A neural network training algorithm for the Double DQN architecture with a modified reward function is proposed. The proposed solution is based on lane segmentation, a reward function, and the use of additional waypoints during training. A software model has been developed and the learning process has been simulated. 
The results of a comparative analysis with known solutions show a stable increase in episode duration and effective training in a realistic urban simulation. The study points to the possibility of reducing the need for high computing power, which would make it possible to use central processing units (CPUs) instead of graphics processing units (GPUs) for the core functions of self-driving cars.</p></trans-abstract><kwd-group xml:lang="ru"><kwd>беспилотный автомобиль</kwd><kwd>полоса движения</kwd><kwd>интеллектуальное управление</kwd><kwd>агент</kwd><kwd>нейронная сеть</kwd><kwd>глубокое Q-обучение</kwd><kwd>симуляция</kwd></kwd-group><kwd-group xml:lang="en"><kwd>self-driving car</kwd><kwd>lane following</kwd><kwd>intelligent control</kwd><kwd>agent</kwd><kwd>neural network</kwd><kwd>deep Q-learning</kwd><kwd>simulation</kwd></kwd-group></article-meta></front><back><ref-list><title>Список литературы</title><ref id="B1"><mixed-citation>Final Rule Occupant Protection Amendment Automated Vehicles [Электронный ресурс]. URL: https://www.nhtsa.gov/document/final-rule-occupant-protection-amendment-automated-vehicles (дата обращения 10.09.2023).</mixed-citation></ref><ref id="B2"><mixed-citation>RAND Corporation [Электронный ресурс]. URL: https://www.rand.org/ (дата обращения 23.09.2023).</mixed-citation></ref><ref id="B3"><mixed-citation>Malayjerdi M. et al. Virtual simulations environment development for autonomous vehicles interaction // ASME International Mechanical Engineering Congress and Exposition. – American Society of Mechanical Engineers, 2020. – Т. 84492. – С. V02BT02A009.</mixed-citation></ref><ref id="B4"><mixed-citation>Young R. A. Automated Driving System Safety: Miles for 95% Confidence in “Vision Zero” // SAE International Journal of Advances and Current Practices in Mobility. – 2020. – Т. 2. – №. 2020-01-1205. – С. 3454-3480.</mixed-citation></ref><ref id="B5"><mixed-citation>Jo K. et al. Development of autonomous car—Part II: A case study on the implementation of an autonomous driving system based on distributed architecture // IEEE Transactions on Industrial Electronics. – 2015. – Т. 62. – №. 8. – С. 5119-5132.</mixed-citation></ref><ref id="B6"><mixed-citation>Rodrigues P., Vieira S. Optimizing agent training with deep q-learning on a self-driving reinforcement learning environment // 2020 IEEE Symposium Series on Computational Intelligence (SSCI). – IEEE, 2020. – С. 745-752.</mixed-citation></ref><ref id="B7"><mixed-citation>Chishti S. O. A. et al. Self-driving cars using CNN and Q-learning // 2018 IEEE 21st International Multi-Topic Conference (INMIC). – IEEE, 2018. – С. 1-7.</mixed-citation></ref><ref id="B8"><mixed-citation>Chhikara P. et al. Data dimensionality reduction techniques for Industry 4.0: Research results, challenges, and future research directions // Software: Practice and Experience. – 2022. – Т. 52. – №. 3. – С. 658-688.</mixed-citation></ref><ref id="B9"><mixed-citation>Malekloo A. et al. Machine learning and structural health monitoring overview with emerging technology and high-dimensional data source highlights // Structural Health Monitoring. – 2022. – Т. 21. – №. 4. – С. 1906-1955.</mixed-citation></ref><ref id="B10"><mixed-citation>Peng B. et al. End-to-end autonomous driving through dueling double deep Q-network // Automotive Innovation. – 2021. – Т. 4. – С. 328-337.</mixed-citation></ref><ref id="B11"><mixed-citation>Riboni A., Candelieri A., Borrotti M. Deep Autonomous Agents Comparison for Self-driving Cars // International Conference on Machine Learning, Optimization, and Data Science. 
– Cham: Springer International Publishing, 2021. – С. 201-213.</mixed-citation></ref><ref id="B12"><mixed-citation>Kiran B. R. et al. Deep reinforcement learning for autonomous driving: A survey // IEEE Transactions on Intelligent Transportation Systems. – 2021. – Т. 23. – №. 6. – С. 4909-4926.</mixed-citation></ref><ref id="B13"><mixed-citation>Lin Q. et al. Reachflow: An online safety assurance framework for waypoint-following of self-driving cars // 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). – IEEE, 2020. – С. 6627-6632.</mixed-citation></ref><ref id="B14"><mixed-citation>Szoke L. et al. Skills to drive: Successor features for autonomous highway pilot // IEEE Transactions on Intelligent Transportation Systems. – 2022. – Т. 23. – №. 10. – С. 18707-18718.</mixed-citation></ref><ref id="B15"><mixed-citation>Dosovitskiy A. et al. CARLA: An open urban driving simulator // Conference on Robot Learning. – PMLR, 2017. – С. 1-16.</mixed-citation></ref><ref id="B16"><mixed-citation>Barbour E., McFall K. Autonomous vehicle simulation using open source software Carla // Journal of UAB ECTC. – 2019. – Т. 18.</mixed-citation></ref><ref id="B17"><mixed-citation>GitHub [Электронный ресурс]. URL: https://github.com/rzambrano1/rl-agent-self-driving-carla (дата обращения 22.02.2024).</mixed-citation></ref></ref-list></back></article>