Abstract
Aiming at the problem of low convergence efficiency of traditional multi-UAV path planning algorithms in unknown complex environments, this paper proposes a deep reinforcement learning algorithm incorporating the attention mechanism. The method is based on the Soft Actor-Critic (SAC) framework, which introduces a multi-attention mechanism in the Critic network, dynamically learns the dependency relationship between intelligences, and realizes key information screening and conflict avoidance. An environment with multiple random obstacles is designed to simulate complex emergent situations. The results show that the proposed algorithm significantly improves the mission success rate and average reward, significantly extends the survival time and exploration range of the UAVs, and verifies the effectiveness of the attention mechanism in enhancing the efficiency, robustness, and long-term planning capability of multi-UAV collaboration, as compared to the baseline method that does not use attention.

This work is licensed under a Creative Commons Attribution 4.0 International License.
Copyright (c) 2026 Ziyi Zhu, Ji Huang, Wangye Jiang
Downloads
Publication Facts
Reviewer profiles N/A
Author statements
- Academic society
- China Instrument and Control Society
- Publisher
- China Instrument and Control Society