In this paper, we propose a novel approach to multi-agent underwater source localization by means of Multi-Agent Reinforcement Learning (MARL). Our framework optimizes the trajectories of two autonomous underwater vehicles, each towing an antenna, to maximize the probability of detection of the source. We implement a shared-parameter MARL strategy with non-synchronous actions to address the challenges posed by non-stationary multi-agent environments. We train a neural network on a simplified simulation environment and evaluate it in a realistic simulation engine, demonstrating robustness to communication losses of up to 60%. Our preliminary results indicate that RL-based trajectory optimization can achieve comparable performance to traditional approaches.

doi.org/10.1109/OCEANS58557.2025.11104604
OCEANS 2025 Brest
Centrum Wiskunde & Informatica, Amsterdam (CWI), The Netherlands

Middelkoop, M., Celi, F., Faggiani, A., Hummel, H., Bhulai, S., Tesei, A., … Ferri, G. (2025). Optimizing source localization via reinforcement learning in multi-agent underwater networks. Oceans Conference Record (IEEE). doi:10.1109/OCEANS58557.2025.11104604