TY - JOUR
T1 - Intelligent Motion Control to Enhance the Swimming Performance of a Biomimetic Underwater Vehicle Using Reinforcement Learning Approach
AU - Algarin-Pinto, Juan A.
AU - Castañón, Luis Eduardo Garza
AU - Martínez, Adriana Vargas
AU - Minchala Avila, Luis Ismael
AU - Payeur, Pierre
N1 - Publisher Copyright:
© 2025 The Authors. This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
PY - 2025
Y1 - 2025
N2 - This article develops an intelligent motion control strategy using reinforcement learning to regulate a biomimetic autonomous underwater vehicle (BAUV) swimming performance. The BAUV is driven by the oscillatory motion of a lunate-shaped caudal fin. To navigate effectively, the vehicle must regulate its propeller’s motion to swim toward goals. The caudal fin’s oscillatory motion is defined by sine functions, where the amplitude, frequency, bias, and time shifting are the main flapping parameters that must be regulated. The developed algorithm, based on the n-step state-action-reward-state-action on-policy learning, fine-tunes these parameters. Further, deep Q learning network speed adjusters were designed to enhance BAUV guidance. By integrating waypoint guidance systems and intelligent path trackers, the vehicle reduces its heading deviation and distance to goals. The effectiveness of these methods was validated through simulations featuring various disturbances like longitudinal, lateral, and swirl currents. The proposed scheme allowed the vehicle to reach desired targets efficiently, even in the presence of strong currents.
AB - This article develops an intelligent motion control strategy using reinforcement learning to regulate a biomimetic autonomous underwater vehicle (BAUV) swimming performance. The BAUV is driven by the oscillatory motion of a lunate-shaped caudal fin. To navigate effectively, the vehicle must regulate its propeller’s motion to swim toward goals. The caudal fin’s oscillatory motion is defined by sine functions, where the amplitude, frequency, bias, and time shifting are the main flapping parameters that must be regulated. The developed algorithm, based on the n-step state-action-reward-state-action on-policy learning, fine-tunes these parameters. Further, deep Q learning network speed adjusters were designed to enhance BAUV guidance. By integrating waypoint guidance systems and intelligent path trackers, the vehicle reduces its heading deviation and distance to goals. The effectiveness of these methods was validated through simulations featuring various disturbances like longitudinal, lateral, and swirl currents. The proposed scheme allowed the vehicle to reach desired targets efficiently, even in the presence of strong currents.
KW - Biomimetic autonomous underwater vehicle (BAUV)
KW - deep learning
KW - path tracking control
KW - reinforcement learning
KW - waypoint guidance systems
KW - Biomimetic autonomous underwater vehicle (BAUV)
KW - Deep learning
KW - Path tracking control
KW - Reinforcement learning; waypoint guidance systems
UR - https://publicaciones.ucuenca.edu.ec/ojs/index.php/maskana/article/view/1066
U2 - 10.1109/ACCESS.2025.3544482
DO - 10.1109/ACCESS.2025.3544482
M3 - Artículo
AN - SCOPUS:86000740608
SN - 2169-3536
VL - 13
SP - 37128
EP - 37146
JO - IEEE Access
JF - IEEE Access
ER -