Robust Attitude Control of an Agile Aircraft Using Improved Q-Learning

Zahmatkesh, Mohsen; Emami, Seyyed; Banazadeh, Afshin; Castaldi, Paolo

doi:10.3390/act11120374

Robust Attitude Control of an Agile Aircraft Using Improved Q-Learning

Zahmatkesh, Mohsen; Emami, Seyyed; Banazadeh, Afshin; Castaldi, Paolo

Authors

Mohsen Zahmatkesh mohsen.zahmatkesh@durham.ac.uk
Research Associate in Mechatronics

Seyyed Emami

Afshin Banazadeh

Paolo Castaldi

Abstract

Attitude control of a novel regional truss-braced wing (TBW) aircraft with low stability characteristics is addressed in this paper using Reinforcement Learning (RL). In recent years, RL has been increasingly employed in challenging applications, particularly, autonomous flight control. However, a significant predicament confronting discrete RL algorithms is the dimension limitation of the state-action table and difficulties in defining the elements of the RL environment. To address these issues, in this paper, a detailed mathematical model of the mentioned aircraft is first developed to shape an RL environment. Subsequently, Q-learning, the most prevalent discrete RL algorithm, will be implemented in both the Markov Decision Process (MDP) and Partially Observable Markov Decision Process (POMDP) frameworks to control the longitudinal mode of the proposed aircraft. In order to eliminate residual fluctuations that are a consequence of discrete action selection, and simultaneously track variable pitch angles, a Fuzzy Action Assignment (FAA) method is proposed to generate continuous control commands using the trained optimal Q-table. Accordingly, it will be proved that by defining a comprehensive reward function based on dynamic behavior considerations, along with observing all crucial states (equivalent to satisfying the Markov Property), the air vehicle would be capable of tracking the desired attitude in the presence of different uncertain dynamics including measurement noises, atmospheric disturbances, actuator faults, and model uncertainties where the performance of the introduced control system surpasses a well-tuned Proportional–Integral–Derivative (PID) controller.

Citation

Zahmatkesh, M., Emami, S., Banazadeh, A., & Castaldi, P. (2022). Robust Attitude Control of an Agile Aircraft Using Improved Q-Learning. Actuators, 11(12), Article 374. https://doi.org/10.3390/act11120374

Journal Article Type	Article
Acceptance Date	Dec 6, 2022
Online Publication Date	Dec 12, 2022
Publication Date	2022-12
Deposit Date	Dec 31, 2024
Journal	Actuators
Electronic ISSN	2076-0825
Publisher	MDPI
Peer Reviewed	Peer Reviewed
Volume	11
Issue	12
Article Number	374
DOI	https://doi.org/10.3390/act11120374
Public URL	https://durham-repository.worktribe.com/output/3230264

Robust neuro-adaptive command-filtered back-stepping fault-tolerant control of satellite using composite learning (2024)
Journal Article

T-STAR: Time-Optimal Swarm Trajectory Planning for Quadrotor Unmanned Aerial Vehicles (2025)
Journal Article

A Real-Time RRT-APF Approach for Efficient Multi-Robot Navigation in Complex Environments (2025)
Presentation / Conference Contribution

Downloadable Citations

HTML

BIB

RTF

Authors

Abstract

Citation

You might also like

Downloadable Citations