Dang, Y., Xu, J., Li, D., & Sun, H. (online). A Preference-based Online Reinforcement Learning with Embedded Communication Failure Solutions in Smart Grid. IEEE Transactions on Industrial Informatics, https://doi.org/10.1109/TII.2024.3507203