Skip to main content

Research Repository

Advanced Search

ReLeDP: Reinforcement-Learning-Assisted Dynamic Pricing for Wireless Smart Grid

Zhang, Peiying; Wang, Chao; Aujla, Gagangeet Singh; Batth, Ranbir Singh

ReLeDP: Reinforcement-Learning-Assisted Dynamic Pricing for Wireless Smart Grid Thumbnail


Peiying Zhang

Chao Wang

Ranbir Singh Batth


The smart grid must ensure that power providers can obtain substantial benefits by selling energy, while at the same time, they need to consider the cost of consumers. To realize this win-win situation, the smart grid relies on dynamic pricing mechanisms. However, most of the existing dynamic pricing schemes are based on artificial objective rules or conventional models, which cannot ensure the desired effectiveness. Thus, we apply reinforcement learning to model the supply-demand relationship between power providers and consumers in a smart grid. The dynamic pricing problem of the smart grid is modeled as a discrete Markov decision process, and the decision process is solved by Q-learning. Now, the success of any intelligent dynamic pricing scheme relies on timely data transmission. However, the scale and speed of data generation can create several network bottlenecks that can further reduce the performance of any dynamic pricing scheme. Hence, to overcome this challenge, we have proposed an artificial-intelligence-based adaptive network architecture that adopts software-defined networking. In this architecture, we have used a self-organized map-based traffic classification approach followed by a dynamic virtual network embedding mechanism. We demonstrate the effectiveness of the dynamic pricing strategy supported through adaptive network architecture based on various performance indicators. The outcomes suggest that the proposed strategy is of great significance to realize the sustainability of power energy in the future. Lastly, we discuss various implementation challenges and future directions before concluding the article.


Zhang, P., Wang, C., Aujla, G. S., & Batth, R. S. (2021). ReLeDP: Reinforcement-Learning-Assisted Dynamic Pricing for Wireless Smart Grid. IEEE Wireless Communications, 28(6), 62-69.

Journal Article Type Article
Online Publication Date Dec 31, 2021
Publication Date 2021-12
Deposit Date Feb 12, 2022
Publicly Available Date May 6, 2022
Journal IEEE Wireless Communications
Print ISSN 1536-1284
Electronic ISSN 1558-0687
Publisher Institute of Electrical and Electronics Engineers
Peer Reviewed Peer Reviewed
Volume 28
Issue 6
Pages 62-69


Accepted Journal Article (1.6 Mb)

Copyright Statement
© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

You might also like

Downloadable Citations