TY - JOUR
T1 - A Hybrid DQN and Optimization Approach for Strategy and Resource Allocation in MEC Networks
AU - Wu, Yi Chen
AU - Dinh, Thinh Quang
AU - Fu, Yaru
AU - Lin, Che
AU - Quek, Tony Q.S.
N1 - Funding Information:
Manuscript received June 10, 2020; revised November 16, 2020; accepted January 27, 2021. Date of publication February 17, 2021; date of current version July 12, 2021. This work was supported in part by the Ministry of Science and Technology, Taiwan, under Grant MOST107-2221-E-002-196-MY3, in part by the MOE ARF Tier 2 under Grant T2EP20120-0006, and in part by the SUTD Growth Plan Grant for AI. The associate editor coordinating the review of this article and approving it for publication was M. Payaró. (Corresponding author: Che Lin.) Yi-Chen Wu is with the Institute of Communications Engineering, National Tsing Hua University, Hsinchu City 30013, Taiwan (e-mail: [email protected]).
Funding Information:
ACKNOWLEDGMENT The authors would like to thank the support of time and facilities from the Ho Chi Minh City University of Technology, VNU-HCM, for this study.
Publisher Copyright:
© 2012 IEEE.
PY - 2021/7
Y1 - 2021/7
N2 - We consider a multi-user multi-server mobile edge computing (MEC) network with time-varying fading channels and formulate an offloading decision and resource allocation problem. To solve this mixed-integer non-convex problem, we propose two hybrid approaches that learn offloading strategy with DQN (opt-DQN) or Q-table (opt-QL) at each user equipment (UE). The communication resources are allocated with an optimization algorithm at each computational access point (CAP). We also propose a pure DQN method that learns both the offloading strategy and resource allocation via Q-learning (QL). We analyze the convergence behavior of the QL-based algorithms from a game-theoretical perspective and demonstrate the performance of the proposed hybrid approaches for different network sizes. The simulation results show that the hybrid approaches reach lower costs than other baseline algorithms and the pure-DQN approach. Moreover, the performance of the pure-DQN approach degrades severely as the network size increases, while opt-DQN still performs the best, followed by opt-QL. These observations demonstrate that the hybrid approach that combines the advantages of both QL and convex optimization is a promising design for a multi-user MEC network, wherein complicated offloading and resource allocation strategies need to be determined in a timely and accurate fashion.
AB - We consider a multi-user multi-server mobile edge computing (MEC) network with time-varying fading channels and formulate an offloading decision and resource allocation problem. To solve this mixed-integer non-convex problem, we propose two hybrid approaches that learn offloading strategy with DQN (opt-DQN) or Q-table (opt-QL) at each user equipment (UE). The communication resources are allocated with an optimization algorithm at each computational access point (CAP). We also propose a pure DQN method that learns both the offloading strategy and resource allocation via Q-learning (QL). We analyze the convergence behavior of the QL-based algorithms from a game-theoretical perspective and demonstrate the performance of the proposed hybrid approaches for different network sizes. The simulation results show that the hybrid approaches reach lower costs than other baseline algorithms and the pure-DQN approach. Moreover, the performance of the pure-DQN approach degrades severely as the network size increases, while opt-DQN still performs the best, followed by opt-QL. These observations demonstrate that the hybrid approach that combines the advantages of both QL and convex optimization is a promising design for a multi-user MEC network, wherein complicated offloading and resource allocation strategies need to be determined in a timely and accurate fashion.
UR - http://www.scopus.com/inward/record.url?scp=85100930827&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85100930827&partnerID=8YFLogxK
U2 - 10.1109/TWC.2021.3057882
DO - 10.1109/TWC.2021.3057882
M3 - Article
AN - SCOPUS:85100930827
SN - 1536-1276
VL - 20
SP - 4282
EP - 4295
JO - IEEE Transactions on Wireless Communications
JF - IEEE Transactions on Wireless Communications
IS - 7
M1 - 9356475
ER -