TY - GEN
T1 - Multiagent Autonomous Learning for Distributed Channel Allocation in Wireless Networks
AU - Zafaruddin, Syed Mohammad
AU - Bistritz, Ilai
AU - Leshem, Amir
AU - Niyato, Dusit
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/7
Y1 - 2019/7
N2 - In distributed networks such as ad-hoc and device-to-device (D2D) networks, no base station exists and conveying global channel state information (CSI) between users is costly or simply impractical. When the CSI is time-varying and unknown to the users, the users face the challenge of both learning the channel statistics online and converging to good channel allocation. This introduces a multi-armed bandit (MAB) scenario with multiple decision makers. If two or more users choose the same channel, a collision occurs and they all receive zero reward. We propose a distributed channel allocation algorithm in which each user converges to the optimal allocation while achieving an order optimal regret of O (log T), where T denotes the length of time horizon. The algorithm is based on a carrier sensing multiple access (CSMA) implementation of the distributed auction algorithm. It does not require any exchange of information between users. Users need only to observe a single channel at a time and sense if there is a transmission on that channel, without decoding the transmissions or identifying the transmitting users. We compare the performance of the proposed algorithm with the state-of-the-art scheme using simulations of realistic long term evolution (LTE) channels.
AB - In distributed networks such as ad-hoc and device-to-device (D2D) networks, no base station exists and conveying global channel state information (CSI) between users is costly or simply impractical. When the CSI is time-varying and unknown to the users, the users face the challenge of both learning the channel statistics online and converging to good channel allocation. This introduces a multi-armed bandit (MAB) scenario with multiple decision makers. If two or more users choose the same channel, a collision occurs and they all receive zero reward. We propose a distributed channel allocation algorithm in which each user converges to the optimal allocation while achieving an order optimal regret of O (log T), where T denotes the length of time horizon. The algorithm is based on a carrier sensing multiple access (CSMA) implementation of the distributed auction algorithm. It does not require any exchange of information between users. Users need only to observe a single channel at a time and sense if there is a transmission on that channel, without decoding the transmissions or identifying the transmitting users. We compare the performance of the proposed algorithm with the state-of-the-art scheme using simulations of realistic long term evolution (LTE) channels.
KW - Distributed channel allocation
KW - dynamic spectrum access
KW - multiplayer multi-armed bandit
KW - online learning
KW - resource management
KW - wireless networks
UR - http://www.scopus.com/inward/record.url?scp=85072305275&partnerID=8YFLogxK
U2 - 10.1109/SPAWC.2019.8815567
DO - 10.1109/SPAWC.2019.8815567
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:85072305275
T3 - IEEE Workshop on Signal Processing Advances in Wireless Communications, SPAWC
BT - 2019 IEEE 20th International Workshop on Signal Processing Advances in Wireless Communications, SPAWC 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 20th IEEE International Workshop on Signal Processing Advances in Wireless Communications, SPAWC 2019
Y2 - 2 July 2019 through 5 July 2019
ER -