Distributed Learning and Multiaccess of On-Off Channels

The problem of distributed access of a set of N on-off channels by K ≤ N users is considered. The channels are slotted and modeled as independent but not necessarily identical alternating renewal processes. Each user decides to either observe or transmit at the beginning of every slot. A transmission is successful only if the channel is at the on state and there is only one user transmitting. When a user observes, it identifies whether a transmission would have been successful had it decided to transmit. A distributed learning and access policy referred to as alternating sensing and access (ASA) is proposed. It is shown that ASA has finite expected regret when compared with the optimal centralized scheme with fixed channel allocation.

[1]  J. Wendelberger Adventures in Stochastic Processes , 1993 .

[2]  Qing Zhao,et al.  Distributed Learning in Multi-Armed Bandit With Multiple Players , 2009, IEEE Transactions on Signal Processing.

[3]  Brian M. Sadler,et al.  A Survey of Dynamic Spectrum Access , 2007, IEEE Signal Processing Magazine.

[4]  Francis Comets,et al.  Large Deviations and Applications , 2011, International Encyclopedia of Statistical Science.

[5]  Dimitri P. Bertsekas,et al.  Data Networks , 1986 .

[6]  J. Walrand,et al.  Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards , 1987 .

[7]  Wenhan Dai,et al.  Efficient online learning for opportunistic spectrum access , 2012, 2012 Proceedings IEEE INFOCOM.

[8]  Hai Jiang,et al.  Medium access in cognitive radio networks: A competitive multi-armed bandit framework , 2008, 2008 42nd Asilomar Conference on Signals, Systems and Computers.

[9]  ZhaoQing,et al.  Distributed learning in multi-armed bandit with multiple players , 2010 .

[10]  Ao Tang,et al.  Opportunistic Spectrum Access with Multiple Users: Learning under Competition , 2010, 2010 Proceedings IEEE INFOCOM.

[11]  Lang Tong,et al.  Asymptotically Efficient Multichannel Estimation for Opportunistic Spectrum Access , 2012, IEEE Transactions on Signal Processing.

[12]  Mingyan Liu,et al.  Channel estimation for opportunistic spectrum sensing: Uniform and random sensing , 2010, 2010 Information Theory and Applications Workshop (ITA).

[13]  Xiaoying Gan,et al.  An Estimation Algorithm of Channel State Transition Probabilities for Cognitive Radio Systems , 2008, 2008 3rd International Conference on Cognitive Radio Oriented Wireless Networks and Communications (CrownCom 2008).

[14]  T. L. Lai Andherbertrobbins Asymptotically Efficient Adaptive Allocation Rules , 2022 .

[15]  Yi Gai,et al.  Decentralized Online Learning Algorithms for Opportunistic Spectrum Access , 2011, 2011 IEEE Global Telecommunications Conference - GLOBECOM 2011.