Optimality of Myopic Sensing in Multi-Channel Opportunistic Access

doi:10.48550/arXiv.0811.0637

Optimality of Myopic Sensing in Multi-Channel Opportunistic Access

We consider opportunistic communications over multiple channels where the state ("good" or "bad") of each channel evolves as independent and identically distributed Markov processes. A user, with limited sensing and access capability, chooses one channel to sense and subsequently access (based on the sensed channel state) in each time slot. A reward is obtained when the user senses and accesses a "good" channel. The objective is to design the optimal channel selection policy that maximizes the expected reward accrued over time. This problem can be generally cast as a Partially Observable Markov Decision Process (POMDP) or a restless multi-armed bandit process, to which optimal solutions are often intractable. We show in this paper that the myopic policy, with a simple and robust structure, achieves optimality under certain conditions. This result finds applications in opportunistic communications in fading environment, cognitive radio networks for spectrum overlay, and resource-constrained jamming and anti-jamming.

Publication:

arXiv e-prints

Pub Date:

November 2008

DOI:

10.48550/arXiv.0811.0637

arXiv:

arXiv:0811.0637

Bibcode:

2008arXiv0811.0637A

Keywords:

Computer Science - Networking and Internet Architecture;
Computer Science - Information Theory

E-Print:

Revised version

NASA/ADS

Optimality of Myopic Sensing in Multi-Channel Opportunistic Access

Abstract