Distributed Event-Triggered Bandit Convex Optimization with Time-Varying Constraints

doi:10.48550/arXiv.2406.14060

Distributed Event-Triggered Bandit Convex Optimization with Time-Varying Constraints

This paper considers the distributed bandit convex optimization problem with time-varying inequality constraints over a network of agents, where the goal is to minimize network regret and cumulative constraint violation. Existing distributed online algorithms require that each agent broadcasts its decision to its neighbors at each iteration. To better utilize the limited communication resources, we propose a distributed event-triggered online primal--dual algorithm with two-point bandit feedback. Under several classes of appropriately chosen decreasing parameter sequences and non-increasing event-triggered threshold sequences, we establish dynamic network regret and network cumulative constraint violation bounds. These bounds are comparable to the results achieved by distributed event-triggered online algorithms with full-information feedback. Finally, a numerical example is provided to verify the theoretical results.

Publication:

arXiv e-prints

Pub Date:

June 2024

DOI:

10.48550/arXiv.2406.14060

arXiv:

arXiv:2406.14060

Bibcode:

2024arXiv240614060Z

Keywords:

Mathematics - Optimization and Control

E-Print:

34 pages, 4 figures. arXiv admin note: text overlap with arXiv:2311.01957

NASA/ADS

Distributed Event-Triggered Bandit Convex Optimization with Time-Varying Constraints

Abstract