Swap-regret-minimizing bandits for distributed network optimization

Huang, Zhiming

dc.contributor.author	Huang, Zhiming
dc.contributor.supervisor	Pan, Jianping
dc.date.accessioned	2025-09-04T19:18:22Z
dc.date.available	2025-09-04T19:18:22Z
dc.date.issued	2025
dc.degree.department	Department of Computer Science
dc.degree.level	Doctor of Philosophy (PhD)
dc.description.abstract	Modern networked systems—ranging from real-time communication platforms to distributed computing infrastructures—operate in increasingly dynamic and strategic environments, where traditional optimization methods often fall short. This dissertation develops a new algorithmic framework for distributed network optimization grounded in game-theoretic bandit learning. We model fundamental problems, such as congestion control and resource allocation, as repeated games involving strategic agents who receive only partial (bandit) feedback. Motivated by practical challenges in computer networks, we design and analyze algorithms that not only minimize regret but also steer collective behavior toward equilibrium. The contributions of this dissertation are threefold. First, we propose a new framework based on swap-regret minimization and online mirror descent, and establish high-probability regret bounds in multi-player bandit settings. These results guarantee convergence to correlated equilibria under decentralized, partial-information feedback. Second, we introduce optimistic learning techniques to accelerate convergence by leveraging predictability in the environment. Third, we apply our algorithms to real-world networking tasks, including TCP congestion control, and demonstrate improved stability, throughput, and fairness through extensive trace-driven emulations. Together, these contributions bridge the theoretical foundations of online learning and game theory with practical considerations in network protocol design, offering robust tools for decentralized decision-making in uncertain and adversarial environments.
dc.description.scholarlevel	Graduate
dc.identifier.uri	https://hdl.handle.net/1828/22713
dc.language	English	eng
dc.language.iso	en
dc.rights	Available to the World Wide Web
dc.subject	Multi-Armed Bandits
dc.subject	Online Learning
dc.subject	Game Theory
dc.subject	Correlated Equilibrium
dc.subject	Network Optimization
dc.subject	Congestion Control
dc.title	Swap-regret-minimizing bandits for distributed network optimization
dc.type	Thesis

Files

Original bundle

Name: Huang_Zhiming_PhD_2025.pdf
Size: 2.74 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 1.62 KB
Format: Item-specific license agreed upon to submission

Collections

Electronic Theses and Dissertations (ETD)