Thresholded linear bandits

dc.contributor.author	Nguyen, Trang Thu
dc.contributor.supervisor	Mehta, Nishant
dc.date.accessioned	2025-05-02T20:30:52Z
dc.date.available	2025-05-02T20:30:52Z
dc.date.issued	2025
dc.degree.department	Department of Computer Science
dc.degree.level	Master of Science MSc
dc.description.abstract	Thresholded linear bandits is a novel bandit problem that lies at the intersection of several important multiarmed bandit (MAB) variants, including active learning, structured bandits, and learning halfspaces. One way to achieve sublinear regret in the presence of exponentially many arms is to exploit the structure of the reward function. However, the presence of an unknown threshold component makes previously known algorithms for structured bandits unsuitable. Moreover, the threshold introduces a discontinuity into the reward function, making the problem significantly more difficult. In this thesis, we study the variant of the thresholded linear bandits problem defined by a union of axis-parallel halfspaces. We propose an algorithm that achieves sublinear regret and provide theoretical guarantees on its performance.
dc.description.scholarlevel	Graduate
dc.identifier.uri	https://hdl.handle.net/1828/22112
dc.language	English	eng
dc.language.iso	en
dc.rights	Available to the World Wide Web
dc.subject	multiarmed bandits
dc.subject	machine learning theory
dc.subject	thresholded bandits
dc.title	Thresholded linear bandits
dc.type	Thesis
