Viktor Bengs
Dr.
We consider the combinatorial bandit problem with semibandit feedback under finite sampling budget, where the action is to choose a set of arms in a non-stochastic setting with subset-dependent feedback. We propose an algorithmic framework to solve it, which, additionally, can be leveraged for the algorithm configuration problem, where the goal is to find an optimal parameter configuration for a given target algorithm. We showcase that our introduced algorithm requires significantly less computation time than other existing theoretically-grounded approaches while still yielding high-quality configurations.
inproceedings BSB+20
BibTeXKey: BSB+20