Relative upper confidence bound for the k-armed dueling bandit problem
Reference
Zoghi, M., Whiteson, S., Munos, R. and de Rijke, M. (2014) ‘Relative upper confidence bound for the k-armed dueling bandit problem’, Thirty-Firt International Conference on Machine Learning, , pp. 10–18.
Full text available at jmlr.org
Author
![]() | Prof. Shimon Whiteson |