Relative upper confidence bound for the k-armed dueling bandit problem

Date: 2014

Reference

Zoghi, M., Whiteson, S., Munos, R. and de Rijke, M. (2014) ‘Relative upper confidence bound for the k-armed dueling bandit problem’, Thirty-First International Conference on Machine Learning, pp. 10–18.

Full text available at jmlr.org

Author

Prof. Shimon Whiteson
