Reducing dueling bandits to cardinal bandits, Proceedings of The 31st International Conference on Machine Learning, pp.856-864, 2014. ,
Seeking Subjective Dominance in Multidimensional Space: An Explanation of the Asymmetric Dominance Effect, Organizational Behavior and Human Decision Processes, vol.63, issue.3, pp.223-232, 1995. ,
DOI : 10.1006/obhd.1995.1075
Instance-dependent regret bounds for dueling bandits, Conference on Learning Theory, pp.336-360, 2016. ,
Sorting and Selection in Posets, SIAM Journal on Computing, vol.40, issue.3, pp.597-622, 2011. ,
DOI : 10.1137/070697720
URL : http://www.cs.berkeley.edu/~samr/pubs/poset-arxivver.pdf
Designing multi-objective multi-armed bandits algorithms: a study, Neural Networks (IJCNN) The 2013 International Joint Conference on, pp.1-8, 2013. ,
Contextual dueling bandits, Conference on Learning Theory, pp.563-587, 2015. ,
Action elimination and stopping conditions for the multiarmed bandit and reinforcement learning problems, The Journal of Machine Learning Research, vol.7, pp.1079-1105, 2006. ,
Computing with Noisy Information, SIAM Journal on Computing, vol.23, issue.5, pp.1001-1018, 1994. ,
DOI : 10.1137/S0097539791195877
The movielens datasets: History and context, ACM Transactions on Interactive Intelligent Systems (TiiS), vol.5, issue.4, p.19, 2015. ,
Asymmetric decoy effects on lower-quality versus higher-quality brands: Meta-analytic and experimental evidence, Journal of Consumer Research, vol.22, issue.3, pp.268-284, 1995. ,
Adding Asymmetrically Dominated Alternatives: Violations of Regularity and the Similarity Hypothesis, Journal of Consumer Research, vol.9, issue.1, pp.90-98, 1982. ,
DOI : 10.1086/208899
Regret lower bound and optimal algorithm in dueling bandit problem, Conference on Learning Theory, pp.1141-1154, 2015. ,
Copeland dueling bandit problem: Regret lower bound, optimal algorithm, and computationally efficient algorithm, 2016. ,
Dueling bandits: Beyond condorcet winners to general tournament solutions, Advances in Neural Information Processing Systems, pp.1253-1261, 2016. ,
Contextual and Procedural Determinants of Partner Selection: Of Asymmetric Dominance and Prominence, Social Cognition, vol.17, issue.2, pp.118-139, 1999. ,
DOI : 10.1521/soco.1999.17.2.118
URL : http://www.southampton.ac.uk/~crsi/Contextualolsen.pdf
The framing of decisions and the psychology of choice, Science, vol.211, issue.4481, pp.453-458, 1981. ,
DOI : 10.1126/science.7455683
Double thompson sampling for dueling bandits, Advances in Neural Information Processing Systems, pp.649-657, 2016. ,
Beat the mean bandit, Proceedings of the 28th International Conference on Machine Learning (ICML-11), pp.241-248, 2011. ,
The K-armed dueling bandits problem, Journal of Computer and System Sciences, vol.78, issue.5, pp.1538-1556, 2012. ,
DOI : 10.1016/j.jcss.2011.12.028
URL : http://www.cs.cornell.edu/People/tj/publications/yue_etal_09a.pdf
Relative upper confidence bound for the k-armed dueling bandit problem, Proceedings of the 31st International Conference on Machine Learning (ICML-14), pp.10-18, 2014. ,
Copeland dueling bandits, Advances in Neural Information Processing Systems, pp.307-315, 2015. ,
MergeRUCB, Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, WSDM '15, pp.17-26, 2015. ,
DOI : 10.1016/j.jcss.2011.12.028