Aurélie C. Lozano, Naoki Abe, et al.
KDD 2009
We present an algorithm, called the Offset Tree, for learning to make decisions in situations where the payof of only one choice is observed, rather than all choices. The algorithm reduces this setting to binary classification, allowing one toreuse any existing, fully supervised binary classification algorithm in this partial information setting. We show that the Offset Tree is an optimal reduction to binary classification. In particular, it has regret at most (k - 1) times the regret of the binary classifier it uses (where k is the number of choices), and no reduction to binary classification can do better. This reduction is also computationally optimal, both at training and test time, requiring just O(log2 k) work to train on an example or make a prediction. Experiments with the Offset Tree show that it generally performs better than several alternative approaches. Copyright 2009 ACM.
Aurélie C. Lozano, Naoki Abe, et al.
KDD 2009
Alina Beygelzimer, Geoffrey Grinstein, et al.
Physica A: Statistical Mechanics and its Applications
Alina Beygelzimer, John Langford, et al.
aaai 2005
Jie Tang, Jimeng Sun, et al.
KDD 2009