Chen, Xi; Lin, Qihang; Zhou, Dengyong - 2015
consider categorical labeling tasks and formulate the budget allocation problem as a Bayesian Markov decision process (MDP …), which simultaneously conducts learning and decision making. Using the dynamic programming (DP) recurrence, one can obtain …