Experimenters have to make theoretically irrelevant decisions concerning user interfaces and ordering or labeling of options. Such presentation decisions affect behavior and cause results to appear contradictory across experiments, obstructing utility estimation and policy recommendations. The...