Similar Search Results

Efficient Counterfactual Learning from Bandit Feedback

Narita, Yusuke - 2018

What is the most statistically eﬀicient way to do oﬀ-policy optimization with batch data from bandit feedback? For log data generated by contextual bandit algorithms, we consider oﬀline estimators for the expected reward from a counterfactual policy. Our estimators are shown to have the...

Persistent link: https://www.econbiz.de/10012906605

Efficient Counterfactual Learning From Bandit Feedback

Narita, Yusuke - 2018

What is the most statistically efficient way to do off-policy optimization with batch data from bandit feedback? For log data generated by contextual bandit algorithms, we consider offline estimators for the expected reward from a counterfactual policy. Our estimators are shown to have lowest...

Persistent link: https://www.econbiz.de/10012907150

Algorithm is experiment : machine learning, market design, and policy eligibility rules

Narita, Yusuke; Yata, Kohei - 2021

Persistent link: https://www.econbiz.de/10014394217

Algorithm is experiment: machine learning, market design, and policy eligibility rules

Narita, Yusuke; Yata, Kohei - 2021

Persistent link: https://www.econbiz.de/10012515858

Off-policy evaluation with general logging policies : implementation at Mercari

Narita, Yusuke; Okumura, Kyohei; Shimizu, Akihiro; … - 2022

Persistent link: https://www.econbiz.de/10014435186

Algorithm is experiment : machine learning, market design, and policy eligibility rules

Narita, Yusuke; Yata, Kohei - 2022

Persistent link: https://www.econbiz.de/10013387729

Algorithm is experiment : machine learning, market design, and policy eligibility rules

Narita, Yusuke; Yata, Kohei - 2022

Persistent link: https://www.econbiz.de/10013393675

Algorithm as experiment: machine learning, market design, and policy eligibility rules

Narita, Yusuke; Yata, Kohei - 2024

Persistent link: https://www.econbiz.de/10014539002

When to Target Customers? Retention Management using Dynamic Off-Policy Policy Learning

Ko, Ryuya; Uetake, Kosuke; Yata, Kohei; Okada, Ryosuke - 2022

We examine how to learn personalized customer retention strategies when customers' intentions to purchase evolve over time. Working with a Japanese online platform, we first implement a large-scale randomized experiment, in which coupons are randomly sent to first-time buyers at different times....

Persistent link: https://www.econbiz.de/10014235545

Hearing the voice of the future : Trump vs Clinton

Narita, Yusuke - 2019

Persistent link: https://www.econbiz.de/10012134518