Online action learning in high dimensions: A new exploration rule for contextual εt-greedy heuristics
Year of publication: | 2020 |
---|---|
Authors: | Flores, Claudio C. ; Medeiros, Marcelo C. |
Publisher: | Rio de Janeiro : Pontifícia Universidade Católica do Rio de Janeiro (PUC-Rio), Departamento de Economia |
Subject: | Bandit | sequential treatment | high dimensions | LASSO | regret |
Series: | Texto para discussão ; 674 |
---|---|
Type of publication: | Book / Working Paper |
Type of publication (narrower categories): | Working Paper |
Language: | English |
Other identifiers: | 1734830778 [GVK] hdl:10419/249722 [Handle] RePEc:rio:texdis:674 [RePEc] |
Source: |
- Flores, Claudio C., (2020)
- Zbonakova, Lenka, (2016)
- Stable graphical model estimation with Random Forests for discrete, continuous, and mixed variables
  Fellinghauer, Bernd, (2013)
- Flores, Claudio C., (2020)
- Terasvirta, Timo, (2005)
- Audrino, Francesco, (2011)