Online Learning and Pricing for Service Systems with Reusable Resources
We consider a price-based revenue management problem with finite reusable resources over a finite time horizon $T$. Customers arrive following a price-dependent Poisson process and each customer requests one unit of $c$ homogeneous reusable resources. If there is an available unit, the customer gets served within a price-dependent exponentially distributed service time; otherwise, the customer waits in a queue until the next available unit. In this paper, we assume that the firm does not know how the arrival and service rates depend on posted prices, and thus it makes adaptive pricing decisions in each period based only on past observations to maximize the cumulative revenue. Given a discrete price set with cardinality $P$, we propose two online learning algorithms, termed Batch Upper Confidence Bound (BUCB) and Batch Thompson Sampling (BTS), and prove that the cumulative regret upper bound is $\tilde{O}(\sqrt{PT})$, which matches the regret lower bound. In establishing the regret, we bound the transient system performance upon price changes via a novel coupling argument, and also generalize bandits to accommodate sub-exponential rewards. We also extend our approach to models with balking and reneging customers, and discuss a continuous price setting. Our numerical experiments demonstrate the efficacy of the proposed BUCB and BTS algorithms
Year of publication: |
2022
|
---|---|
Authors: | Jia, Huiwen ; Shi, Cong ; Shen, Siqian |
Publisher: |
[S.l.] : SSRN |
Saved in:
freely available
Extent: | 1 Online-Ressource (69 p) |
---|---|
Type of publication: | Book / Working Paper |
Language: | English |
Notes: | Nach Informationen von SSRN wurde die ursprüngliche Fassung des Dokuments December 15, 2020 erstellt |
Other identifiers: | 10.2139/ssrn.3755902 [DOI] |
Source: | ECONIS - Online Catalogue of the ZBW |
Persistent link: https://www.econbiz.de/10014089846
Saved in favorites
Similar items by person
-
Multi-armed bandit with sub-exponential rewards
Jia, Huiwen, (2021)
-
Online learning and pricing for service systems with reusable resources
Jia, Huiwen, (2024)
-
Online Learning and Pricing for Network Revenue Management with Reusable Resources
Jia, Huiwen, (2022)
- More ...