//--> //--> //-->
Toggle navigation
Logout
Change account settings
EN
DE
ES
FR
A-Z
Beta
About EconBiz
News
Thesaurus (STW)
Research Skills
Help
EN
DE
ES
FR
My account
Logout
Change account settings
Login
Publications
Events
Your search terms
Search
Retain my current filters
~isPartOf:"Mathematics of operations research"
~person:"Yu, Huizhen"
~subject:"Estimation"
~subject:"Markov-Kette"
Search options
All Fields
Title
Exact title
Subject
Author
Institution
ISBN/ISSN
Published in...
Publisher
Open Access only
Advanced
Search history
My EconBiz
Favorites
Loans
Reservations
Fines
You are here:
Home
Recent results in stochastic p...
Similar by subject
Narrow search
Delete all filters
| 4 applied filters
Year of publication
From:
To:
Subject
All
Estimation
Markov-Kette
Markov chain
2
Stochastic process
2
Stochastischer Prozess
2
Borel spaces Markov decision process
1
Control theory
1
Dynamic programming
1
Dynamische Optimierung
1
Kontrolltheorie
1
Learning process
1
Lernprozess
1
Markov decision processes
1
Mathematical programming
1
Mathematische Optimierung
1
Q-learning
1
Theorie
1
Theory
1
convergence
1
discrete-time stochastic control
1
dynamic programming
1
measurability
1
policy iteration
1
reinforcement learning
1
stochastic approximation
1
total cost criteria
1
value iteration
1
more ...
less ...
Type of publication
All
Article
2
Type of publication (narrower categories)
All
Article in journal
2
Aufsatz in Zeitschrift
2
Language
All
English
2
Author
All
Yu, Huizhen
Bertsekas, Dimitri P.
2
Bhatnagar, Shalabh
2
Basu, Arnab
1
Dianetti, Jodi
1
Dieker, A. B.
1
Ferrari, Giorgio
1
Fischer, Markus
1
Gapeev, Pavel V.
1
Ghosh, Mrinal K.
1
Huang, Yu-Jui
1
Kahalé, Nabil
1
Karmakar, Prasenjit
1
Kwon, H. Dharma
1
Light, Bar
1
Ma, Will
1
Nendel, Max
1
Oliu-Barton, Miquel
1
Reed, Josh
1
Shaki, Yair
1
Vempala, Santosh S.
1
Xu, Kuang
1
Yaji, Vinayaka G.
1
Yun, Se-Young
1
Zhang, Hongzhong
1
Zhou, Zhou
1
Ziliotto, Bruno
1
more ...
less ...
Published in...
All
Mathematics of operations research
Source
All
ECONIS (ZBW)
2
Showing
1
-
2
of
2
Sort
relevance
articles prioritized
date (newest first)
date (oldest first)
1
On boundedness of Q-learning iterates for stochastic shortest path problems
Yu, Huizhen
;
Bertsekas, Dimitri P.
- In:
Mathematics of operations research
38
(
2013
)
2
,
pp. 209-227
Persistent link: https://www.econbiz.de/10009751534
Saved in:
2
A mixed value and policy iteration method for stochastic control with universally measurable policies
Yu, Huizhen
;
Bertsekas, Dimitri P.
- In:
Mathematics of operations research
40
(
2015
)
4
,
pp. 926-968
Persistent link: https://www.econbiz.de/10011409000
Saved in:
Results per page
10
25
50
100
250
A service of the
zbw
×
Loading...
//-->