CSpace
Notes on average Markov decision processes with a minimum-variance criterion
Liu, JY
2002-04-01
发表期刊OPERATIONS RESEARCH LETTERS
ISSN0167-6377
卷号30期号:2页码:107-116
摘要In this paper we consider the optimization problem of the variance of the reward for the nonstationary average Markov decision processes (MDP, for short). Examples in this paper show that there are mistakes in proofs of main theorems in two papers, Kurano [(J. Math, Anal. Appl. 123 (1987) 572)] and Guo [(Math. Meth. Oper. Res. 49 (1999) 87-96)] which investigated the optimization problems of the variance of the sun of costs for the average MDP. We propose a variance criterion which is different from that investigated by Kurano (1987) and Guo (1999) and we prove that there exists a Markov policy which is epsilon-strong variance optimal policy for any epsilon > 0 under some appropriate conditions. (C) 2002 Elsevier Science B.V. All tights reserved.
关键词Markov decision processes nonstationary MDP average criterion variance criterion strong variance optimal policy
语种英语
WOS研究方向Operations Research & Management Science
WOS类目Operations Research & Management Science
WOS记录号WOS:000176621900007
出版者ELSEVIER SCIENCE BV
引用统计
文献类型期刊论文
条目标识符http://ir.amss.ac.cn/handle/2S8OKBNM/17415
专题中国科学院数学与系统科学研究院
通讯作者Liu, JY
作者单位Acad Sinica, Inst Appl Math, Beijing 100080, Peoples R China
推荐引用方式
GB/T 7714
Liu, JY. Notes on average Markov decision processes with a minimum-variance criterion[J]. OPERATIONS RESEARCH LETTERS,2002,30(2):107-116.
APA Liu, JY.(2002).Notes on average Markov decision processes with a minimum-variance criterion.OPERATIONS RESEARCH LETTERS,30(2),107-116.
MLA Liu, JY."Notes on average Markov decision processes with a minimum-variance criterion".OPERATIONS RESEARCH LETTERS 30.2(2002):107-116.
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Liu, JY]的文章
百度学术
百度学术中相似的文章
[Liu, JY]的文章
必应学术
必应学术中相似的文章
[Liu, JY]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。