CSpace
Quantum reinforcement learning
Dong, Daoyi1; Chen, Chunlin2; Li, Hanxiong3,4; Tarn, Tzyh-Jong5
2008-10-01
发表期刊IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS
ISSN1083-4419
卷号38期号:5页码:1207-1220
摘要The key approaches for machine learning, particularly learning in unknown probabilistic environments, are new representations and computation mechanisms. In this paper, a novel quantum reinforcement learning (QRL) method is proposed by combining quantum theory and reinforcement learning (RL). inspired by the state superposition principle and quantum parallelism, a framework of a value-updating algorithm is introduced. The state (action) in traditional RL is identified as the eigen state (eigen action) in QRL. The state (action) set can be represented with a quantum superposition state, and the eigen state (eigen action) can be obtained by randomly observing the simulated quantum state according to the collapse postulate of quantum measurement. The probability of the eigen action is determined by the probability amplitude, which is updated in parallel according to rewards. Some related characteristics of QRL such as convergence, optimality, and balancing between exploration and exploitation are also analyzed, which shows that this approach makes a good tradeoff between exploration and exploitation using the probability amplitude and can speedup learning through the quantum parallelism. To evaluate the performance and practicability of QRL, several simulated experiments are given, and the results demonstrate the effectiveness and superiority of the QRL algorithm for some complex problems. This paper is also an effective exploration on the application of quantum computation to artificial intelligence.
关键词collapse Grover iteration probability amplitude quantum reinforcement learning (QRL) state superposition
DOI10.1109/TSMCB.2008.925743
语种英语
资助项目National Natural Science Foundation of China[60703083] ; K. C. Wong Education Foundation (Hong Kong) ; China Postdoctoral Science Foundation[20060400515] ; RGC of Hong Kong[CityU:116406]
WOS研究方向Automation & Control Systems ; Computer Science
WOS类目Automation & Control Systems ; Computer Science, Artificial Intelligence ; Computer Science, Cybernetics
WOS记录号WOS:000259191900003
出版者IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
引用统计
文献类型期刊论文
条目标识符http://ir.amss.ac.cn/handle/2S8OKBNM/5603
专题中国科学院数学与系统科学研究院
通讯作者Dong, Daoyi
作者单位1.Chinese Acad Sci, Key Lab Syst & Control, Inst Syst Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R China
2.Nanjing Univ, Dept Control & Syst Engn, Nanjing 210093, Peoples R China
3.Cent S Univ, Changsha 410083, Peoples R China
4.City Univ Hong Kong, Dept Mfg Engn & Engn Management, Kowloon, Hong Kong, Peoples R China
5.Washington Univ, Dept Elect & Syst Engn, St Louis, MO 63130 USA
推荐引用方式
GB/T 7714
Dong, Daoyi,Chen, Chunlin,Li, Hanxiong,et al. Quantum reinforcement learning[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS,2008,38(5):1207-1220.
APA Dong, Daoyi,Chen, Chunlin,Li, Hanxiong,&Tarn, Tzyh-Jong.(2008).Quantum reinforcement learning.IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS,38(5),1207-1220.
MLA Dong, Daoyi,et al."Quantum reinforcement learning".IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS 38.5(2008):1207-1220.
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Dong, Daoyi]的文章
[Chen, Chunlin]的文章
[Li, Hanxiong]的文章
百度学术
百度学术中相似的文章
[Dong, Daoyi]的文章
[Chen, Chunlin]的文章
[Li, Hanxiong]的文章
必应学术
必应学术中相似的文章
[Dong, Daoyi]的文章
[Chen, Chunlin]的文章
[Li, Hanxiong]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。