Institutional Repository of the Academy of Mathematics and Systems Science, Chinese Academy of Sciences

KMS Of Academy of mathematics and systems sciences, CAS

Browse/search results: 10 items in total, showing items 1-10

Filter: Language: English

Model-Free Reinforcement Learning by Embedding an Auxiliary System for Optimal Control of Nonlinear Systems
	Journal article. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Volume: 33, Issue: 4, Pages: 1520-1534
	Authors: Xu, Zhenhui; Shen, Tielong; Cheng, Daizhan
	Views/downloads: 85/0. Submitted: 2022/06/21
	Keywords: Mathematical model; Trajectory; Heuristic algorithms; Optimal control; System dynamics; Artificial neural networks; Convergence; Approximate optimal control design; auxiliary trajectory; completely model-free; integral reinforcement learning (IRL)

Dynamic Causal Effects Evaluation in A/B Testing with a Reinforcement Learning Framework
	Journal article. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, Pages: 13
	Authors: Shi, Chengchun; Wang, Xiaoyu; Luo, Shikai; Zhu, Hongtu; Ye, Jieping; Song, Rui
	Views/downloads: 111/0. Submitted: 2022/04/29
	Keywords: A/B testing; Causal inference; Online experiment; Online updating; Reinforcement learning; Sequential testing

Laser Based Navigation in Asymmetry and Complex Environment
	Journal article. SYMMETRY-BASEL, 2022, Volume: 14, Issue: 2, Pages: 18
	Authors: Zhao, Yuchen; Xie, Keying; Liu, Qingfei; Li, Yawen; Wu, Tian
	Views/downloads: 95/0. Submitted: 2022/04/02
	Keywords: deep reinforcement learning; navigation; obstacle avoidance; unmanned-vehicle

Stimuli strategy and learning dynamics promote the wisdom of crowds
	Journal article. EUROPEAN PHYSICAL JOURNAL B, 2021, Volume: 94, Issue: 12, Pages: 8
	Authors: Li Zhenpeng; Tang Xijin
	Views/downloads: 121/0. Submitted: 2022/04/02

A Network Evolution Model of Credit Risk Contagion between Banks and Enterprises Based on Agent-Based Model
	Journal article. JOURNAL OF MATHEMATICS, 2021, Volume: 2021, Pages: 12
	Authors: Mu, Pei; Chen, Tingqiang; Pan, Kun; Liu, Meng
	Views/downloads: 103/0. Submitted: 2022/04/02

Take Bitcoin into your portfolio: a novel ensemble portfolio optimization framework for broad commodity assets
	Journal article. Financial Innovation, 2021, Volume: 7, Issue: 1
	Authors: Li, Yuze; Jiang, Shangrong; Wei, Yunjie; Wang, Shouyang
	Views/downloads: 120/0. Submitted: 2021/10/26
	Keywords: Portfolio optimization; Bitcoin; Deep learning; Reinforcement learning; Variational mode decomposition

A Novel Resilient Control Scheme for a Class of Markovian Jump Systems With Partially Unknown Information
	Journal article. IEEE TRANSACTIONS ON CYBERNETICS, 2021, Pages: 10
	Authors: Zhang, Kun; Su, Rong; Zhang, Huaguang
	Views/downloads: 110/0. Submitted: 2022/04/02
	Keywords: Games; Process control; Markov processes; Game theory; Actuators; System dynamics; Heuristic algorithms; Adaptive dynamic programming; integral reinforcement learning (IRL); resilient control; zero-sum game

Reinforcement learning behaviors in sponsored search
	Journal article. APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2016, Volume: 32, Issue: 3, Pages: 358-367
	Authors: Chen, Wei; Liu, Tie-Yan; Yang, Xinxin
	Views/downloads: 118/0. Submitted: 2018/07/30
	Keywords: advertiser behavior; sponsored search; generalized second-price auction; locally envy-free equilibrium

Quantum reinforcement learning
	Journal article. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, Volume: 38, Issue: 5, Pages: 1207-1220
	Authors: Dong, Daoyi; Chen, Chunlin; Li, Hanxiong; Tarn, Tzyh-Jong
	Views/downloads: 103/0. Submitted: 2018/07/30
	Keywords: collapse; Grover iteration; probability amplitude; quantum reinforcement learning (QRL); state superposition

Incoherent control of quantum systems with wavefunction-controllable subspaces via quantum reinforcement learning
	Journal article. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, Volume: 38, Issue: 4, Pages: 957-962
	Authors: Dong, Daoyi; Chen, Chunlin; Tarn, Tzyh-Jong; Pechen, Alexander; Rabitz, Herschel
	Views/downloads: 117/0. Submitted: 2018/07/30
	Keywords: incoherent control; quantum reinforcement learning (QRL); wavefunction controllability; wavefunction-controllable subspace

Items: 24431
Full texts: 9
Visits: 1735528
Downloads: 1199

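Copyright @2018 - 2024 Academy of Mathematics and Systems Science, Chinese Academy of Sciences - Powered by CSpace

Address and postal code: No. 55 Zhongguancun East Road, Haidian District, Beijing (100190)
Tel: 86-10-82541777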