CSpace

浏览/检索结果: 共10条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Model-Free Reinforcement Learning by Embedding an Auxiliary System for Optimal Control of Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 4, 页码: 1520-1534
作者:  Xu, Zhenhui;  Shen, Tielong;  Cheng, Daizhan
收藏  |  浏览/下载:85/0  |  提交时间:2022/06/21
Mathematical model  Trajectory  Heuristic algorithms  Optimal control  System dynamics  Artificial neural networks  Convergence  Approximate optimal control design  auxiliary trajectory  completely model-free  integral reinforcement learning (IRL)  
Dynamic Causal Effects Evaluation in A/B Testing with a Reinforcement Learning Framework 期刊论文
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 页码: 13
作者:  Shi, Chengchun;  Wang, Xiaoyu;  Luo, Shikai;  Zhu, Hongtu;  Ye, Jieping;  Song, Rui
收藏  |  浏览/下载:111/0  |  提交时间:2022/04/29
A/B testing  Causal inference  Online experiment  Online updating  Reinforcement learning  Sequential testing  
Laser Based Navigation in Asymmetry and Complex Environment 期刊论文
SYMMETRY-BASEL, 2022, 卷号: 14, 期号: 2, 页码: 18
作者:  Zhao, Yuchen;  Xie, Keying;  Liu, Qingfei;  Li, Yawen;  Wu, Tian
收藏  |  浏览/下载:95/0  |  提交时间:2022/04/02
deep reinforcement learning  navigation  obstacle avoidance  unmaned-vehicle  
Stimuli strategy and learning dynamics promote the wisdom of crowds 期刊论文
EUROPEAN PHYSICAL JOURNAL B, 2021, 卷号: 94, 期号: 12, 页码: 8
作者:  Li Zhenpeng;  Tang Xijin
收藏  |  浏览/下载:121/0  |  提交时间:2022/04/02
A Network Evolution Model of Credit Risk Contagion between Banks and Enterprises Based on Agent-Based Model 期刊论文
JOURNAL OF MATHEMATICS, 2021, 卷号: 2021, 页码: 12
作者:  Mu, Pei;  Chen, Tingqiang;  Pan, Kun;  Liu, Meng
收藏  |  浏览/下载:103/0  |  提交时间:2022/04/02
Take Bitcoin into your portfolio: a novel ensemble portfolio optimization framework for broad commodity assets 期刊论文
Financial Innovation, 2021, 卷号: 7, 期号: 1
作者:  Li,Yuze;  Jiang,Shangrong;  Wei,Yunjie;  Wang,Shouyang
收藏  |  浏览/下载:120/0  |  提交时间:2021/10/26
Portfolio optimization  Bitcoin  Deep learning  Reinforcement learning  Variational mode decomposition  
A Novel Resilient Control Scheme for a Class of Markovian Jump Systems With Partially Unknown Information 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 页码: 10
作者:  Zhang, Kun;  Su, Rong;  Zhang, Huaguang
收藏  |  浏览/下载:110/0  |  提交时间:2022/04/02
Games  Process control  Markov processes  Game theory  Actuators  System dynamics  Heuristic algorithms  Adaptive dynamic programming  integral reinforcement learning (IRL)  resilient control  zero-sum game  
Reinforcement learning behaviors in sponsored search 期刊论文
APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2016, 卷号: 32, 期号: 3, 页码: 358-367
作者:  Chen, Wei;  Liu, Tie-Yan;  Yang, Xinxin
收藏  |  浏览/下载:118/0  |  提交时间:2018/07/30
advertiser behavior  sponsored search  generalized second-price auction  locally envy-free equilibrium  
Quantum reinforcement learning 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 卷号: 38, 期号: 5, 页码: 1207-1220
作者:  Dong, Daoyi;  Chen, Chunlin;  Li, Hanxiong;  Tarn, Tzyh-Jong
收藏  |  浏览/下载:103/0  |  提交时间:2018/07/30
collapse  Grover iteration  probability amplitude  quantum reinforcement learning (QRL)  state superposition  
Incoherent control of quantum systems with wavefunction-controllable subspaces via quantum reinforcement learning 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 卷号: 38, 期号: 4, 页码: 957-962
作者:  Dong, Daoyi;  Chen, Chunlin;  Tarn, Tzyh-Jong;  Pechen, Alexander;  Rabitz, Herschel
收藏  |  浏览/下载:117/0  |  提交时间:2018/07/30
incoherent control  quantum reinforcement learning (QRL)  wavefunction controllability  wavefunction-controllable subspace