Design Optimization of Direct Heuristic Dynamic Programming Based on Hybrid Estimation of Distributi

来源 :2015全国理论计算机科学学术年会 | 被引量 : 0次 | 上传用户:whqqqqqqq
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  As an important class of approximate dynamic programming, the direct heuristic dynamic programming (DHDP) is discussed in this paper.DHDP performs well due to its model-free online learning capability.While the classical DHDP is implemented with gradient-based adaptation learning algorithm of neural network, in this paper we present a design strategy of DHDP with a novel hybrid estimation of distribution algorithm for online learning and control, and the proposed design optimization method achieves the weight training of neural networks with faster convergence rate.Our proposed approach can be viewed as an improvement for DHDP.The simulation is conducted on a practical system plant to test the online learning performance by using our approach.Then, the simulation results show the effectiveness of our approach.
其他文献
The individual household electricity consumption is major part of the city in the electricity market.The accurate prediction of household power load is very important for power sector to reasonable de
Herd behavior is a phenomenon that often appears in the stock market.It is caused by the irrational imitation of investors and is expressed as major investors make similar investing decisions in a sho
In this paper, we consider the problem of scheduling jobs with release dates and rejection on a bounded single parallel batching machine.Our objective is to minimize the sum of total completion time o
Conditional probability neural network (CPNN) has special advantage in pattern classification problems.However, how to find the optimal parameters of the CPNN to achieve better performance is an extra
The air pollution in Lanzhou city has caused wide public concern over the recent years.Among the factors leading to air pollution in lanzhou city, high PM10 concentration is an important one.Thus, pre
In CFD simulations, the smaller the cell size is, the more accurate the result is.However, a smaller cell size in all simulation regions means much more cells which in turn increased the consumption o
In virtualized and dynamical cloud computing environment, all resources such as infrastructure, hardware,platform, software and data can be virtualized and partitioned into some kinds of resouces pool
This paper presents an integrated method for ligaturing simulation of blood vessel in Virtual Simulation Training System of Liver Surgery.The integrated method mainly includes four aspects: simulation
This paper explores the Deep Belief Networks (DBNs) in the application of high-speed train vibration signals processing.Firstly, a new method based on DBNs is proposed.The vibration signals are prepro
A regular (3, 4)-CNF formula F is a 3-CNF formula, where each variable occurs exactly four times in F.A regular (3, 4, u)-CNF formula F is a regular (3, 4)-CNF formula, where each variable occurs u ti