A robust self-learning PID control system design for nonlinear systems using a particle optimization

合集下载

Robust Control and Estimation

Robust Control and Estimation Robust control and estimation are crucial components in the field of engineering and technology. They are used to ensure that systems can operate effectively and reliably, even in the presence of uncertainties and disturbances. The goal of robust control and estimation is to design systems that can adapt to changing conditions and maintain stable performance. This is particularlyimportant in industries such as aerospace, automotive, and manufacturing, wherethe performance and safety of systems are of utmost importance. One of the key challenges in robust control and estimation is dealing with uncertainties in the system. These uncertainties can arise from various sources, such as variations in operating conditions, environmental disturbances, or modeling errors. Designing controllers and estimators that can effectively handle these uncertainties is a complex task that requires a deep understanding of system dynamics and control theory. Engineers and researchers in this field often use mathematical models and simulation tools to analyze the behavior of systems and develop robust control and estimation strategies. In addition to uncertainties, another important aspect of robust control and estimation is the ability to handle disturbances. Disturbances can have a significant impact on the performance of a system, and robust control and estimation techniques aim to minimize the effects of disturbances and maintain stable operation. This often involves the use of feedback control strategies, such as proportional-integral-derivative (PID) control, as well as state estimation techniques, such as Kalman filtering, to accurately estimate the state of the system and adjust the control inputs accordingly. From an engineering perspective, robust control and estimation play a critical role in ensuring the safety and reliability of systems. For example, in the aerospace industry, robust control and estimation techniques are used to design autopilot systems that can maintainstable flight even in the presence of turbulence or other disturbances. In the automotive industry, these techniques are used to develop advanced driver assistance systems (ADAS) that can effectively control the vehicle and assist the driver in various driving conditions. In manufacturing, robust control and estimation are used to optimize the performance of industrial processes and ensure consistent product quality. From a research perspective, robust control andestimation are active areas of study with ongoing advancements and developments. Researchers are constantly exploring new control and estimation techniques, such as adaptive control, robust optimization, and machine learning-based approaches, to improve the performance and robustness of systems. They are also investigating the integration of robust control and estimation with other emerging technologies, such as cyber-physical systems and Internet of Things (IoT), to create more intelligent and adaptive systems. In conclusion, robust control and estimation are essential components in the design and operation of complex engineering systems. They are critical for ensuring the stability, performance, and safety of systems in the presence of uncertainties and disturbances. From an engineering and research perspective, robust control and estimation continue to be active areas of study with significant implications for various industries and technological advancements. As technology continues to evolve, the importance of robust control and estimation will only continue to grow, making it an exciting and challenging field for engineers and researchers alike.。

Neural PID Control of Robot Manipulators With

Index Terms—Exoskeleton, neural networks, PID, robot.
I. INTRODUCTION
P ROPORTIONAL–integral-derivative (PID) control is widely used in industrial robot manipulators [1]. In the absence of robot knowledge, a PID controller may be the best controller because it is model free and its parameters can be adjusted easily and separately [2]. However, an integrator in a PID controller reduces the bandwidth of the closed-loop system. In order to remove steady-state error caused by uncertainties and noise, the integrator gain has to be increased. This leads to worse transient performance and even destroys the stability. Therefore, many robot manipulators use pure proportional–derivative (PD) control or PD control with a small integral gain [3].
W. Yu is with the Departamento de Control Automatico, CINVESTAV-IPN, Mexico City 07360, Mexico (e-mail: yuw@ctrl.cinvestav.mx).

一种基于人类学习认知过程的PID控制方法

DOI: 10.11991/yykj.201811006网络出版地址：/kems/detail/23.1191.U.20181201.006.html一种基于人类学习认知过程的PID 控制方法李继广1,3，董彦非1，屈高敏1，杨雷恒2，易俊杰11. 西安航空学院飞行器学院，陕西西安 7100772. 西安航空职业技术学院通用航空学院，陕西西安 7100893. 西安市无人机应用创新基地，陕西西安 710071摘要：针对传统PID 方法对复杂系统非线性问题控制能力不足缺点，提出了一种基于人类学习认识模型的智能PID 控制方法。

首先建立了人类不同年龄阶段学习认识过程的数学模型，并应用该模型设计了一种可以在线自主调参的智能PID 控制器。

该控制器不仅具有自学习、自调整的能力，还克服了大多数智能方法计算迭代复杂、没有数学解析模型的缺点。

仿真结果表明本文设计的控制器是有效的。

关键词：智能控制；人类学习模型；控制方法；智能PID 控制器；自适应方法；参数整定；复杂系统；非线性中图分类号：TJ391; TJ761.1 文献标志码：A 文章编号：1009−671X(2019)02−0075−05An PID control method based on human learning cognitive processLI Jiguang 1,3, DONG Yanfei 1, QU Gaomin 1, YANG Leiheng 2, YI Junjie 11. School of Aerocraft, Xi'an Aeronautical University, Xi'an 710077, China2. School of General Aviation, Xi'an Aerotechnical Polytechnic College, Xi'an 710089, China3. Xi'an UAV Application Innovation Base, Xi'an 710071, ChinaAbstract : In order to overcome the shortcomings of traditional PID methods in controlling nonlinear problem of complex systems, an intelligent PID control method based on human learning cognitive model is proposed in this paper.Firstly, the mathematical model of learning process in different age stages of the mankind was established, and an intelligent PID controller with on-line self-tuning parameters was designed. The controller not only has the ability of self-learning and self-adjusting, but also overcomes the shortcomings of most intelligent methods, such as complicated iteration and not having mathematical analytical model. The simulation results show that the controller is effective.Keywords: intelligent control; human learning model; control method; intelligent PID controller; adaptive method;parameter tuning; complex system; nonlinear在控制领域，经典的PID 控制方法以其明确的物理概念、对被控对象模型的高度适应性、简单的调参过程等优点，占据着工业控制的绝大部分领域。

电气电子毕业设计365模糊自适应控制研究

摘要PID控制由于其算法简单、鲁棒性好及可靠性高，被广泛应用于过程控制和运动控制中，尤其适用于可建立精确数学模型的确定性系统。

然而实际工业生产过程往往具有非线性、时变不确定性，难以建立精确的数学模型，应用常规PID控制器不能达到理想的控制效果。

为了克服传统PID控制的弱点,控制界已经提出了大量的对PID控制的改进方案。

但这些方案一般是针对某些具体问题，缺乏通用性，附加的结构或算法也增加了控制器的复杂性，使它们的广泛应用受到限制将模糊决策理论与常规PID控制技术结合，研制出模糊自适应PID 控制器，并利用VB的ActiveX技术创建了模糊自适应PID控件，将该空间嵌入组态软件中进行实时控制，结果表明该控制策略具有较强的鲁棒性和适应性。

针对实际工业过程控制的难点，借鉴生物免疫系统中的免疫反馈原理，结合模糊控制可以逼近非线性函数的特点，分析了积分系数在系统响应过程中的非线性变化规律，提出了一种模糊免疫非线性PID控制方法。

这种方法具有量小，调整时间短，抗干扰能力和鲁棒性强等优点，理论分析和仿真研究证明了该方法的可行性和有效性。

关键词：自适应，模糊控制，PID控制，模糊免疫非线性PID控制Adaptive Fuzzy Control ResearchABSTRACTPID control is widely used in motion control and process control because of its simple algorithm, good robust and reliability, particularly applicable to the certain systems of establish a precise mathematical model. Nevertheless, the actual production process are often nonlinear, uncertain time-varying, it is difficult to establish a precise mathematical model, the conventional PID control can not achieve the desired effect.To overcome the weaknesses of traditional PID control, a lot of improve projects of the PID control have been put forward in the control industry. However, these projects are generally targeted at certain specific issues, the lack of universality. Additional structural or algorithm increased the complexity of the controller,restrict their wide use.This paper puts forward a technology which combinate the fuzzy decision theory and conventional PID control, researches and makes out a adaptive fuzzy PID controller (AFPID), by use of ActiveX of VB create a ActiveXof AFPID, embeds this ActiveX into configuration software to real-time control, the result shows this control tactic has stronger robust and adaptivity.A fuzzy immune nonlinear PID control method is presented in order to solve the difficulties of actual industry process control. This method is based on the immune feedback principle in the biological immune system, the approaching ability for nonlinear function of fuzzy controller and the nonlinear change law of the integral gain in the response process of the system. This method has low overshoot, short regulate time, strong anti-disturbance ability and great robustness. The theoretical analysis and simulation results show the feasibility and effectiveness of this method.Key words:self-adaption，fuzzy control，PID control，fuzzy immune nonlinear PID control模糊自适应控制研究0 引言自从L.A.Zadeh提出模糊集合论以来，基于该理论形成一门新的模糊系统理论学科，在控制、信号处理、模式识别、通信等领域得到了广泛的应用。

单神经元PID控制器设计

单神经元PID控制器设计摘要常规PID控制器具有结构简单、易于实现、鲁棒性强等优点，但实际的生产过程中，控制对象一般都具有延迟大、非线性强、干扰大的特点。

因此当工况改变时，对象的动态特性也发生改变，控制品质就会随之下降，所以采用常规PID控制器很难获得令人满意的控制效果。

神经网络具有强鲁棒性、容错性、并行处理、自学习、逼近非线性关系等特点，在解决非线性和不确定系统控制方面有很大潜力，近年来已广泛应用于工业过程控制领域。

由于单神经元模型具有自适应和自学习的能力，使得它可以作为一种很好的方法而得以应用，因此本文将单神经元模型与常规PID控制器相结合，形成了具有自适应能力的单神经元PID控制器。

本文讨论了单神经元自适应PID控制器和多变量单神经元PID控制器的结构，控制算法，并MATLAB仿真软件给出了实例仿真。

MATLAB仿真结果表明，该控制系统既保持了常规PID控制的优点，又有自学习的智能特性，因而具有良好的控制品质和较强的自适应能力。

关键词：PID控制器；数学模型；自适应控制；单神经元；MA TLAB仿真；多变量The Design of Single Neuron PID ControllerAbstractConventional PID controller is simple in structure, easy to implement, robust and other advantages. However, in the actual production process control targets have delayed the general, non-linear strong, and the heavy characteristics of the disturbance, so when the situation changes, the object of dynamic change, quality control will be declined. Therefore, the conventional PID control method is difficult to obtain satisfactory performance.Neural network has stronger robust, fault-tolerant, parallel processing, self-learning, approaching the characteristics of non-linear relationship, and uncertainty in solving nonlinear control system there is great potential, in recent years has been widely used in controlled areas. In single-neuron model is self-adaptive and self-learning ability , it can be regarded as an effective intelligent way for application ,so this article will be use single-neuron model with the conventional PID controller combine to form the adaptive capacity of a single-neuron PID controller. This paper discusses the structure and the control algorithm of the single neuron adaptive PID controller and the variable single neuron PID controller,besides,giving the simulation examples by MATLAB simulation software.The MATLAB simulation results indicated that the control system both maintained the conventional cascade PID control merit, and has from the self-learning intelligent characteristic, thus has the good control quality and the strong auto-adapted ability.Key words:PID controller；Parameters of the model；Adaptive control；Single-neuron；MATLAB simulation；multivariable目录第1章引言 (1)1.1单神经元PID控制的产生背景 (1)1.2单神经元PID控制研究现状 (1)1.3本论文的主要内容 (2)第2章神经网络理论基础 (3)2.1单神经元模型 (3)2.2神经网络的拓扑结构 (5)2.3神经网络的学习方法 (6)2.3.1神经网络学习方式 (6)2.3.2神经网络学习规则 (7)2.4感知器 (9)第3章单神经元自适应PID控制器的设计及仿真实现 (12)3.1传统PID控制 (12)3.2单神经元自适应PID控制器 (13)3.2.1自适应控制系统 (13)3.2.2控制结构 (16)3.2.3控制算法 (18)3.2.4仿真程序及分析 (19)3.3采用二次型性能指标的的单神经元自适应PID控制器 (27)第4章多变量单神经元PID控制器的设计及实现 (29)4.1二变量控制系统框图 (29)4.2控制算法 (29)4.3SIMULINK仿真 (30)第5章结论 (32)参考文献 (33)谢辞 (34)第1章引言1.1 单神经元PID控制的产生背景PID控制具有结构简单、稳定性能好、可靠性高等优点，尤其适用于不能建立精确数学模型的不确定性控制系统。

液压伺服系统智能PID控制

ｓｙｓｔｅｍｍｕｓｔｂｅａｄａｐｔｉｖｅｔｏｈａｒｄｃｉｒｃｕｍｓｔａｎｃｅｓ．Ｔｈｅｖａｒｉａｔｉｏｎｏｆｌｏａｄｈａｓａｗｉｄｅｒａｎｇｅ，ａｎｄ
ａｌｓｏｗｉｔｈｖｅｒｙｓｔｒｏｎｇｉｎｔｅｒｆｅｒｅｎｃｅｍｏｍｅｎｔ．Ａｌｌｔｈｅｓｅｍａｋｅｔｈｅｐａｒａｍｅｔｅｒｓｏｆｓｙｓｔｅｍ
随着我国现代化建设的向前推动，随动系统在我国工农业生产、国防与科学技术各个部门越来越得到广泛的运用。广泛采用随动系统，既节省人力，又提高效率和工作质量。
液压传动与控制是以液体（油、高水基液压油、合成液体）作为介质来实现各种机械量（力、位移或速度等）的传递。
液压传动与单纯的机械传动、电气传动和气压传动相比，具有传递功率大、结构紧凑、体积小重量轻等特点．因而被广泛运用于各种机械设备及精密的自动控制系统中。
首先，改进了ＰＸ．８电液伺服系统的硬件。主要工作是选用性能更加优异的信号反馈元器件和电子元器件，重新设计了伺服放大器。伺服放大器的主要功能是将计算机的控制信号按照系统需要的丌坏增益放大，具有足够的能力推动执行机构运行，还能完成速度、加速度、角度位置反馈信号的检测和调整。同时，还设计完成了一些辅
本文改进了Ｐｘ一８电液伺服系统，对伺服放大器进行了重新设计。考虑到系统在工作中，常规ＰＩＤ控制方法难于获得始终良好的控制效果，本文尝试了一种基于遗传算法和神经网络的智能ＰＩＤ控制方法，并进行了系统仿真和实验台实验研究。
仿真和实验结果表明，本文所提出的智能控制方法具有很好的自适应性和鲁棒性，可以有效的抑制负载变化和外界干扰对系统的不利影响，具有较好的控制效果。
在控制理论方面，伺服系统的智能控制理论系统是一门跨学科、需要多学科提供基础支持的技术科学，因而智能控制系统必然是一个综合集成智能系统。当前，国内外智能控制技术研究领域主要分为以下几类：

堡克特流程控制器Type8693说明书

The compact process controller Type 8693is optimized for integrated mounting on thepneumatic actuators in the process valveseries Type 23xx/2103 and is specially de-signed for the requirements of a hygienicprocess environment. The actual value ofthe process factor is directly supplied to thedevice as 4-20 mA, PT100 or a frequencysignal. The process controller calculatesthe setpoint for the subordinated positionerthrough the variance comparison. Due tothe analogue feedback all analogue valueson the controlling level can be transferred.With integrated diagnostic functions opera-tion conditions of the control valve can bemonitored. Through status signals, valvediagnostic messages are transmitted ac-cording to NAMUR NE107 and recorded ashistory entries. The parameterization of pro-cess controller and positioner can be carriedout automatically.The easy handling and theselection of additional software functionsare done either on a big graphic display withbacklight and keypad or over COMMUNICA-TOR. The positioner registers the valve posi-tion without deterioration through a contact-free, analog position sensor. The control ofsingle or double-acting actuators is donewithout internal air consumption. Com-munication interfaces such as PROFIBUSDP-V1, DeviceNet, EtherNet/IP, PROFINET,Modbus TCP, büS (based on CANopen) andanalogue as well as binary feedback canalso be chosen.Digital electropneumatic processcontroller for the integrated mount-ing on process control valvesGlobe control valveType 2301Angle-seat controlvalveType 2300Diaphragm controlvalveType 2103ORP meterType 8202FlowmeterType 8045Technical dataMaterial BodyCoverSealingPPS, stainless steelPCEPDMPower supply 24 V DC ±process valveControl valve systemELEMENTcontrol valve system Type 8802-GD-J 2301 + 8693ELEMENTcontrol valve system Type 8802-DF-J 2103 + 8693ELEMENTcontrol valve system Type 8802-YG-J2300 + 8693Globe control valveType 2301Diaphragm control valve Type 2103control valve Third party hygienic process valvesOrdering information for ELEMENT TopControl control valve systemsA TopControl control valve system consists of a process controller Type 8693 and an ELEMENT control valve Type 23xx/2103.The following information is necessary for the selection of a complete control valve:• Article no. of the desired TopControl process controller Type 8693 ( see ordering chart on p. 3)• Article no. of the selected control valve Type 23xx/2103 (see separate datasheets, e.g. 2300, 2301 or 2103)You order two components and receive a complete assembled and certified valve.When you click on the orange box "More info." below, you will come to our website for the resp. product where you can download the Technical data, continuedOrdering chart Type 8693 (other versions on request)AdditionalEtherNet/IP, PROFINET, Modbus TCP and büS (Bürkert System Bus): double-acting versions with low air capacity1)see additional software functions parametrisable diagnostic functions / binary outputs on page 13Note: Standard versions are UL approved.Ordering chart accessoriesOrdering chart adapter kit(has to be ordered separately)For installation kits to 3rd party process valves please see datasheet datasheet Type KK01 adapter kits for hygienic process valves or contact your sales office for related drawings or individual engineering support.Dimensions [mm]Version connection cable glandsMaterials1Cover PC2Body casing Stainless steel 3BASIC body PPS4Plug M12 Stainless steel 5ScrewsStainless steel 6Push-in connector Threaded ports G ⅛POM/stainless steel Stainless steel 7SealingEPDMDimensions [mm]Mounting on control valves of actuator series Type 27xx , actuator size 175/225 mmMounting on third party hygienic process valvesConnection options Connection MultipoleConnection options, continued Connection cable glandswith ID no. 918 038.** The indicated colors refer to the connection cable available as an accessory (92903474).4 ... 20 mAGND 4 ... 20 mA +24 VClock +Clock – / GND (identical with GND operating voltage)Clock +Clock –Pt 100Fieldbus connection M12 D-codiedConnector diagram32Circular connector M12, 4 pin - operating voltageConfigurationOperating voltage + 24 V DCOperating voltage GNDCircular connector M8, 4 pin - Input signal for process value** The indicated wire colours refer to the optional connector cable with ID no. 92903474büS - Bürkert System Bus connectionM12 Circular connector,Circular connector M12 × 1, 5 pin - büS connectionCAN-Label /ShieldingCircular connector M12, 4 pin - Operating voltageSignal flow diagramProcess control circuitAdditional software functions of the TopControl Type 8693• Automatic start of the control valve systems• Automatic parameterization of the process control circuit• Automatic or manual characteristic curves selection• Setting of the seal and the maximum stroke threshold respectively • Parameterization of the positioner• Manual parameterization of the process controller• Liwithation of the stroke range• Liwithation of the manipulating speed• Setting of the moving direction• Configuration of the binary input• Signal range splitting on several controllers• Configuration of an analogue or double binary outputs• Signal fault detection• Safety position• Code protection• Contrast inversion of the display• Language selection • Parametisable diagnostic functions* / Binary outputs (option)– Operating-hours counter– Path accumulator– Position monitoring– Process actual value monitoring– Monitoring of the mechanical end positions in the armature– G raphical display of the dwell time density and movementrange– Direction reversal counter– Temperature monitoring* F or installation kits to 3rd party process valves please see data-sheet installation kits for hygienic process valves or contact your sales office for related drawings or individual engineering support You will find a more detailed description for every diagnostic func-tion in the operating manual.Position control loopSchematic diagram of the Type 86931) The operating voltage is supplied with a 3-wire unit independent from the setpoint signal.2) Alternative optionsWith PROFIBUS DP, DeviceNet, EtherNet/IP, PROFINET, Modbus TCP and büS - Bürkert System BusIn case of special application conditions, please consult for advice.Subject to alteration.© Christian Bürkert GmbH & Co. KG1810/18_EU-en_00895093 To find your nearest Bürkert facility, click on the orange box。

Robust Control

Robust ControlRobust control is a crucial concept in the field of engineering, particularly in the realm of control systems. It involves designing control systems that can effectively handle uncertainties and variations in the system being controlled. This is essential because real-world systems are often subject to various disturbances and uncertainties that can affect their performance. One of the key challenges in robust control is ensuring that the system remains stable and performs well under different operating conditions. This requires the use of robust control techniques that can account for uncertainties in the system model and disturbances that may affect the system's behavior. By incorporating robust control strategies, engineers can design control systems that are more resilient and reliable in the face of uncertainties. There are several approaches to robust control, including robust PID control, H-infinity control, and mu-synthesis. Each of these approaches has its strengths and weaknesses, and the choice of approach depends on the specific requirements of the system being controlled. Robust PID control, for example, is a popular choice for many industrial applications due to its simplicity and effectiveness in handling uncertainties. On the other hand, H-infinity control is a more advanced approach that is particularly well-suited for systems with stringent performance requirements. By optimizing the system's performance under worst-case scenarios, H-infinity control can provide superior performance compared to other robust control techniques. However, it also requires more complex design and analysis techniques, making it suitable for more demanding applications. Mu-synthesis is another powerful approach to robust control that is based on the theory of structured singular value (mu). By optimizing the system's performance over a range of uncertainties, mu-synthesis can provide robuststability and performance guarantees for complex systems. However, it also requires sophisticated tools and techniques for design and analysis, making it suitable for high-performance applications. Overall, robust control plays a vital role in ensuring the stability and performance of control systems in the face of uncertainties. By incorporating robust control techniques into the design process, engineers can create control systems that are more reliable, resilient, and effective in real-world applications. This not only improves the overallperformance of the system but also enhances its safety and robustness in the face of uncertainties.。

1、下载文档前请自行甄别文档内容的完整性，平台不提供额外的编辑、内容补充、找答案等附加服务。
2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
3、如文档侵犯您的权益，请联系客服反馈,我们会尽快为您处理(人工客服工作时间：9:00-18:30)。

ORIGINAL ARTICLEA robust self-learning PID control system design for nonlinear systems using a particle swarm optimization algorithmChih-Min Lin •Ming-Chia Li •Ang-Bung Ting •Ming-Hung LinReceived:23February 2010/Accepted:18May 2011/Published online:22June 2011ÓSpringer-Verlag 2011Abstract This study presents a robust self-learning pro-portional-integral-derivative (RSPID)control system design for nonlinear systems.This RSPID control system comprises a self-learning PID (SPID)controller and a robust controller.The gradient descent method is utilized to derive the on-line tuning laws of SPID controller;and the H 1control technique is applied for the robust con-troller design so as to achieve robust tracking performance.Moreover,in order to achieve fast learning of PID con-troller,a particle swarm optimization (PSO)algorithm is adopted to search the optimal learning-rates of PID adap-tive gains.Finally,two nonlinear systems,a two-link manipulator and a chaotic system are examined to illustrate the effectiveness of the proposed control algorithm.Sim-ulation results show that the proposed control system can achieve favorable control performance for these nonlinear systems.Keywords PID control ÁParticle swarm optimization (PSO)ÁH ?control1IntroductionThe proportional-integral-derivative (PID)controller has been practically applied in industries for 60years due to its simple architecture and easy design properties.Until today,the PID controller is still used in different control appli-cations,even though lots of new control techniques have been proposed.However,the traditional PID controller needs some manual tuning before it is used to practical application in industries.When the PID controllers are applied to complicated systems such as nonlinear systems,different tuning algorithms of PID controllers have been proposed [1–5].In recent decade,several intelligent tuning algorithms have been applied to tune the PID controllers.The PID controller automatic tuning methods have been proposed by using a genetic algorithm [6],an immune algorithm [7]and a fuzzy-genetic algorithm [8].However,the learning speed of these algorithms is slow;thus,they are not suitable for real time control systems.Harinath and Mann [9]proposed a fuzzy PID controller for multivariable process systems.However,it takes a two-level tuning algorithm;thus,it also takes too much computation time.Therefore,in this paper,another simple optimization algorithm called particle swam optimization (PSO)algo-rithm will be used for the optimal parameter search of the PID controller.The PSO algorithm,a new evolutionary computation technique,is proposed by Kennedy and Eberhart [10].It was developed by the research of the social behavior of animals,e.g.,bird ﬂocking.Unlike a typical GA,PSO algorithms have memorial capability without complicated evolutionary process such as crossover and mutation in GA,and each particle can memorize its best solution.In addition,if another particle discovers a better solution,it will be shared among other particles.The best solution isC.-M.Lin (&)ÁM.-C.Li ÁA.-B.Ting ÁM.-H.LinDepartment of Electrical Engineering,Yuan Ze University,No.135,Yuan-Tung Road,Chung-Li,Tao-Yuan 320,Taiwan,ROCe-mail:cml@.tw M.-C.Lie-mail:s929101@.tw A.-B.Tinge-mail:s968501@.tw M.-H.Line-mail:s958505@.twInt.J.Mach.Learn.&Cyber.(2011)2:225–234DOI 10.1007/s13042-011-0021-4thus memorized and every particle will move toward this best solution.In past decade,PSO algorithms have been widely used to solve the modeling and control problems of application systems[11–14].Moreover,the PID control based on PSO algorithms have been proposed in[15,16]. However,it is difﬁcult to solve the inequality for the optimal solution in[15];and it has not given the stability analysis in[16].In this paper,a robust self-learning PID(RSPID) control system is proposed for nonlinear systems.This RSPID control system comprises a self-learning PID (SPID)controller and a robust controller.The SPID controller is utilized to approximate an ideal controller, and the robust controller is designed to recover the residual approximation error between the ideal controller and SPID controller.The gradient descent method and H1control technique are utilized to derive the on-line tuning laws of SPID controller and robust controller,so that the robust stability of the system can be obtained. Furthermore,the PSO algorithm is adopted to auto-search the optimal learning-rates of PID controller to increase the learning speed.Finally,two nonlinear systems are presented to support the validity of the proposed control method.This study is organized as follows.Problem formulation is described in Sect.2.The PSO algorithm is brieﬂy reviewed in Sect.3.The design procedure of the proposed RSPID control system is constructed in Sect.4.In Sect.5, simulations are performed to verify the effectiveness of the proposed control method.Finally,conclusions are drawn in Sect.6.2Problem formulationConsider a class of n th-order multi-input multi-output (MIMO)nonlinear systems expressed in the following form:xðnÞðtÞ¼fðxðtÞÞþGðxÞðtÞuðtÞþdðtÞy¼xðtÞð1ÞwhereuðtÞ¼½u1ðtÞ;u2ðtÞ;...;u mðtÞ T2<m the controlinput vectorof the system y¼xðtÞ¼½x1ðtÞ;x2ðtÞ;...;x mðtÞ T2<m the systemoutput vector xðtÞ¼½x TðtÞ;_x TðtÞ;...;xðnÀ1ÞTðtÞ T2<mn the statevector of thesystemfðxðtÞÞ2<m unknownboundednonlinearfunctionGðxðtÞÞ2<mÂm unknownboundednonlinearmatrixdðtÞ¼½d1ðtÞ;d2ðtÞ;...;d mðtÞ T2<m unknownboundedexternaldisturbance.When neglecting the modeling uncertainty and external disturbance,the nominal system of Eq.1can be obtained as xðnÞðtÞ¼f nðxðtÞÞþG n uðtÞð2Þwhere f nðxðtÞÞ2<m is the nominal function of fðxðtÞÞ, and the constant matrix G n¼diagðg n1;g n2;...;g nmÞ2 <mÂm is the nominal functions of GðxðtÞÞ.Without losing generality,assume the constant g ni!0for i¼1...m. Assume that the nonlinear system of Eq.2is controllable and GÀ1n exists for all xðtÞ.If the external disturbance and modeling uncertainty exist,the MIMO nonlinear systems Eq.1can be reformulated asxðnÞðtÞ¼f nðxðtÞÞþG n uðtÞþlðxðtÞ;tÞð3Þwhere lðxðtÞ;tÞis referred to as the lumped uncertainty, including system’s uncertainty and external disturbance.The control problem is toﬁnd a suitable controller for the MIMO nonlinear systems Eq.1so that the system output vector xðtÞcan track desired reference trajectory vector x dðtÞ¼½x d1ðtÞ;x d2ðtÞ;...;x dmðtÞ T2<m closely.A lot of control techniques have been presented to achieve reference trajectory tracking.However,in this paper,a simple adaptive PID control scheme will be pro-posed to deal with the uncertain MIMO nonlinear system. Moreover,the H1control technique will be used to guarantee the robust tracking performance.Deﬁne the tracking error aseðtÞD x dðtÞÀxðtÞ2<mð4Þthen the system tracking error vector is deﬁned ase D½e TðtÞ;_e TðtÞ;...;eðnÀ1ÞTðtÞ T2<mnð5ÞIf the system dynamics f nðxðtÞÞand G n,and the lumped uncertainty lðx;tÞare exactly known,an ideal controller can be designed asuÃðtÞ¼GÀ1n½xðnÞdðtÞÀf nðxðtÞÞÀlðxðtÞ;tÞþH T eðtÞ ð6Þwhere H¼½H n;...;H2;H1 T2<mnÂm is the feedback gain matrix which contains real numbers.H i¼diagðh i1;h i2;...;h imÞ2<mÂm is a nonzero positive constant diagonal matrix.Substituting the ideal controller Eq.6 into Eq.3,gives the error dynamic equationeðnÞþH T e¼0ð7ÞIn Eq.7,if H is chosen to correspond to the coefﬁcients of a Hurwitz polynomial,it implies limt!1jj eðtÞjj¼0. However,in practical application,the system uncertainties and external disturbance of nonlinear systems are generally unknown,so that the idea controller uÃin Eq.6is always unobtainable.Thus,a SPID controller is designed to mimic the idea controller.And then,based on the H1control technique,the robust controller is developed to attenuate the effect of the approximation error between SPID controller and the ideal controller so that the robust tracking performance can be achieved.3Robust self-learning PID(RSPID)control system designThe block diagram of the nonlinear control system is shown in Fig.1.The RSPID control system is assumed to take the following form:u RSPIDðtÞ¼u SPIDðtÞþu RðtÞð8Þwhere u SPIDðtÞis a self-learning PID controller utilized to approximate the ideal controller uÃ,and u RðtÞis the robust controller designed to suppress the inﬂuence of residual approximation error between the ideal controller and SPID controller.3.1SPID controller designThe SPID controller can be described asu SPIDðtÞ¼^K P eðtÞþ^K IZ t0eðsÞd sþ^K Dd eðtÞdtð9Þwhere^K P;^K I and^K D are the adaptive parameters ofproportional gain,integral gain and derivative gain matri-ces,respectively;and^K P¼diagð^k P1;^k P2;...;^k PmÞ2<mÂm;^KI¼diagð^k I1;^k I2;...;^k ImÞ2<mÂm;^K D¼diagð^k D1;^k D2;...^kDmÞ2<mÂm:An integrated error function is deﬁned assðe;tÞ eðnÀ1ÞþH1eðnÀ2ÞþÁÁÁþH nZ teðsÞd sð10Þwhere sðe;tÞ¼½s1ðtÞ;s2ðtÞ;...;s mðtÞ T.From Eq.9,thecontrol law Eq.8can be rewritten asu RSPIDðtÞ¼u SPIDð^K P;^K I;^K D;tÞþu RðtÞð11ÞTaking the time derivative of both sides of Eq.10andusing Eq.3,it can be obtained that_sðe;tÞ¼eðnÞþH T e¼Àf nðxðtÞÞÀG n uðtÞþxðnÞdÀlðxðtÞ;tÞþH T eð12ÞSubstituting Eq.9into Eq.12and multiplying both sidesby s Tðe;tÞ,yieldss Tðe;tÞ_sðe;tÞ¼Às Tðe;tÞf nðxðtÞÞÀs Tðe;tÞG n½u SPIDð^K P;^K I;^K D;tÞþu RðtÞþs Tðe;tÞðxðnÞdÀlðxðtÞ;tÞþH T eÞð13ÞBy deﬁning12s Tðe;tÞsðe;tÞas a cost function,then itsderivative is s Tðe;tÞ_sðe;tÞ.According to the gradientdescent method,the gains of^K P;^K I and^K D are updatedby the following tuning laws_^kPi¼Àgo s Tðe;tÞ_sðe;tÞSPID io u SPIDiðtÞ^Pig P s iðtÞg ni e iðtÞð14Þ_^kIi¼Àg Io s Tðe;tÞ_sðe;tÞSPID io u SPIDiðtÞ^Ii¼g I s iðtÞg niZ te iðtÞd sð15Þ_^kDi¼Àg Do s Tðe;tÞ_sðe;tÞo u SPIDiðtÞo u SPIDiðtÞo^k Di¼g D s iðtÞg nide iðtÞð16Þwhere u SPIDiis the i th element of u SPID;g P;g I and g D arethe learning-rates,which will be auto-searched by PSOalgorithm.3.2Robust controller designIn case of the existence of an approximation error,the idealcontroller can be reformulated as the summation of SPIDcontroller and the approximation error:uÃðtÞ¼u SPIDð^K P;^K I;^K D;tÞþeðtÞð17Þwhere eðtÞ¼½e1ðtÞ;e2ðtÞ;...;e mðtÞ T2<m denotes the approximation error.Substituting Eq.8into Eq.3,yieldsxðnÞðtÞ¼f nðxðtÞÞþG n½u SPIDðtÞþu RðtÞ þlðxðtÞ;tÞð18ÞFrom Eqs.6,10,18and after some straightforward manipulation,it can be obtained thateðnÞþH T e¼G n½uÃðtÞÀu SPIDðtÞÀu RðtÞ ¼_sðe;tÞð19ÞNow,the robust controller can be developed to attenuate the effect of the approximation error between the ideal controller and SPID controller so that the H1tracking performance can be achieved.In case of the existence of eðtÞ;consider a speciﬁed H1tracking performance[17]X m i¼1Z Ts2iðtÞdtX mi¼1½s2ið0Þ=g ni þX mi¼1r2iZ Te2iðtÞdtð20Þwhere r i is a prescribed attenuation constant.The robust controller is designed asu RðtÞ¼ð2R2ÞÀ1ðR2þIÞsðe;tÞð21Þwhere R¼diagðr1;r2;...;r mÞ2<mÂm:Then the follow-ing theorem can be stated and proven.Theorem1:Consider the nth-order MIMO nonlinear systems represented by Eq.1.The RSPID control law is designed as Eq.11,where u SPIDðtÞis given in Eq.9with the on-line parameter tuning algorithms given as Eqs.14–16, and the robust controller is designed as Eq.21.Then the desired H1tracking performance in Eq.20can be achieved for the speciﬁed attenuation levels r i;i¼1;2;...;m: Proof:The Lyapunov function is given byVðsðe;tÞÞ¼12s Tðe;tÞsðe;tÞð22ÞTaking the derivative of the Lyapunov function and using Eqs.17,19and21,yields_Vðsðe;tÞÞ¼s Tðe;tÞ_sðe;tÞ¼s Tðe;tÞG n½eðtÞÀð2R2ÞÀ1ðR2þIÞsðe;tÞ¼X mi¼1g ni s iðtÞe iðtÞÀs2iðtÞr2iþ12r2i!¼X mi¼1g ni s iðtÞe iðtÞÀs2iðtÞ2Às2iðtÞ2r2i!¼X mi¼1g niÀs2iðtÞ2À12s iðtÞr iÀr i e iðtÞ2þr2ie2iðtÞ2 "#X m i¼1g niÀs2iðtÞ2þr2ie2iðtÞ2!ð23ÞAssuming e iðtÞ2L2½0;T ;8T2½0;1Þ;integrating theabove equation from t=0to t=T,yieldsVðTÞÀVð0ÞX mi¼1g niÀ12Z Ts2iðtÞtðÞdtþr2i2Z Te2iðtÞdt2435ð24ÞSince VðTÞ!0,the above inequality implies the followinginequality12X mi¼1g niZ Ts2iðtÞdt Vð0Þþ12X mi¼1g ni r2iZ Te2iðtÞdtð25ÞUsing Eq.22,the above inequality is equivalent to thefollowingX mi¼1Z Ts2iðtÞdtX mi¼1½s2ið0Þ=g ni þX mi¼1r2iZ Te2iðtÞdtð26ÞThus the proof is completed.4Particle swarm optimization(PSO)algorithmThe learning-rates of the tuning laws in SPID controller areusually selected by trial-and-error process.In order toachieve the best learning speed,the PSO algorithm isadopted to search the optimal learning-rates g P;g I and g Din the SPID controller.In1995,Kennedy and Eberhart[10]initially proposedthe particle swarm concept and PSO algorithm;this algo-rithm is one of optimization methods.It has been proven tobe efﬁcient in solving optimization problem.In the PSOalgorithm,each particle represents a candidate solution tothe optimization problem.The particle keeps track of itscoordinates in the problem space which are associated withthe personal best solution.Another is the global best valuethat is tracked by the global version of the particle swarmoptimizer.At each time step,the PSO algorithm consists ofchanging the velocity that accelerates each particle towardits personal best and global best locations.Acceleration isweighted by a random term with separate random numbersbeing generated for acceleration toward personal best andglobal best locations,respectively[11].From then on,several PSO algorithms have been proposed with slightlydifferent versions[12–16].Theﬂowchart of the utilized PSO algorithm is drawn inFig.2.4.1Fitness functionIn order to maintain the control characteristic of SPIDcontroller,aﬁtness function is chosen asfit ¼10:1þjj e ðt Þjjð27ÞIt means that if the error states e ðt Þis forced to zero then the expected value of ﬁtness will be fit ¼10.4.2Velocity and position update lawIn PSO algorithm,a population of particles is randomly generated initially;each particle adjusts self-position with velocity according to its own experience and the experi-ences of other particles.The particle velocity and position update law is adopted as [16]v l q ðn þ1Þ¼v l q ðn ÞÂiw þn 1Ârand 1ðÁÞÂ½L l best q Àp l q ðn Þþn 2Ârand 2ðÁÞÂ½G l best qÀp l q ðn Þþn 3Ârand 3ðÁÞÂ½S l best q Àp l q ðn Þð28Þp l q ðn þ1Þ¼p l q ðn Þþv l q ðn þ1Þwhere iw is called the inertia weight which balances the global and local search,v l q ðn Þand p l q ðn Þdenote current velocity and current position,respectively;rand 1ðÁÞ,rand 2ðÁÞand rand 3ðÁÞdenote random variables between 0and 1;n 1,n 2,and n 3denote acceleration factor_1,acceleration factor_2and acceleration factor_3,respectively;L l best q ,G l best q ,and S l best q are the personal best index,the global best index,and the sub-population best index of the q th particle,respectively.Additionally,q ¼1;2;...;n p ,in which n p is the population size and‘¼1;2;...;n d ,in which n d is the dimension of each particle and iw is given by iw ¼iw max Àiw max Àiw minN maxÂN nð29Þwhere iw max ,iw min ,N max and N n are iteration maximum value,iteration minimum value,total iteration number and current iteration number of inertia weight.This PSO algorithm is used to on-line tune the learning-rates g P ,g I and g D in Eqs.14–16to achieve fast learning speed of PID gains.5Simulation resultsTwo uncertain nonlinear systems,a two-link manipulator control system and a uniﬁed chaotic system are examined to illustrate the effectiveness of the proposed design method.The parameters of PSO are set as:the population size z =20,the dimension of the particle h =2,the acceleration factors n 1=0.75,n 2=3.25,n 3=0.1,the total iteration number N max =10,the iteration maximum value iw max =0.9,the iteration minimum value iw min =0.4,the initial states of velocity v l q ðn Þand position p l q ðn Þof each particle are randomly generated.Example 1.Two-link manipulator systemIn this example,the proposed control system is applied to control a two-link robot manipulator.Figure 3depicts this two-link manipulator,the angles of the second and the third links were considered to be h 1and h 2,respectivelyFig.2The ﬂowchart of PSO algorithm[18].In addition,the numerical values of parameters of the robot model were speciﬁed as that in [19].The dynamic equation is given as followsM ðh Þ€h þA m ðh ;_hÞ_h þB ðh ÞþZ _h þs d ¼s ð30Þwhere h 2<n is the joint position vector;M ðh Þ2<n Ân is asymmetric positive deﬁnite inertia matrix;A m ðh ;_hÞis a vector of Coriolos and centripetal torques;B ðh Þ2<n representing the gravitational torques;Z ¼K x þV f 2<n Ân is a diagonal matrix consisting of the back emf coefﬁcient matrix K x and the viscous friction coefﬁcient matrix V f ;s d 2<n is the unmodeled disturbances vector;s 2<n is the vector of control input torques.By deﬁning the state vector x ðt Þ¼½x 1ðt Þ;x 2ðt Þ T ¼½h 1;h 2 T ,the dynamic Eq.30can be expressed as €x ðt Þ¼f ðx ðt ÞÞþG ðx ðt ÞÞu ðt Þþd ðt Þð31Þwhere the unknown nonlinear function G ðx ðt ÞÞ¼M À1ðh Þ;and f ðx ðt ÞÞ¼M À1ðh ÞÀA m ðh ;_h Þ_h ÀB ð_h ÞÀZ _h Às d ÂÃ;where A m ðh ;_h Þ¼sin ðh 2Þða 2þp 2a 9ÞÀ_h 2Àð_h 1þ_h 2Þ_h 10 !;B ðh Þ¼a 200a 8!;Z ¼a 5cos ðh 1Þþa 6cos ðh 1þh 2Þþp 4½cos ðh 1Þþp 5cos ðh 1þh 2Þa 9a 6cos ðh 1þh 2Þþp 5cos ðh 1þh 2Þa 9 ;M ¼M 11M 12M 21M 22!where M 11¼a 1þ2a 2cos ðh 2Þþðp 1þ2p 2cos ðh 2Þa 9Þ;M 12¼ða 3þa 2cos ðh 2ÞÞþðp 3þp 2cos ðh 2Þa 9Þ;M 21¼ða 3þa 2cos ðh 2ÞÞþðp 3þp 2cos ðh 2Þa 9Þ;M 22¼a 1þp 3a 9[19].In this example,the initial conditions of the states are given as a 1¼6:33;a 2¼0:14;a 3¼0:11;a 4¼27:6;a 5¼31:9;a 6¼3:3;a 7¼0:94;a 8¼4:54;a 9¼1:25;p 1¼0:37;p 2¼0:18;p 3¼0:18;p 4¼4:23;p 5¼4:15;h 1¼50p =180;h 2¼10p =180;_h1¼0;_h 2¼0:For demonstrating the tracking performance of the proposed control system,the desired trajectories for x d 1ðt Þand x d 2ðt Þare set as x d ðt Þ¼x d 1ðt Þx d 2ðt Þ!¼0:5þ0:2ðsin ðt Þþsin ð2t ÞÞ1:3À0:1ðsin ðt Þþsin ð2t ÞÞ!ð32Þwhere x d ðt Þ¼x d 1ðt Þ;x d 2ðt Þ½ T ¼h d 1;h d 2½ T is the reference trajectory vector.For the proposed RSPID control system,the feedback gain matrix is designed as H ¼diag ð40;4Þ;the initial conditions of the learning-rates are chosen as g P =400,g I =300and g D =400,respectively;all the PID gains are set as zero initially;the initial states of two-link manipulator are speciﬁed as x 1ð0Þ¼20p =180;x 2ð0Þ¼10p =180;_x1ð0Þ¼0and _x 2ð0Þ¼0;and the initial states of desired trajectories x d 1ð0Þ¼0;x d 2ð0Þ¼0;_xd 1ð0Þ¼0and _xd 2ð0Þ¼0:In order to study the robustness of the proposed control system,assume that the two-link manip-ulator control system has external disturbance s d ¼½40;25 at t ¼5s.Besides,the attenuation level is chosen as R ¼diag ð0:5;0:5ÞIn order to compare the control performance,the adaptive fuzzy control (AFC)presented in [18]is also applied to this manipulator ing this control system,the tracking trajectories are shown in Fig.4a,b,respectively.The simulation results of the proposed RSPID control system for this two-link manipulator control system are shown in Figs.4,5,6,7.Figure 4c,d represents the tracking responses of x 1ðt Þand x 2ðt Þby the proposed RSPID control scheme,respectively.Moreover,the control inputs and tracking errors of x 1ðt Þand x 2ðt Þare plotted in Fig.5a–d,respectively.In addition,the ﬁtness function and learning-rates are shown in Fig.6a–d,respectively.The PID gains K P ;K I and K D are shown in Fig.7a–c,paring Fig.4c,d with Fig.4a,b,it can be seen the tracking error have been much reduced by using the RSPID controller.Moreover,from Fig.6,it is also seen that the learning-rates of PID controller converge after the 5th second.The simulation results also show that the pro-posed RSPID control system can effectively achieve parameter tuning and favorable control for the two-link manipulator control system.Example 2.Uniﬁed chaotic systemConsider a general master–slave uniﬁed chaotic sys-tems;the master system and slave system are given as [20]._xd 1ðt Þ¼ð25h c þ10Þðx d 2ðt ÞÀx d 1ðt ÞÞMaster _xd 2ðt Þ¼ð28À35h c Þx d 1ðt ÞÀx d 1ðt Þx d 3ðt Þþð29h c À1Þx d 2ðt Þ_x d 3ðt Þ¼x d 1ðt Þx d 2ðt ÞÀ8þh c3 x d 3ðt Þð33Þ_x1ðt Þ¼ð25h c þ10Þðx 2ðt ÞÀx 1ðt ÞÞþd 1ðt Þþu 1ðt ÞSlave _x2ðt Þ¼ð28À35h c Þx 1ðt ÞÀx 1ðt Þx 3ðt Þþð29h c À1Þx 2ðt Þþd 2ðt Þþu 2ðt Þ_x 3ðt Þ¼x 1ðt Þx 2ðt ÞÀ8þh c3x 3ðt Þþd 3ðt Þþu 3ðt Þð34Þwhere x di (t )and x i (t )are the system states variables of master system and slave system,respectively;d i ,i =1,2,3denote the disturbances and u i ,i =1,2,3are the control inputs.Assume a 1¼ð25h c þ10Þ;a 2¼ð28À35h c Þ;a 3¼ð29h c À1Þand a 4¼8þh c 3ÀÁ;then the master system Eq.33and the slave system Eq.34can be rewritten as_xd 1ðt Þ¼a 1ðx d 2ðt ÞÀx d 1ðt ÞÞ_xd 2ðt Þ¼a 2x d 1ðt ÞÀx d 1ðt Þx d 3ðt Þþa 3x d 2ðt Þ_xd 3ðt Þ¼x d 1ðt Þx d 2ðt ÞÀa 4x d 3ðt Þð35Þ_x1ðt Þ¼a 1ðx 2ðt ÞÀx 1ðt ÞÞþd 1ðt Þþu 1ðt Þ_x2ðt Þ¼a 2x 1ðt ÞÀx 1ðt Þx 3ðt Þþa 3x 2ðt Þþd 2ðt Þþu 2ðt Þ_x3ðt Þ¼x 1ðt Þx 2ðt ÞÀa 4x 3ðt Þþd 3ðt Þþu 3ðt Þð36Þthe master–slave system can be expressed as _x d ðt Þ¼f ðx d ðt ÞÞð37Þand_xðt Þ¼f ðx ðt ÞÞþG ðx ðt ÞÞu ðt Þþd ðt Þð38Þwhere x d ðt Þ¼½x d 1ðt Þ;x d 2ðt Þ;x d 3ðt Þ T ;x ðt Þ¼½x 1ðt Þ;x 2ðt Þ;x 3ðt Þ T ;f ðx d ðt ÞÞ¼½a 1ðx d 2ðt ÞÀx d 1ðt ÞÞ;a 2x d 1ðt ÞÀx d 1ðt Þx d 3ðt Þþa 3x d 2ðt Þ;x d 1ðt Þx d 2ðt ÞÀa 4x d 3ðt Þ T ,f ðx ðt ÞÞ¼½a 1ðx 2ðt ÞÀx 1ðt ÞÞ;a 2x 1ðt ÞÀx 1ðt Þx 3ðt Þþa 3x 2ðt Þ;x 1ðt Þx 2ðt ÞÀa 4x 3ðt Þ T ;G ðx ðt ÞÞ¼diag ½1;1;1 ,d ðt Þ¼½d 1ðt Þ;d 2ðt Þ;d 3ðt Þ T and u ðt Þ¼½u 1ðt Þ;u 2ðt Þ;u 3ðt Þ T .The control objective is to ﬁnd a suitable control law u ðt Þ,so that the state trajectories of slave chaotic system x ðt Þcan follow the master chaotic system x d ðt Þunder different initial conditions and subject todisturbances.Fig.5The control inputs and tracking errors of the two-link manipulator system a control input 1,b control input 2,c tracking error 1,d tracking error2Fig.4The state responses of AFC and RSPID.a state response 1of AFC,b state response 2of AFC,c state response 1of RSPID,d state response 2of RSPID (dashed line desired trajectory,continuous line state trajectory)。