Abstract
This paper addresses the Sliding Mode Learning Control (SMLC) of uncertain nonlinear systems with Lyapunov stability analysis. In the control scheme, a conventional control term is used to provide the system stability in compact space while a type-2 neuro-fuzzy controller (T2NFC) learns system behaviour so that the T2NFC completely takes over overall control of the system in a very short time period. The stability of the sliding mode learning algorithm has been proven in the literature; however, it is restrictive for systems without overall system stability. To address this shortcoming, a novel control structure with a novel sliding surface is proposed in this paper, and the stability of the overall system is proven for nth-order uncertain nonlinear systems. To investigate the capability and effectiveness of the proposed learning and control algorithms, the simulation studies have been carried out under noisy conditions. The simulation results confirm that the developed SMLC algorithm can learn the system behaviour in the absence of any mathematical model knowledge and exhibit robust control performance against external disturbances.
Keywords
Introduction
Control of uncertain nonlinear systems is one of the most crucial topics in modern control engineering (Chen et al., 2016; Kayacan and Fossen, 2019; Kayacan and Peschel, 2016; Kayacan et al., 2012, 2013b, 2017). Robust controllers aim to ensure the best control performance in the presence of uncertainties, and high controller gain is a common method to overcome the uncertainity problem in nonlinear control theory (Sun et al., 2016). However, these methods bring about large control actions, and require very powerful actuators (Ren et al., 2017). Moreover, robust control performance against uncertainties is mostly obtained at the cost of sacrificing nominal control performance of the system (Liu et al., 2017a). Therefore, a control method is required to exhibit robust control performance in the presence of uncertainties while either maintaining or improving nominal control performance (Huang et al., 2015). As a model-free method, fuzzy logic control is an alternative solution to the model-based control approaches in the presence of uncertainties and has been applied to different real-time systems, such as mobile robots (Castillo et al., 2012), spherical rolling robots (Kayacan et al., 2013a) and agricultural ground vehicles (Kayacan et al., 2015b, 2018). Even though type-1 fuzzy logic controllers are the most well known and widely used types of fuzzy logic controllers, researchers have recently focused on type-2 fuzzy logic controllers, which can deal with high levels of uncertainty in real-time applications (Castillo et al., 2014; Lee et al., 2015; Liu et al., 2017c; Maldonado et al., 2013; Muhuri and Shukla, 2017; Rubio-Solis and Panoutsos, 2015; Zaheer et al., 2015).
As a learning controller, a feedback–error learning control structure was proposed by Gomi and Kawato (1993) and first implemented for the control of robot manipulators. This relied on the parallel structure of a conventional controller and a neuro-fuzzy based controller. The generated control signal by the conventional controller provides a learning error signal to train learning algorithms. The latter learns the system behaviour online and thus eliminates the former from the control of the system (Kalanovic et al., 2000; Ruan et al., 2007). Since the gradient-based methods in training algorithms are complex and computationally more expensive, sliding mode control (SMC) theory-based learning algorithms have been proposed for feedback-error learning control structures (Efe et al., 2000). These can ensure faster convergence compared to conventional learning techniques for online adjustment in neural networks and type-1 and type-2 neuro-fuzzy structures (Kayacan et al., 2015a). Moreover, the stability and robustness of neuro-fuzzy controllers (NFCs) based on SMC theory-based learning algorithms can also be examined (Topalov et al., 2007a). The novelty of SMC theory-based learning algorithms is that the parameters are adjusted by the designed algorithm in order to fulfil a stable equation by imposing the error instead of minimizing an error function. On the other hand, SMC is inherently robust to uncertainties (Yin et al., 2017a,b).
Despite their practicality and easiness to design, model-free control methods – such as fuzzy control and neuro-fuzzy control – are mostly criticized by the model-based control community due to the fact that there is generally no convincing analytical proof of system stability (Feng, 2006). These criticisms are reasonable since trial-and-error methods cannot be tolerated for sophisticated and expensive robotic systems such as unmanned air vehicles and spacecraft systems. Therefore, analytical stability analysis is an overriding requirement so researchers tend to design model-based controllers (Kayacan et al., 2014, 2016; Lam, 2018; Mayne et al., 2000). With the aim of overcoming this limitation of model-free controllers, the overall system stability proven in this paper lightens the major disadvantage of model-free control methods. In previous studies in the literature, since the overall system stability of the sliding mode learning algorithm could not be proven, stability analysis was accomplished and a proportional-derivative controller was used to ensure the stability in compact space (Topalov et al., 2007b). Recently, it has been shown that if the system is a second-order system, it is possible to guarantee the stability of the overall system by adding a robust term to the control scheme (Khanesar et al., 2015). However, this stability analysis is too restrictive.
The contributions of this paper are as follows.
The stability of the overall system is proven for an nth-order uncertain nonlinear system. The overall system stability analysis removes the most significant shortcoming of model-free control methods.
A novel sliding surface is proposed to train the sliding mode learning algorithm for the type-2 neuro-fuzzy control (T2NFC) algorithm.
The learning rate of the online sliding mode learning algorithm of T2NFC and the controller gain in the conventional control term are adaptive. Therefore, it is possible to control the system without foreknowledge of the upper bounds of the system states and their derivatives.
The developed algorithm adjusts the proportion of the lower and upper membership functions (MFs) in the T2NFC, which allows us to handle non-uniform uncertainties in type-2 fuzzy logic systems (T2FLSs).
This paper consists of four sections. The SMLC structure consisting of the T2NFC and the conventional control term, and the sliding mode control theory-based learning algorithm are given in the next section. Following this, the overall system stability is discussed. Simulation results are presented to discuss control performance and noise analysis, and finally some conclusions are drawn.
Sliding model learning control structure
Control scheme
The whole control structure consists of a conventional controller and a type-2 neuro-fuzzy controller (T2NFC) as illustrated in Figure 1. The T2NFC controls the system by learning the system behaviour owing to the fact its parameters are updated by a sliding mode learning algorithm. The conventional control action is added to guarantee the stability of the overall system in compact space while the T2NFC learns the system behaviour. In this control structure, the aim is that the output of the conventional controller converges to zero in finite time while T2NFC takes the overall control signal. The total control action applied to the system u is determined as follows:
where
where s and k denote the sliding surface and the controller gain, respectively. The controller gain k is positive (i.e.
where

Block diagram of the control scheme.
Type-2 neuro-fuzzy controller
An interval type-2 Takagi–Sugeno–Kang fuzzy if–then rule
where e and

Structure of the proposed T2NFS for two inputs.
The lower and upper Gaussian MFs for T2FLSs are written as follows:
where
The lower and upper memberships
The control signal generated by the network is formulated below (Biglarbegian et al., 2010):
where
The parameter q determines the contributions of the lower and upper firing levels to the control signal and the adaptation rule for the parameter q is given in the next section.
Sliding mode learning algorithm
The sliding surface utilized in the fundamental basis of SMC theory (Utkin, 1992) is written as follows:
where
where
where
where
The time-derivative of the aforementioned Lyapunov candidate function is calculated as follows:
If equations (1) and (23) are inserted into (26), it is obtained as:
As it is stated in Assumption 1, the total input rate
where
Stability analysis of the overall system
An nth-order uncertain nonlinear system is described in the following form:
where
where B is considered as a positive constant.
The time-derivative of V is calculated as:
The sliding surface rate
Since
Then, it is rewritten by taking the nonlinear system in equation (30) into consideration as follows:
The control law in equation (1) and the adaptation for the controller gain in equation (3) are inserted into equation (36).
The conventional control action in equation (2) is inserted into (37):
As stated in Assumption 5, the term E, the nth-order time-derivative of the reference
where
Since
Simulation results
The performance of the developed SMLC algorithm against an external disturbance is first shown in Scenario 1 and then analysed under noisy conditions in Scenario 2. Throughout the simulation studies, the number of membership functions of the T2NFC is set to 3 (i.e.
with
To avoid division by zero in the adaptation laws of equations (13)–(22), an instruction is included in the algorithm to make the denominator equal to 0.001 when its calculated value is smaller than this threshold. Moreover, the controller gain of the conventional controller and learning rate of the learning algorithm must not have infinite values under noisy conditions; therefore, a dead zone is employed. If the sliding surface is within the bounds of the dead-zone parameter
Scenario 1: control performance against external disturbance
To evaluate control performance against disturbances, the developed SMLC algorithm is applied to an adaptive cruise control system, which is illustrated in Figure 3; the control structure is shown in Figure 4. The equations of motion for the longitudinal vehicle dynamics are written as follows (Kayacan, 2017; Stankovic et al., 2000):
where

Schematic diagram of the adaptive cruise control system.

Control structure of the adaptive cruise control system.
The time–headway spacing policy to follow the preceding vehicle with the desired relative distance is formulated as follows:
where z is the desired relative inter-vehicle distance and h is the desired time headway. The spacing policy is called the constant time–headway time policy, which aims for a constant inter-vehicle time gap. The position reference in this study is considered as the position of the preceding vehicle. The relative spacing error is defined by taking the spacing policy in equation (43) into account as follows:
The sliding surface for the adaptive cruise control system is second-order, since the adaptive cruise control system is a third-order system. The slope of the sliding surface
The test trajectory is defined by the position reference
The position, speed and acceleration responses of the vehicle are respectively shown in Figures 5(a)–5(c). The developed SMLC algorithm can learn the system behaviour online in finite time so that it exhibits robust control performance against the external disturbance. As the error is shown in Figure 5(d), it controls the system without any steady-state error.

Control performance on an adaptive cruise control system: (a) position; (b) speed; (c) acceleration; (d) error; (e) control signals; (f) adaptation of the controller gain; (g) adaptation of the learning rate; (h) adaptation of parameter q.
The control signals are shown in Figure 5(e). The generated control signal by the conventional control action
The adaptations of the controller gain k, the learning rate
In the literature, disturbance observer-based control approaches have been generally implemented for uncertain systems (Liu et al., 2017b). In these approaches, unmodelled dynamics are considered as an external disturbance, and a control law including the disturbance estimate is derived. In our work, there is no need for such an observer due to the fact that the proposed algorithm learns system behaviour. This is one of the advantages of the developed algorithm in this paper.
Scenario 2: noise analysis
To evaluate control performance under noisy conditions, the developed SMLC algorithm is applied to the following numerical system (Yang et al., 2013):
The initial conditions of the system states are set to
In order to test the robustness of this control approach, the states
The control signals are shown in Figure 6(c). As seen in this figure, the generated control signal by the robust control action

Control performance under noisy measurements: (a) state
The adaptations of the controller gain k, the learning rate
Conclusion
The developed SMLC algorithm for uncertain nonlinear systems has been investigated in this paper. In addition to the stability of the training algorithm, the analytical proof of the overall system stability has been achieved for nth-order uncertain nonlinear systems. The simulation results show that the developed control structure exhibits robust control performance in the presence of external disturbances and noisy measurements. The parameters of the T2NFC controller are automatically regulated through the sliding mode learning algorithm so that the T2NFC can learn the system behaviour and take overall control of systems in a very short time period. The proposed sliding mode learning algorithm is valid for only single-input systems. As a future work, the extension of the proposed learning algorithm for multi-input systems is an interesting topic to investigate.
Footnotes
Appendix: the time-derivative of the output of the T2NFC u · n
The following equations are obtained by taking the time-derivative of equations (5)–(8):
where
By taking the time-derivative of equation (11), the following equations are obtained as follows:
Since
where
In a similar way,
where
The time-derivative of the output of the T2NFC algorithm is obtained as:
If equations (51) and (52) are inserted into the previous equation:
where
Equation (55) can be obtained as follows:
Equation (55) is inserted into (54):
Since
