Abstract
In this paper, a model-free control framework is proposed to control the tip force of a cantilevered trilayer CPA and similar cantilevered smart actuators. The proposed control method eliminates the requirement of modeling the CPAs in controller design for each application, and it is based on the online local estimation of the actuator dynamics. Due to the fact that the controller has few parameters to tune, this control method provides a relatively easy design and implementation process for the CPAs as compared to other model-free controllers. Although it is not vital, in order to optimize the controller performance, a meta-heuristic particle swarm optimization (PSO) algorithm, which utilizes an initial baseline model that approximates the CPAs dynamics, is used. The performance of the optimized controller is investigated in simulation and experimentally. Successful results are obtained with the proposed controller in terms of control performance, robustness, and repeatability as compared with a conventional optimized PI controller.
Keywords
1. Introduction
Conducting polymer actuators (CPAs) are smart material based actuators that exhibit shape and dimension change when subjected to an electrical stimulus. This mechanical response can be used in positioning and force implementing applications that conventional actuators cannot undertake. Some of the advantages of CPAs such as; suitable to be manufactured as mini/micro-sized actuators, operating in solvent or air, large active strains in response to a driving electrical excitation, biocompatibility, and low cost, make them attractive to use in mini/micro-scale and medical applications such as mechanical stimulation of epithelial cells (Svennersten et al, 2011), a micropump (Fang and Tan, 2010), a bio-inspired compliant robotic fish (El Daou et al., 2012), and a robotic micro-gripper (Alici and Huynh, 2007). These actuators are named artificial muscles because their working mechanism is similar to that of biological muscles (Bar-Cohen and Zhang, 2008; Carpi et al., 2011; Gaihre et al., 2011).
Trilayer bender type CPAs comprise a porous substrate containing electrolyte and two conductive polymer electrodes placed on both surfaces of the substrate to form a sandwiched trilayer structure. The trilayer structure enables CPAs to operate in the air environment and thus provides an advantage in widespread applications compared to the other CPAs. When these actuators are subjected to an electrical voltage stimulation, they generate a bending motion based on an electrochemical redox reaction. Modeling and controlling the CPAs’ mechanical response is vital to understand and improve the capabilities of this class of actuators and broaden their application areas. Initial work on the mechanical response of CPAs concentrated on obtaining their positioning dynamics, including linear and nonlinear models (Alici, 2009; Du et al., 2010; Fang et al., 2007, 2008a; John et al., 2008; Madden, 2000; Nguyen et al., 2012; Shoa et al., 2011; Wang at al., 2009). In addition to modeling their dynamics, a significant amount of work has focused on controlling the CPAs’ position using model-based controllers (Fang et al., 2008b; Itik, 2013; Madden, 2003; Wang et al., 2013). Designing model-based controllers to control the CPAs may not be sufficient as the CPAs represent nonlinear and time-varying dynamics, which often differ from sample to sample. The dynamics of one CPA sample depend on the fabrication process and conditions of the actuator. Therefore, it is always possible to see a dynamic behavior difference between two CPA samples, even if they have identical geometric shapes. They also have some drawbacks such as drift, hysteresis, and degradation in actuation performance due to solvent evaporation, influencing their performance negatively and making their modeling and control insufficient (Yao et al., 2008). As the performance of the model-based controller reduces due to the reasons mentioned above, model-free adaptive controllers are considered as a potential solution instead (Beyhan and Itik, 2016; Druitt and Alici, 2013; Sancak et al., 2019).
Besides controlling the tip displacement of bender-type CPAs, the blocking force generated by their free-end is required to be robustly and accurately controlled in applications such as micro-nano manipulation, cell-injection, etc. Although there are many works in the literature focusing on modeling and control the CPAs’ tip displacement, there is a minimal number of works on the control of blocking force of such actuators (Coskun et al., 2017; Itik et al., 2014). Only quasi-static models (Alici and Huynh, 2006, 2007) are presented to predict the force output of the trilayer CPAs as modeling study which are not convenient to use in applications where the blocking force output of the CPAs needs to be controlled dynamically. Since a dynamic mechanistic model for blocking force dynamics of the CPAs does not exist, one needs to design controllers for the blocking force output of CPAs either by using an empirical model obtained by system identification methods from the input-output behavior of the CPA or by using model-free control methods.
Model-free control methods such as intelligent PID (Fliess and Join, 2009), fuzzy logic (Lee, 1990), or neural networks (Psaltis et al., 1988) are well suited and studied in controlling the position of CPAs since they do not need a precise model of the system in order to design a controller. However, their use in controlling the blocking force output of CPAs is very limited. In our previous work, a linear model obtained by using system identification was used to design a classical PID controller to control the blocking force output of the CPA, which mimics a cell injection process (Coskun et al., 2017). Although the results were successful, the designed controller could not provide the same performance for a long-time working period of the CPA and different CPA sample, due to the dynamic behavior change. Another work of authors shows that a model-free controller also called intelligent PID (i-PID) represents clear performance improvements in controlling the CPAs’ position for long-time usage in air conditions (Sancak et al., 2019). However, there is still room for investigating the proposed controller’s effect on the CPAs’ blocking-force control. Moreover, the proposed model-free controller’s effect on the control performance of different but same-sized CPAs should be investigated.
This paper investigates the effect of a model-free controller on the tip point blocking force control performance of a cantilevered trilayer CPA to eliminate the effects of unmodeled dynamics and disturbances and provide performance repeatability for different same-sized CPA samples. In order to have a fair performance comparison with a model-based PI controller, a PSO algorithm is designed to optimize the controller parameters, which utilizes a model of the CPA obtained by a black box system identification method. We note that this is not required to design the proposed controller since, in general, its parameters are tuned by trial and error (Fliess and Cédric, 2013). Here, we propose that by tuning the parameters of the i-PID controller a priori by using optimization algorithms, a significant performance improvement can be obtained compared to the trial and error method. In order to compare the control performance, a classical PI controller, which is also optimized with the same PSO algorithm, is used. Both controllers are simulated and implemented experimentally to track a step input signal. Moreover, as the CPAs in the same size may represent different dynamics, the repeatability of experiments with the same controller parameters is also investigated on a second CPA sample.
The remainder of this paper is organized as follows. In Section 2, we introduce the details of the CPA. Section 3 includes controller design together with the online estimation of the plant dynamics and optimization of the controllers with the PSO algorithm. In Section 4, simulation and experimental results are given. Finally, concluding remarks are provided.
2. Conducting polymer actuator, its structure and identification
The trilayer structure of the bending-type CPA used in this study is illustrated in Figure 1. There are two polypyrrole (PPy) layers on the outer surface of the actuator. As a separator layer, there is an amorphous, porous, and nonconductive Polyvinylidene Difluoride (PVDF) layer that holds the liquid electrolyte trifluoromethanesulfonimide (

Actuation mechanism of trilayer CPAs.
The actuation mechanism of CPAs is based on the electrochemical reduction-oxidation (redox) reaction, causing volume expansion and contraction, leading to bending motion like a bilayer cantilever due to electrons and ions passing through layers (Madden, 2000). Applying a sufficient electrical voltage to the actuator starts the reaction, and the PPy layer on the anode side oxidizes, while the layer on the cathode side reduces. The reaction process is also depicted in Figure 1 and can be described as
The positively charged PPy electrode absorbs the

Optical image of the CPA in cantilevered configuration.
2.1. Experimental setup and system identification
The schematic of the experimental setup is illustrated in Figure 3. The force measurement system consists of a Sparkfun mini load cell (TAL221), a load cell amplifier (HX711), and an Arduino-UNO board. The load cell has a maximum capacity of 981 mN with 0.1 mN resolution. The Arduino board is used to communicate between HX711 and PC over the USB port. The force measured at the tip point was used as the output, while the voltage applied via clamps to the actuator was used as the input variable. The clamp was coated with a thin gold film in order to improve conductivity. The input voltage was applied to the CPA actuator via National Instruments (NI) connector block (SC-2345) connected to the NI-6251 PCI data acquisition card located in the target PC. Also, the current passing through the actuator during the activation of the CPA was measured with a DROK Amp to Volt Transmitter module.

The schematic representation of the experimental setup.
An autoregressive exogenous (ARX) model representing the relationship between the voltage input and force output of the CPA is obtained by system identification on the MATLAB software. A pseudo-random binary sequence (PRBS) signal with a frequency of 1 Hz was applied to the actuator to acquire training data. The output force data was then acquired via the force measurement system with a sampling frequency of 80 Hz. The transfer function model obtained as a result of the system identification process is
The agreement between the model and the experimental training data is determined using a normalized root mean squared error method as 96.76%. The model is also validated with the experimental step response of the CPA, where 95.01% agreement is achieved. The time response comparison of the model and the experimental data is given in Figure 4(a) and (b) for the PRBS and step inputs, respectively. It is possible to validate the system identification results with other sets of data like sine or ramp signals. However, only step response validation is considered sufficient for this study since the desired force control behavior is generally like step response in many applications of CPAs (Alici and Huynh, 2007; Carpi et al., 2011; Coskun et al., 2017). Also, training and validation with PRBS data covers the frequency range up to 1 Hz signals in our experiments which also ensures satisfactory match with sinusoidal test signals. However, since it is omitted here to present all validation results for the sake of conciseness of the paper.

Comparison of the model and experimental data: (a) PRBS input response and (b) step input response.
3. Controller design and optimization
3.1. Controller design for CPA
In order to control the force output of the CPA, we used a model-free intelligent controller (Fliess and Cédric, 2013) based on continuously updating a local model of the CPA in a small-time interval with only the knowledge of the system input-output behavior. The details of the controller are given in the Appendix 2.
The local blocking force dynamics of the CPA can be represented by a first-order model as follows
By using a proportional feedback controller as a stabilizing controller, the intelligent controller is of the form of i-P control where the control input of the system becomes:
In the design of the stabilizing controller, integral and derivative terms are not used. That is, because a simple proportional controller is enough to ensure convergence of the error to zero since the local model is first order and the integral effect is included in

Block diagram representation of the i-P Controller.
In order to compare the performance of the proposed model-free controller, a classical PI controller is also designed and employed to the CPA. Because using a PI controller, we provide an integral effect to the proportional controller and have the same number of controller parameters with the i-P controller to be optimized. The optimization process of the controller parameters is explained in the next section.
3.2. Parameter optimization with PSO algorithm
Particle swarm optimization (PSO) algorithms are well-known population-based meta-heuristic algorithms (Eberhart and Kennedy, 1995a, 1995b) where a flying bird has a position and a velocity, and the birds change his position by adjusting the velocity when searching for food. The velocity changes based on the experience of the bird and the feedbacks received from his neighbor. Each bird in the swarm is called particle, considered as a candidate solution to the optimization problem. The swarm comprises
Where
The design parameters used for the initialization of the PSO algorithm are given in Table 1. Typically,
PSO parameters for tuning the objective function.
Intervals of the controller parameters to be optimized.
The performance of each particle is evaluated according to the pre-defined objective function, which is related to the problem to be solved. In general, this objective function is considered as integral of squared error (ISE) or integral of absolute error (IAE) because error based objective functions are generally used in the controller design process. However, this objective function can be insufficient in safely controlling the CPA for sensitive application fields because it takes no account of maximum overshoot and control input. Therefore in this study, the maximum overshoot
where
The evaluated objective functions are compared and
Both the i-P and PI controllers have two parameters (
With the given design parameters, the fitness function equation (7) is calculated using the linear time-invariant (LTI) model of the CPA. The noise in the experimental measurement is added as white noise with 10−4 noise power. Then, the noise is filtered with the same filter that is used in the experiments. Also, a time delay due to experimental measurement is added to the simulation as 0.09 s. As a reference signal to be tracked, a unit step input with an amplitude of 5 mN is considered to obtain linear dynamics from the CPA actuator. The total simulation is performed for 5 s in each iteration, and the fitness function is calculated at this time. The optimization process is repeated many times, and the change of fitness functions for both i-P and PI controllers are shown in Figure 6 for some trials. The optimization results for the best trials, where the fitness function cost is the lowest, are given in Table 3 together with the simulation results of the controllers in terms of maximum overshoot
Optimized values of the PI and i-P controller.

Fitness functions of i-P and PI controller for some trials.
As shown in Figure 6, the PSO algorithm converged to an optimal solution on average in the eighth iteration for both controllers. Although the PSO algorithm is used to tune the controllers with the same number of optimization and design parameters, the i-P controller provides better cost values than the PI controller. This implies that the optimized i-P controller improves the transient response of CPA since the evaluated 5-s simulation time mostly encapsulates the transient response.
4. Simulation and experimental results
In force control applications such as micromanipulation, the desired control accuracy is generally under mili-Newton levels. As an example, one possible area of use of CPAs is the biological cell-injection process (Coskun et al., 2017). The time-varying dynamics or slow transient response of the CPAs can be a problem for the work on a living cell. The force applied to the cell should be robustly and accurately controlled to prevent harming the cell. For this kind of application, it is vital to keep the blocking-force in a constant value that corresponds to a step reference signal. Therefore, we used only step references in our simulations and experiments. Nevertheless, the same procedure may be applied for different test signals such as ramp or sinus. However, for the sake of conciseness, we limit our simulations and experiments with step reference.
The LTI model of the CPA is used to simulate the dynamics in the optimization and control simulations. The optimized controller parameters are used to perform simulations, and the results are given in Figures 7 and 8 with the experimental results for PI and proposed i-P controllers, respectively. The simulations and experiments are performed for 20 s, and a unit step input with 5 mN amplitude is used as a reference signal. Before starting the experiments, the actuator was soaked in the electrolyte

Experimental and simulation results of actuator force

Experimental and simulation results of actuator force
As shown in Figures 7 and 8, both force output and control voltage input of the experimental and simulation results of the controllers are quite close to each other. This implies that the LTI model obtained by the system identification process was quite sufficient in the parameter optimization process of the controller for the early stages of the experiments. Although the PI and i-P controllers are optimized with the same PSO algorithm for the same number of optimization parameters, the i-P controller improves the actuator’s transient response compared to the PI controller. The transient response improvement in the experiments is 84% for rise time and 85% for settling time. No significant overshoot was observed for both controllers except noise. The performances of the controllers are compared in Table 4. Here, the steady-state error of the i-P controller is slightly larger than that of the PI controller due to the fact that we mostly focused on the transient response in the optimization process, but this can be solved by only tuning the

Current passing through the CPA during control processes.
Performance results.
4.1. Repeatability
The performance of the CPAs substantially depends on the fabrication process and air conditions of the environment that the actuator operates. A small difference in the polymer structure, solvent amount, or air conditions cause a change in the CPA dynamics. Thus, it is difficult to obtain identical mathematical models for different CPA samples with the same geometry. As a result, the control performance of model-based controllers varies from sample to sample. In this section, we employed the proposed model-free controller, which was designed for the first sample, on a second CPA sample to prove the repeatability of the experiments and robustness of the designed controller for the CPA samples. In Figure 10, the force control responses of first and second CPA samples are compared. The PI controller is also implemented with the same controller parameters to the second sample, and results are given in Figure 11.

Comparison of sample 1 and sample 2 results of actuator force

Comparison of sample 1 and sample 2 results of actuator force
The same trend was obtained in the control results of the second sample with the proposed model-free controller. However, the PI controller showed performance degradation for the second sample, as shown in Figure 11. According to the PI controller performance, the second sample of the CPA has slower dynamics in rise time and settling time. Also, the voltage input for the second sample increased by the controllers to hold the CPA at 5 mN reference value. Although the dynamic behavior change was observed in the second CPA sample, the i-P controller showed almost the same control performance as it did in the first sample. This proves the robustness of the i-P control for different but same sized CPA samples. For a better comparison, the performance results of the controllers for the second CPA sample are given in Table 5.
Performance results for second CPA sample.
5. Conclusion
In this study, a model-free controller was employed in simulation and experimentally to control the blocking force output of a trilayer conducting polymer actuator with PPy electrodes. The controller parameters were optimized by a designed PSO algorithm to achieve the desired performance characteristics with the controller. A classical PI controller, which was also optimized using the same performance index and algorithm, was used for comparison purposes. Although it is not necessary, utilizing a base-line model in order to optimize the controllers for the force control of the CPA provides better performance compared to the trial-error tuning methods used for intelligent controllers. The results demonstrated that the i-P controller with optimized parameters considerably improved the transient force response of the CPA for step reference tracking as compared to the PI controller. The designed controllers were also implemented on another CPA sample with the same dimensions to show the robustness of the controller under model uncertainties of different CPA samples. The i-P controller performed almost similar performance as it showed on the first sample, while the performance of the PI controller reduced due to the dynamic behavior difference between the two actuators. This shows the robustness and repeatability of the proposed i-P controller and the advantage of using the model-free controller framework in controlling the force output of the CPAs. Our work shows that model-free i-P control is quite promising to control the force dynamics of CPAs since the complete mechanism behind the actuation dynamics of the CPAs is not yet fully understood. As a relatively simple and effective method, the proposed controller can be used to control the dynamics of similar actuators.
Footnotes
Appendix 1
Appendix 2
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
