Abstract
Accurate and reliable quality index prediction is indispensable in quality control of the industrial propylene polymerization (PP) processes. This paper presents a novel modeling approach for quality index prediction based on optimal fuzzy wavelet neural network (FWNN) with improved gravitational search algorithm (IGSA), where the constant or a linear function of inputs in conclusion part of traditional TSK fuzzy model is replaced with wavelet neural network (WNN). Then, an online learning algorithm of the FWNN model is derived by using gradient descent algorithm, and an IGSA algorithm is proposed to online adapt the learning rates of FWNN. Research on the proposed soft sensor is carried out with the data from a real industrial PP plant, and the results are compared among the WNN, FWNN and IGSA-FWNN models. The research results show that the proposed prediction model achieves a good performance in practical industrial quality index, melt index, prediction process.
Keywords
Introduction
Polymerization process is one of the most important processes in petroleum chemical industry. The advanced monitoring and control of polymerization processes, in particular the quality index of polymer products, is of major strategic importance to the polymer manufacturing industry. The melt index (MI) of polypropylene, which determines different grades of the product, is one of the most significant quality indexes in practical propylene polymerization (PP) processes. Therefore, building a useful MI prediction model has significant importance. However, the MI of polypropylene is usually sampled online and then measured offline with an analytical procedure in the laboratory, which is not only costly but also time consuming (1.5–2h). Therefore, the development of MI on-line estimation model is significant, not only as an online sensor but also as a forecasting system. It is a usual practice to estimate MI values in terms of measurable key variables such as temperature, pressure, feed rates of each reactants, and so forth.
Many forecasting approaches have been proposed to increase MI forecasting accuracy. Statistical models, which are known as time-series-based models, only employ historical data. This method builds the relationship between the inherent characteristics of a system and the measured data with historical inputs data. The artificial neural network (ANN) is the most popular method for its excellent performance in approximating continuous nonlinear systems (Ding et al., 2015a; Graves et al., 2016; He, 2016; Kim and Lim, 2015; Liu and Wang, 2013; Meng and Chen, 2013), which has been used in many researches on MI forecasting. Shi et al. (2006) proposed a novel soft sensor model with principal component analysis, radial basis function networks and multi-scale analysis that provides promising prediction reliability and accuracy. Zhang et al. (2006) presented a bootstrap aggregated neural network (NN) model, which could overcome the problems of the single NN, such as over-fitting and the lack of generalization capability. Li et al. (2012) researched the MI prediction by adaptively aggregated RBF NN trained with novel ACO algorithm. For this reason, many other artificial intelligence methods were proposed (Ding et al., 2015b, 2015c; Guo et al., 2015; Shi et al., 2014; Zhou and Luo, 2015). Han et al. (2005) employed three approaches, which were supported vector machines (SVM), partial least squares (PLS) and ANN for MI estimation of SAN (styrene-acrylonitrile) and PP process. Ahmed et al. (2013) proposed two parameter update schemes that employ recursive update of PLS model parameters for MI prediction in high density polyethylene processes. Jiang et al. (2012) developed a new MI prediction method by introducing relevance vector machine (RVM). Zhang and Liu (2016) presented a real-time soft sensor based on optimized least squares support vector machine (LSSVM) and the online correcting strategy (OCS). In spite of improvements in MI forecasting approaches, MI forecast even suffers from high errors.
Fuzzy logic systems, first introduced by Zadeh (1973, 1974) and then applied by Kickert and Mamdani (1978), have been a powerful tool for modeling nonlinear processes (Affonso et al., 2015; Melo and Watada, 2016; Mohammed and Lim, 2015). To improve the learning abilities of the fuzzy logic-based systems, the fuzzy neural networks (FNNs), which possess the capability of fuzzy reasoning in handling uncertain information and the capability of ANNs in learning from processes, have been proposed. It has been a powerful tool for modeling nonlinear processes owing to its ability to approximate nonlinear and uncertainties systems without the requirement of mathematical models (Amjady, 2006; Pratama et al., 2016; Tang et al., 2017). Furthermore, owing to the ability of wavelet transformation for revealing the property of function in localize region, the wavelet neural networks (WNNs) have been developed by combining the wavelets with the NNs (Chen et al., 2006; Zhao et al., 2014). The WNN converges quickly and give high precision with reduced network size because of the time-frequency localization properties of wavelets. Taking account of the neural networks’ self-learning ability, fuzzy logic’s handling uncertainty capacity and wavelet transforms’ superiority of analyzing local details, a fuzzy wavelet neural network (FWNN) has been proposed (Abiyev et al., 2013; Solgi and Ganjefar, 2018; Yilmaz and Oysal, 2010). Their combination allows us to develop a system with fast learning capability that can describe a nonlinear system characterized with uncertainties (Abiyev and Kaynak, 2008).
Gravitational search algorithm (GSA) is a novel heuristic optimization method proposed by Rashedi et al. (2009). The attractive features of GSA include ease of implementation and the fact that no gradient information is required. However, like other heuristic algorithms, it takes more calculation time than traditional gradient descent algorithm when achieving the similar accuracy. The global search ability of GSA is strong, but its local search capability is insufficient. At the later stage of iteration, due to the appearance of heavier masses, the GSA becomes dull and requires more time to reach the optimal solution (Sombra et al., 2013). By introducing global memory and group communication, an improved gravitational search algorithm (IGSA) is proposed in our work.
In this paper, a novel fuzzy wavelet neural network (FWNN) is proposed, which uses the concepts of fuzzy logic in combination with WNN. The model replaces the constant or a linear function of inputs in conclusion part of traditional TSK fuzzy model with WNN. Each fuzzy rule corresponds to a WNN consisting of several wavelets with adjustable translation and dilation parameters. In the aspect of optimizing the FWNN, this paper combines the IGSA with the gradient descent algorithm in consideration of the IGSA’s disadvantage of taking more calculation time for a relatively good accuracy. First, the structure learning algorithm of the FWNN model is derived using gradient descent algorithm, which contains the relationship between all the parameters and their learning rates. In addition, an IGSA is proposed to online adapt the learning rates to further improve the online learning capability of FWNN. Thus, the optimal prediction model of MI, IGSA-FWNN, is obtained. Finally, research on the proposed model is carried out with the data from a real industrial PP plant.
The rest of this paper is organized as follows. Section 2 introduces architecture of the proposed FWNN, the structure learning algorithm and learning rates adjustment with IGSA. Section 3 presents the simulation results and performance analysis of the proposed model; conclusion is given in Section 4.
Methods and prediction
WNN
WNN, which incorporates wavelet functions into a neural network, has been proposed by Zhang and Benveniste (1992). It is built based on BP neural network structure, in which the transfer function of hidden layer nodes is the wavelet basis function. At the same time, error reverse transmission is employed to optimize the initial parameters of wavelet function of neural network. The basic wavelet theory is as follows.
For any function
where
where a is the dilation parameter, and b is the translation parameter. They can be used to control the magnitude and position of
Grossmann and Morlet (1984) proved that any function
where
The network topology structure of the WNN is shown in Figure 1. As shown in Figure 1,
where
where

Structure of the WNN.
Several wavelet mother functions have been proposed in the wavelet theory. Each mother function has its suitable application. In this work, the wavelet employed is the Morlet wavelet, because of its directional selectiveness capability of detecting oriented features, fine tuning to specific frequencies, and its good localization in time and frequency (Nason, 2008). This is a sinusoidal signal modulated by a Gaussian wave. It is characterized for its narrow frequency response, which offers a higher spectral resolution than the Mexican hat wavelet. This wavelet is particularly useful for filtering out the background noise of the images. In this paper, the wavelet employed is the Morlet wavelet, which is described as follows
where
Architecture of the proposed FWNN
FWNN is illustrated as a five-layer network structure in Figure 2, including the input layer, membership layer, rule layer, wavelet and consequent layer, and output layer. The fuzzy rules of the proposed FWNN are of the following form
where

Network structure of the proposed FWNN.
Furthermore, the relationship function between input and output in each layer is described as follows:
Layer 1 (Input layer): Each node in this layer represents one input variable and directly transmits the input values
Layer 2 (Membership layer): Each output from layer 1 is served as the input of a membership function. The values of the corresponding membership function are calculated through the following Gaussian function
where
Layer 3 (Rule layer): Since the nodes in the rule layer represent the preconditioning part of one fuzzy logic rule, the node in this layer is denoted by
where
Layer 4 (Wavelet and consequent layer): The wavelet layer and the consequent layer are included in this layer. The wavelet layer accepts the variables
Moreover, the nodes in the consequent layer multiply the incoming signals and can be represented as follows
where
Layer 5 (Output layer): The node in this layer performs the defuzzification method to get the final output. Here, a weighted sum is used as the defuzzification function. The final network output can then be calculated according to the following function
Structure learning algorithm of FWNN
The parameter vector in proposed-FWNN to be updated is
In gradient descent algorithm, parameters
where
The update rules for the parameters in the FWNN are described as follows
where
Layer 5: The error term to be propagated in this layer is calculated as follows
Accordingly, the increment of the connective weight
Layer 4: The error term to be propagated in this layer is calculated as follows
The increment of the weight parameters
The increment of dilation parameters
The increment of translation parameters
Layer 3: The error term to be propagated in this layer is calculated by
Layer 2: The error term in this layer is calculated by
Accordingly, the increment of the center parameters
The increment of the standard deviation parameters
Learning rates adjustment with IGSA
For finding the optimal values for the parameters of FWNN, an IGSA algorithm is proposed to online adapt the learning rates
The GSA is a novel heuristic optimization method proposed by Rashedi et al. (2009). In GSA, masses (agents) attract each other through gravitational attraction. The position of the mass corresponds to a solution of the problem and the gravitational forces and inertias are determined using a fitness function (Mirjalili et al., 2012; Li and Zhou, 2011). It is expected that masses are attracted by the heaviest masses which present the optimum solution in the search space by lapse of time. The algorithm achieves the sharing of optimized information through the interplay of universal gravitational interactions between agents. In the basic GSA algorithm, the velocity and position of agents are updated as follows
where t is the number of iteration, v denotes the velocity of the agent, a denotes the acceleration of the agent and x denotes the position of the agent.
The global search ability of GSA is strong, but its local search capability is insufficient. At the later stage of iteration, due to the appearance of heavier masses, the GSA becomes dull and requires more time to reach the optimal solution. On the other hand, the GSA only considers the influence of the current position when updating the position of the agents without taking account of the agents’ memory. When the agents are near the optimum solution and move very slowly, gbest which represents the best previous position among all the agents is proposed to help them exploit the global best. Each agent can observe the best solution (gbest) and tends toward it. Thus, by introducing global memory and group communication, an IGSA is proposed.
The flow chart of the IGSA algorithm for learning rates adjustment is shown in Figure 3 and the procedure is described as follows
(1) Initialization: The IGSA algorithm randomly generates the initial population. The position of each agent (individual) that is a candidate solution for the learning rates

The flow chart of the IGSA algorithm with learning rates adjustment.
where N is the population size, and D is the dimension of the problem.
(2) Fitness function: For each agent, the fitness value should be evaluated. In fact, the training process of the proposed FWNN using the IGSA algorithm is a minimization process of the error between the desired output and the actual one. The root mean square error (RMSE) is chosen as the fitness function, which is given by
where
(3) Agents’ masses: Agents’ masses are defined using fitness evaluation. The masses of all agents are updated using the following equations
where
(4) Total force: The total force that acts on agent i is calculated by
where
The gravitational constant
where
(5) Acceleration: By the law of motion, the acceleration of the agent i with the inertial mass
(6) Update velocity and position: In the IGSA algorithm, the velocity and position of agents are updated as follows
where
Then, take the new candidate solution as the learning rates of the FWNN model to obtain the new prediction results. Then update the gbest according to the new fitness.
(7) Stopping rule: Repeat step 1 to 6 until the best fitness value is achieved or a preset count of the generation is reached. The solution gbest with the best fitness value is chosen as the best learning rates of the FWNN model. Therefore, by using the obtained online tuning learning rates from the IGSA algorithm, the final optimal MI prediction model is established.
Results and discussion
The proposed soft sensor based on FWNN with improved gravitational search algorithm (IGSA-FWNN) is applied to the prediction of quality index in the real industrial PP processes currently being operated for commercial purposes in China. The Hypol technology is used in the process, which is one of the most widespread commercial methods for producing polypropylene. Figure 4 illustrates a highly simplified schematic diagram of the process. The process consists of a chain of reactors in series: two continuous stirred tank reactors (CSTR) and two fluidized-bed reactors (FBR). The feed to the reactor is comprised of propylene, hydrogen and the Ziegler–Natta catalyst. In the first two reactors, the polymerization reaction takes place in a liquid phase, and in the third and fourth reactors, the reaction is completed in vapor phase to produce the powdered polymer products. The quality index, MI, which depends on the catalyst properties, reactant composition, reactor temperature and so on, can determine different brands of products and different grades of product quality. In this application, a commercial brand named F401 with MI values between 1.4 and 3.2 is considered. It should be noted that the current MI data are from the same grade of polypropylene production, not from the grade transition for the multistage productions in polypropylene manufacturing process. The former is a slowly varying dynamic process; the latter is dynamic process with intense disturbance.

General scheme of PP.
To develop a prediction model for the MI, nine process variables (t, p, l, a, f1, f2, f3, f4, f5), which influence the process greatly, have been chosen, where t, p, l, a stand for the process temperature, pressure, level of liquid and percentage of hydrogen in vapor phase in the first CSTR reactor, respectively; f1-f3 are flow rates of three streams of propylene into the reactor, and f4 and f5 are flow rates of catalyst and aid catalyst respectively. The data used for the prediction model have been acquired from the discrete control system (DCS) historical log recorded in a real PP plant. It has been considered that the sampling time of MI is 2 hours. Data are filtered to discard abnormal situations and to improve the quality of the prediction results. The variables are normalized with respect to their standard deviation value and mean value. Finally, 50 data sets are used for training, 20 data sets are used for testing, and the remaining 15 are used for generalization.
The performance of the proposed model is evaluated with several statistical methods. These are the mean absolute error (MAE), the mean relative error (MRE), the root of mean square error (RMSE), Theil’s inequality coefficient (TIC) and standard deviation of absolute error (STD). All performance measures are defined as follows respectively
where
In this case, the IGSA-FWNN is employed to construct the quality index prediction model. At the same time, the WNN model and the FWNN model with fixed learning rates have also been considered to compare with the IGSA-FWNN model proposed in this paper. The learning rates of the WNN model are designated as
The data listed in Table 1 show the prediction results of different models on the testing dataset and the IGSA-FWNN model performs better than other models. More clearly, the WNN model gives MAE of 0.0404, MRE of 1.56%, RMSE of 0.0722, TIC of 0.0144 and STD of 0.0716. The FWNN model, gives MAE of 0.0296, MRE of 1.15%, RMSE of 0.0446, TIC of 0.0088 and STD of 0.0455. As can be seen from the above data, the FWNN model performs better than the WNN model. Furthermore, the IGSA-FWNN model shows even better results. The MAE, MRE, RMSE, TIC and STD are 0.0179, 0.71%, 0.0233, 0.0046 and 0.0237 respectively. Compared with the WNN model, the error measurement of the proposed model has percentage decreases of 55.7%, 54.5%, 67.7%, 68.1% and 66.9%. When compared with the FWNN model with fixed learning rates, the proposed model exhibits a percentage decrease of 38.3% in MRE from 1.15% to 0.71%.
Performance of the prediction models on the testing dataset of brand F401.
A visual comparison illustration between the analysis value and prediction value is shown in Figure 5 on how better the IGSA-FWNN model performs than the other models do on the testing dataset. The curve marked with crosses is the real MI value obtained from analysis in laboratory, while the results predicted by the IGSA-FWNN model are depicted by curves marked with circles. It is observed that the IGSA-FWNN model’s results better approximate the true MI value on the most data.

Performance of the proposed models on the testing dataset of brand F401.
A good inferential model should have good generalization ability in addition to good fitting accuracy. Table 2 shows that the proposed model obtains the best generalization ability among the WNN model, the FWNN model and the IGSA-FWNN model. Compared with the WNN model, the proposed model exhibits a percentage decrease of 56.5% in MRE from 2.14% to 0.93%. The error reduction also happens in terms of MAE, RMSE, TIC and STD. Figure 6 shows how the proposed model works on the generalization dataset. Clearly, the result strongly supports the conclusion that the IGSA-FWNN model performs well not only on the testing dataset but also on the generalization dataset. In other words, the application of the proposed method on the testing and generalization datasets got from the practical industrial polymerization plant demonstrated its effectiveness and reliability.
Performance of the prediction models on the generalization dataset of brand F401.

Performance of the proposed models on the generalization dataset of brand F401.
Table 3 compares the proposed IGSA-FWNN model with other models presented in the open literatures (Ahmed et al., 2013; Cao et al., 1999; Kim and Yeo, 2010; Shi and Liu, 2006; Xu and Liu, 2014). Note that the research data used in Shi and Liu (2006) and Xu and Liu (2014) are the same as that in this paper, while Ahmed et al. (2013), Kim and Yeo (2010) and Cao et al. (1999) apply different data set. So the results of Ahmed et al. (2013), Kim and Yeo (2010) and Cao et al. (1999) are for reference only. With the same research data, our work improves the prediction precision from MRE 3.27% presented in Shi and Liu (2006) to 0.71%. These data prove that the IGSA-FWNN model could be a powerful tool for online quality index prediction in the PP process.
Comparison between the current work and the published literatures.
The above calculations are performed on a personal computer with an Intel(R) Core(TM) i5-4210M CPU at 2.60 GHz. The computation time of the WNN, FWNN, IGSA-FWNN models are 1.26 s, 2.73 s and 4.92 s, respectively, which increases with the complexity of models. It can be seen that owing to the adoption of the IGSA, the computation time (4.92 s) of the proposed IGSA-FWNN is a little longer than the FWNN (2.73 s). Since the sampling time of the industrial MI prediction is about 2 hours, the proposed method qualifies the online soft sensor for the MI prediction. The computational complex of the IGSA algorithm is O(N) (N is the size of the IGSA population).
To clarify the generality of the proposed IGSA-FWNN approach, another dataset of Brand F400 from a real PP plant is selected to train and test the proposed model, which is divided into 50 pairs for training and 30 pairs for testing. The results are shown in Table 4 and in Figure 7. Table 4 shows the prediction results of different models on the second dataset. In detail, the IGSA-FWNN model outperforms the other models again with a decrease of 44.4% in MRE from 2.16% to 1.20%, compared with the WNN model. The error reduction also happens in terms of MAE, RMSE, TIC and STD. A more clearly contrast for different models is demonstrated in Figure 7. As can been see, the proposed IGSA-FWNN model exhibits fewer estimation errors compared to other models. The experiment results show the proposed IGSA-FWNN model has better performance in the efficiency of predicting MI and generalization ability.
Performance of the prediction models on another testing dataset of brand F400.

Performance on another testing dataset of brand F400.
Conclusion
This paper presents a novel forecasting approach for quality index prediction of PP process, integrating FWNN and the improved gravitational search algorithm (IGSA-FWNN). First, the FWNN model that uses the concepts of fuzzy logic in combination with WNN is developed. Moreover, the structure learning algorithm of the FWNN model is derived using gradient descent algorithm. Then, an IGSA algorithm is proposed to online adapt the learning rates of FWNN. Furthermore, the proposed method is applied to predict the MI in a real industrial PP plant. The forecasting results of the proposed model are compared to those of the WNN and FWNN models. Comparative tests illustrate that the proposed soft sensor outperforms the other comparison models. The application of the proposed model on the testing and generalization data shows its good performance and excellent generalization ability. The soft sensor predicts the MI with an MRE of 0.71% on the testing dataset, which is much more accurate than the FWNN model with an MRE of 1.15%, while much better than the WNN model with an MRE of 1.56%. The proposed soft sensor is therefore supposed to have a promising potential for the practical use.
Footnotes
Declaration of conflicting interest
The authors declare that there is no conflict of interest.
Funding
This work is supported by Zhejiang Province Natural Science Foundation (LY16B040003, LY18D060002), National Natural Science Foundation of China (Grant No. 61603336, 61590921), Shanghai Aerospace Science and Technology Innovation Fund (E81502) and Aerospace Science and Technology Innovation Fund of China Aerospace Science and Technology Corporation (E81601), and their supports are thereby acknowledged.
