Comparison of mathematical and artificial neural network models for inhibition of fuel oil ash under high temperature corrosion

Abstract

This research reports the results of literature data of mass loss tests of high temperature corrosion inhibition of steel in different concentration ratios of MgO, Al₂O₃ and SiO₂ to corrosive fuel ash of V₂O₅ in the temperature range of 550-590°C and time range of 8-100 h. Analysis focused on determining optimum mathematical equation and artificial neural network (ANN) architecture in order to gain good prediction properties. Three mathematical equations and five ANN architectures were suggested. A computer aided program was used for developing these models. Results show that polynomial mathematical equation and multilayer perceptron are able to accurately predict selected data with high correlation coefficients.

Keywords

Corrosion Steel High temperature Inhibitor Fuel ash

Introduction

Oxidation is the greatest important high temperature corrosion reaction. Metals or alloys are oxidised when heated to high temperatures in air or in the highly oxidising surroundings, such as combustion atmospheres with excess air or oxygen. In gaseous environments, high temperature corrosion is defined as the corrosion that takes place above the maximum temperature at which acids condense and dew point corrosion takes place. Although a common high temperature corrosion reaction happening in temperatures above 500°C, severe high temperature corrosion has been encountered in many cases at temperatures lower than 500°C.¹ In many industrial systems, such as boilers and turbines, plant operating conditions can be quite difficult; it is rather complex to use laboratory experiments to simulate plant conditions. However, laboratory tests can provide good general control for making initial alloy selections. In addition, field testing of nominee alloys in the operating plant provides the best way for obtaining the corrosion information that can be dependably used for final materials selection. However, mathematical and other forecasting tools can be helpful method in predicting corrosion rate data. Many researchers concentrated on the corrosion inhibition mechanism, activation parameters, reaction kinetics, etc.^2,3 Few of them reported the usage of mathematical and statistical modelling. Mathematical modelling has already established to be very useful and prevailing method in determining the relation between dependent and independent variables.⁴ Another motivating technique for developing an input–output relationship is an artificial neural network (ANN).^5–7 Artificial neural networks represent one of the fastest developing fields of artificial intelligence due to their ability to bring to mind (to a certain extent) the human problem solving characteristic, which is difficult to simulate using the logical, analytical techniques of expert system and standard software technologies.^8,9 The wide applicability of ANNs stems from their litheness and facility to model linear and non-linear systems without prior knowledge of an empirical model. This gives ANNs a benefit over old fashioned fitting methods for some chemical applications.¹⁰ The ANN excludes the restrictions of the classical approaches by extracting the desired information using the input data. Applying ANN to a system needs satisfactory input and output data instead of a mathematical equation. The ANN can be trained using input and output data to familiarise the system. Mathematical and ANN modelling are capable of forecasting and predicting any response function (corrosion rate in the present study) as function of operating conditions (such as time, temperature, etc.) that avoid the use of tedious and boring experimental laboratory work. In the present work, the high temperature corrosion data of steel as a function of temperature, time and inhibitor concentration were selected from the literature¹¹ and analysed using mathematical and ANN method.

Experimental data and methodology

Mathematical and statistical methodology

Esia,¹¹ in his previous work, studied the corrosion of steel in high temperature environment via weight loss technique through 53 runs without taking into account the ANN. The runs were designed and distributed according to the Box–Wilson central composite rotatable design. The effect of time (8-100 h), temperature (550-950°C) and inhibitor to artificial fuel ash ration (0-5) was evaluated in details. Three inhibitors were selected (MgO, Al₂O₃ and SiO₂), and artificial fuel ash was prepared (V₂O₅).¹¹ Corrosion rate as a function of different variables is given in Table 1. In the present work, three kinetic mathematical models are constructed. Parabolic kinetics, linear kinetics and logarithmic kinetics were used. These models are listed respectively as follows¹ (1) (2) (3) X is the mass loss per unit area; k _p, k _l and k _e are parabolic, linear and logarithmic kinetics rate constants; t is the time; and a is constant. The rate constants for activation parameters for some systems can be estimated from an Arrhenius type equation¹² (4) Corrosion rate data as a function of inhibitor concentration can be used to show the rate dependence of media concentration. The model proposed by Mathur and Vasudevan¹³ for the corrosion of steel in aqueous environment can be tested. The model is described by the following (5) where r is the corrosion rate, k is the reaction rate constant, C is the concentration, B is constant for the reaction studies, A is a frequency factor (pre-exponential factor), E is the activation energy (J mol^{− 1}), R is a gas constant (8.314 J mol^{− 1} K) and T is the absolute temperature (K). Equations (1)–(3) can be rearranged in terms of corrosion rate as follows (6) (7) (8) In order to find a comprehensive, compact form taking into account the effect of independent variables on corrosion rate, the following equation can be proposed (9) F(t) is a function describing the effect of time on corrosion rate depending on equations (6)–(8). In terms of yx notation; equation (9) can be rewritten (10) (11) (12) Equations (10)–(12) were suggested as parabolic, linear and logarithmic kinetics models to find the best fitting between the response y and the variables of Table 1 (x ₁, …, x ₅). Furthermore, the mathematical model of the second order polynomial equation was also suggested (13) where a ₀, a ₁, …, a ₁₉ are constants of models.

Table 1

Experimental corrosion rate y _Exp in g m^{− 2} day^{− 1}, as function of temperature (x ₁), time (x ₂) and inhibitors to vanadium ratios (x ₃, x ₄ and x ₅) obtained from Eisa¹¹ (y _ANN is predicted corrosion rate by ANN)

No.	T (x ₁)/°C	t (x₂)/h	Mg/V (x ₃)	Al/V (x ₄)	Si/V (x ₅)	CR (y _Exp)	CR (y _ANN)	Set (train/test)
1	840	75	3.618	3.618	3.618	432.01	468.103	Test
2	660	75	3.618	3.618	3.618	98.9	141.489	Train
3	840	34	3.618	3.618	3.618	518.39	468.103	Test
4	660	34	3.618	3.618	3.618	181.62	141.489	Train
5	840	75	1.382	3.618	3.618	586.19	522.519	Train
6	660	75	1.382	3.618	3.618	311.47	376.353	Test
7	840	34	1.382	3.618	3.618	661.08	522.519	Test
8	660	34	1.382	3.618	3.618	357.43	376.353	Test
9	840	75	3.618	1.382	3.618	550.23	694.202	Train
10	660	75	3.618	1.382	3.618	263.96	297.717	Train
11	840	34	3.618	1.382	3.618	648.9	694.202	Test
12	660	34	3.618	1.382	3.618	332.06	297.717	Train
13	840	75	1.382	1.382	3.618	907.47	754.113	Test
14	660	75	1.382	1.382	3.618	449.42	538.114	Train
15	840	34	1.382	1.382	3.618	265.32	754.113	Train
16	660	34	1.382	3.618	3.618	504.86	376.353	Train
17	840	75	3.618	3.618	1.382	482.06	492.877	Test
18	660	75	3.618	3.618	1.382	181	184.882	Test
19	840	34	3.618	3.618	1.382	524.89	492.877	Train
20	660	34	3.618	3.618	1.382	236.08	184.882	Train
21	840	75	1.382	3.618	1.382	200.35	549.823	Test
22	660	75	1.382	3.618	1.382	422.09	437.879	Test
23	840	34	1.382	3.618	1.382	714.14	549.823	Train
24	660	34	1.382	3.618	1.382	468.47	437.879	Test
25	840	75	3.618	1.382	1.382	667.26	732.424	Train
26	660	75	3.618	1.382	1.382	398	366.886	Test
27	840	34	3.618	1.382	1.382	688.49	732.424	Train
28	660	34	3.618	1.382	1.382	442.33	366.886	Train
29	840	75	1.382	1.382	1.382	847.16	793.618	Test
30	660	75	1.382	1.382	1.382	623.5	623.765	Test
31	840	34	1.382	1.382	1.382	875	793.618	Test
32	660	34	1.382	1.382	1.382	634.15	623.765	Test
33	950	54	2.5	2.5	2.5	1233.01	1020.794	Train
34	750	100	2.5	2.5	2.5	281.22	374.185	Train
35	750	54	5	2.5	2.5	286.23	270.61	Train
36	750	54	2.5	5	2.5	322.15	159.762	Train
37	750	54	2.5	2.5	5	328.95	312.096	Train
38	750	54	2.5	2.5	2.5	354.19	374.185	Train
39	750	54	2.5	2.5	2.5	42.39	374.185	Train
40	750	8	2.5	2.5	2.5	569.95	374.185	Train
41	750	54	0	2.5	2.5	737.46	553.542	Train
42	750	54	2.5	0	2.5	707.98	628.354	Test
43	750	54	2.5	2.5	0	608.96	414.427	Test
44	840	54	2.5	2.5	2.5	522.13	625.845	Test
45	750	75	2.5	2.5	2.5	331	374.185	Test
46	750	54	3.618	2.5	2.5	291.71	320.673	Train
47	750	54	2.5	3.618	2.5	326.44	271.636	Train
48	750	54	2.5	2.5	3.618	337.71	350.283	Train
49	660	54	2.5	2.5	2.5	259.04	359.409	Train
50	750	34	2.5	2.5	2.5	388.61	374.185	Test
51	750	54	1.382	2.5	2.5	482.42	442.925	Test
52	750	54	2.5	1.382	2.5	462.51	485.116	Train
53	750	54	2.5	2.5	1.382	448.94	393.774	Train

Artificial neural network methodology

The corrosion rate data were also used as feed for building the ANN. An ANN is an intelligent data driven modelling instrument that is able to arrest and represent complex and non-linear input/output relationships; they simulate the learning process of the human mind. Like the brain, the network structure composed of several processing features is called neurons or nodes.¹⁴ The simplest custom of ANN is the linear model. A neural network with no hidden layers, and an output with dot product synaptic function and identity activation function, actually implements a linear model. The weights correspond to the matrix and the thresholds to the bias vector. When the network is performed, it effectively multiplies the input by the weights matrix then adds the bias vector. The linear network delivers a good benchmark against which to compare the performance of neural networks. The multilayer perceptrons (MLP) network trained using back propagation algorithm is a widely used network type and is commonly applied to all types of engineering as well as research modelling problems. A radial basis function neural network is a new class of robust neural network that has been used to a restricted extent in modelling various research problems.¹⁵ The neural network model used in this study was created using STATISTICA 7 software package; it is a broad, state-of-the-art, influential and extremely fast neural network data analysis package. This feature has different options and subsoftware. One of them is an Intelligent Problem Solver (IPS). Unit activation levels are (by default) presented in colour: red for positive activation levels and green for negative. Triangles pointing to the right indicate input neurons. These neurons perform no processing and simply introduce the input values to the network. Squares indicate dot product synaptic function units (e.g. as found in MLP). Circles indicate radial synaptic function units. Small open circles that represent input and output variables are clarified using a small open circle joined to the corresponding input or output neuron. In some conditions, a number of neurons are joined to a single input or output variable. These networks were constructed using different activation functions such as sigmoid, hyperbolic, exponential, step, ramp, sine, square root, etc. Intelligent Problem Solver chooses the best activation function for network building. Each input comes via a connection that has a strength (or weight); these weights correspond to synaptic efficacy in a biological neuron. Each neuron also has a single threshold value. The weighted sum of the inputs is designed, and the threshold subtracted, to compose the activation of the neuron (also known as the post-synaptic potential of the neuron). The activation signal is passed through an activation function (also known as a transfer function) to produce the output of the neuron. If the step activation function is used (i.e. the neuron's output is 0 if the input is < 0, and 1 if the input is ≥ 0), then the neuron acts just like the biological neuron described earlier (subtracting the threshold from the weighted sum and comparing with zero is equivalent to comparing the weighted sum to the threshold). It is also noticeable that weights can be negative, which implies that the synapse has an inhibitory rather than the excitatory effect on the neuron: inhibitory neurons are found in the brain. However, there also can be hidden neurons that play an internal role in the network. The input, hidden and output neurons need to be connected together. The key issue here is feedback.¹⁶ A simple network has a feed forward structure: signals flow from inputs, forward through any hidden units, eventually attainment of the output units. Such a structure has firm behaviour.

Results and discussion

Mathematical and statistical considerations

STATISTICA 7 software was used to estimate the coefficients of these models. This software was based on the Levenberg–Marquardt non-linear estimation least squares method. The maximum number of iterations was 1000, and the convergence criterion of 1 × 10^{− 6}. The following equations are obtained (14) (15) (16) (17) The correlation coefficients of above equations were 0.604, 0.739, 0.239 and 0.9215 respectively. Generally, correlation coefficient up to 0.30 indicates a weak relationship and is of uncertain validity; between 0.50 and 0.70 indicates a significant relationship and is of practical importance; while above 0.90 means a strong relationship.¹⁷ In equations (10)–(12), the constants a ₀–a ₅ represents the coefficients accompanying each variable. Furthermore, equations (13) contains the intercept of the equation (a ₀), the individual effect of each variables (a ₁–a ₅), the interaction among variable (a ₆–a ₁₄) and self-interaction (a ₁₅–a ₁₉). The analysis of variance (F test) was used for testing the significance of each effect in equations (10)–(13). The calculations are given in Table 2. This table provides the percentage of variance explained by the mathematical models in comparison to the variance contained within the experimental results. Probability (p value) is the smallest level of significance that would lead to the rejection of the null hypothesis. The probability for ANOVA is smaller than 5%, which confirmed the validity of the suggested model. Furthermore, tabulated F values were lower than the calculated one. This means that, at 95% confidence levels, the regression coefficients are statically greater than zero, and it should be kept in the models. Graphical representation of models is shown in Fig. 1. The validity of polynomial model was observed. The larger deviation was observed in the case of logarithmic model.

Table 2

Statistical analysis of mathematical models

Model	Effect	Sum of squares	Degree of freedom	Mean squares	F _cal	F _tab	p value
Parabolic	Regression	12 536 835	5	2 507 367	73.908	2.844	< 0.0001
	Residual	1 628 411	48	33 925
	Total	14 165 247	53
Linear	Regression	13 001 183	5	2 600 237	107.221	2.844	< 0.0001
	Residual	1 164 063	48	24 251
	Total	14 165 247	53
Logarithmic	Regression	11 745 016	5	2 349 003	46.587	2.844	< 0.0001
	Residual	2 420 231	48	50 421
	Total	14 165 247	53
Polynomial	Regression	13 576 736	19	714 565.1	41.282	2.153	< 0.0001
	Residual	588 511	34	17 309.1
	Total	14 165 247	53

Experimental against predicted corrosion rate from mathematical regression

Artificial neural network considerations

The ANNs were constructed using IPS. Many networks were tested and selected to represent the corrosion rate data. Figure 2 shows the created structure of ANN, while Tables 3 and 4 collected the most important data. Table 3 shows the investigation of each net ANN. The index represents a sole lifelong number assigned to each neural network when it is fashioned. The indices are assigned in chronological order. The profile is the most beneficial summary statistic, packing a great deal of information into a short piece of text. It tells us the network type, the number of input and output variables, the number of layers and the number of neurons in each layer. The arrangement is < type> < inputs>: < layer1>- < layer2>- < layer3>: < outputs>, where the number of layers may vary. For example, the profile MLP 4:4-5-1:1 signifies an MLP with four input variables and one output variable and three layers of 4, 5 and 1 units respectively. Columns Train Perf., Select Perf. and Test Perf. give the performance of the networks on the training, selection and test subsets respectively. It was shown that training sets did not give too much credence to the performance rate, which is often deceptively good (indicating overlearning). Furthermore, avoid using the test set performance to select models, as that overthrows the object of having it (which is to maintain some data not used for training or model selection, so that a composed final valuation of performance can be made). Use the performance measure on the selection subset to distinguish between, and choose between, networks. The meaning of the performance measure depends on the network type. It is the proportion of the prediction to observation standard deviations. Train error, Select error and Test error columns report the error rates on the subsets. The error rate is less directly interpretable than the performance measure, but is of more significance to the training algorithms themselves. Figure 3 shows the predicted corrosion rate against experimental one; the best results are obtained by MLP 4:4-5-1:1. (i.e. index 7). Table 4 shows the sensitivity analysis. It provides some information about the relative significance of the variables used in a neural network. In this analysis, the IPS test how the neural network would handle if each of its input variables was unavailable. The data set is submitted to the network recurrently, with each variable in turn treated as absent, and the subsequent network error is recorded. If a significant variable is canceled in this style, the error will increase an excessive deal; if an unimportant variable is removed, the error will not increase very much. As shown in Table 4, the sensitivity is stated in two rows: the Ratio and the Rank. The basic sensitivity number is the ratio. For each variable, the network is performed as if that variable is ‘unavailable’. Unavailability of a variable used by the model will apparently cause some decline in its performance. The ratio described is the ratio of the error with the variable absent to the ratio with its presence. Important variables have a high ratio, indicating that the network performance fails badly if they are not available. If the ratio is one or lower, then creating the variable ‘unavailable’ either has no effect on the performance of the network, or actually enhances it. The rank lists the variables in direction of importance (i.e. order of descending ratio) and is delivered for convenience in interpreting the sensitivities. In the current work data, it was found that most variables have a sensitive effect on corrosion rate. The same behaviour was observed with the mathematical polynomial model.

Artificial neural network created by Intelligent Problem Solver a Linear Model, b MLP Model, c GRNN Model, d RBF Model (5– 10 – 1), e RBF Model (5 – 13 – 1)

Table 3

Artificial neural network analysis via IPS

Index	Profile	Train Perf.	Select Perf.	Test Perf.	Train error	Select error	Test error	R ²
6	Linear 5:5-1:1	0.6355	0.7551	0.5866	0.1441	0.26022	0.1547	0.7321
7	MLP 4:4-5-1:1	0.6144	0.6259	0.4119	0.1395	0.21377	0.0978	0.8152
8	GRNN 5:5-27-2-1:1	0.6111	0.8798	0.8971	0.00314	0.00696	0.0054	0.6716
9	RBF 5:5-10-1:1	0.5599	0.7932	0.7161	0.00285	0.00607	0.0040	0.7187
10	RBF 5:5-13-1:1	0.4804	0.7713	0.6340	0.00244	0.00591	0.0035	0.7632

Table 4

Sensitivity analysis of ANN via IPS

	X ₁	X ₂	X ₃	X ₄	X ₅
Ratio 6	1.254	1.06	1.107	1.136	1.027
Rank 6	1	5	3	2	4
Ratio 7	1.648	1	1.179	1.174	1.042
Rank 7	1	1	2	3	4
Ratio 8	1.171	1.031	1.051	1.045	1.033
Rank 8	1	5	2	3	4
Ratio 9	1.313	0.985	1.126	1.141	1.031
Rank 9	1	5	3	2	4
Ratio 10	1.427	0.965	1.171	1.223	1.051
Rank 10	1	5	3	2	4

Experimental corrosion rate against predicated by ANN

Optimum mathematical and ANN models

Mathematical and ANN analysis revealed that both methods represent the high temperature corrosion rate data in a powerful and effective manner. Figure 4 compares the mathematical polynomial models with the optimum ANN model. This indicates that both polynomial equations (from mathematical analysis) and MLP 4:4-5-1:1 (from ANN analysis) forecasted the corrosion rate values with higher correlation coefficients.

Experimental corrosion rate against predicated by optimum polynomial and optimum ANN models

Conclusions

The results obtained from weight loss measurements that were taken from the literature indicate that corrosion of steel in high temperature environment decreased with inhibitor concentration increase and increased with temperature and time of exposure. An attempt has been prepared to use mathematical regression, statistical analysis and ANNs to correlate the corrosion processes of steel as a function five operating conditions. For mathematical considerations, all suggested equations represented the corrosion rate data with different correlation coefficients. Polynomial model was the best one. In ANN studies, MLP 4:4-5-1:1 was the better architect.

Footnotes

Acknowledgement

This work was supported by Diyala University, Chemical Engineering Department, which is gratefully acknowledged.

References

Lai

G. Y.

: ‘High-temperature corrosion and materials applications’ 2007

Materials Park, OH

ASM International.

Gao

and Li

: ‘Nano-structured alloy and composite coatings for high temperature applications’ Mater. Res., 2004 7 175–182. doi: 10.1590/S1516-14392004000100023

Karlsson

Åmand

L. -E.

and Liske

: ‘Reducing high-temperature corrosion on high-alloyed stainless steel superheaters by co-combustion of municipal sewage sludge in a fluidised bed boiler’ Fuel 2015 139 482–493. doi: 10.1016/j.fuel.2014.09.007

Khadom

A. A.

Yaro

A. S.

Altaie

A. S.

and Kadhum

A. H.

: ‘Mathematical modeling of corrosion inhibition behavior of low carbon steel in HCl acid’ J. Appl. Sci., 2009 9 2457–2462. doi: 10.3923/jas.2009.2457.2462

Khadom

A. A.

: ‘Modeling of corrosion reaction data in inhibited acid environment, using regressions and artificial neural networks, Korean’ J. Chem. Eng., 2013 30 2197–2204.

Dua

: ‘An artificial neural network approximation based decomposition approach for parameter estimation of system of ordinary differential equations’ Comput. Chem. Eng., 2011 35 545–553. doi: 10.1016/j.compchemeng.2010.06.005

Rashidi

A. M.

: ‘A galvanostatic modeling for preparation of electrodeposited nanocrystalline coatings by control of current density’ J. Mater. Sci. Technol., 2012 28 1071–1076. doi: 10.1016/S1005-0302(12)60175-3

Moral

Aksoy

and Gokcay

C. F.

: ‘Modeling of the activated sludge process by using artificial neural networks with automated architecture screening’ Comput. Chem. Eng., 2008 32 2471–2478. doi: 10.1016/j.compchemeng.2008.01.008

Fahmi

and Cremaschi

: ‘Process synthesis of biodiesel production plant using artificial neural networks as the surrogate models’ Comput. Chem. Eng., 2012 46 105–123. doi: 10.1016/j.compchemeng.2012.06.006

10.

Ahmad

A. L.

Azid

I. A.

Yusof

A. R.

and Seetharamu

K. N.

: ‘Emission control in palm oil mills using artificial neural network and genetic algorithm’ Comput. Chem. Eng., 2004 28 2709–2715. doi: 10.1016/j.compchemeng.2004.07.034

11.

Eisa

M. Y.

: ‘The inhibition of synthetic fuel oil ash under high temperature corrosion’. MSc thesis, Department of Chemical Engineering, College of Engineering, University of Baghdad, Iraq, 2000.

12.

Khadom

A. A.

Yaro

A. S.

Kadum

A. A. H.

AlTaie

A. S.

and Musa

A. Y.

: ‘The effect of temperature and acid concentration on corrosion of low carbon steel in hydrochloric acid media’ Am. J. Appl. Sci., 2009 6 1403–1409. doi: 10.3844/ajassp.2009.1403.1409

13.

Mathur

P. B.

and Vasudevan

: ‘Reaction Rate Studies for the Corrosion of Metals in Acids-I, Iron in Mineral Acids’ Corrosion 1982 38 3. doi: 10.5006/1.3579270

14.

Birbilis

Cavanaugh

M. K.

Sudholz

A. D.

Zhu

S. M.

Easton

M. A.

and Gibson

M. A.

: ‘A combined neural network and mechanistic approach for the prediction of corrosion rate and yield strength of magnesium-rare earth alloys’ Corros. Sci., 2011 53 168. doi: 10.1016/j.corsci.2010.09.013

15.

Platt

J. A.

: ‘A resource-allocating network for function interpolation’ Neural Comput., 1991 3 213–218. doi: 10.1162/neco.1991.3.2.213

16.

Haykin

: ‘Neural networks: a comprehensive foundation’ 1994

New York

Macmillan Publishing.

17.

Lazić

Z. R.

: ‘Design of experiments in chemical engineering’ 2004

Weinheim

Wiley-VCH Verlag GmbH & Co. KGaA.