Prediction of coke quality using adaptive neurofuzzy inference system

Abstract

The production rate, quality and cost of hot metal production through the blast furnace route depend mainly on the quality of the metallurgical coke. Coke reactivity index and coke strength after reaction (CSR) are the most important parameters used for the assessment of the high temperature properties of coke. Many coke plants and blast furnaces around the world use CSR as a specification just as important as cold strength, size and chemistry. The present work aims to fulfil the need for a model that will predict the coke CSR from coal blend characteristics. In this work, the functional relationship between the coal blend properties (ash, volatile matter, average vitrinite reflectance, crucible swelling number, total reactives, vitrinite distribution V ₉–V ₁₃ and basicity index) and the corresponding coke CSR has been mapped using an adaptive neurofuzzy inference system (ANFIS). The ANFIS model is formulated with different sets of coal blend properties as input variables, and singular value decomposition and QR factorisation based techniques have been employed for model reduction. It has been found that the developed ANFIS model predicts the CSR with reasonable accuracy.

Introduction

The hot metal quality and blast furnace operation depend mainly on the quality of raw materials fed into the blast furnace. Among all the raw materials, coke is considered to be the most important as it serves as a fuel, a reducing agent and a supporting material for the burden. It is well known that the role of metallurgical coke as a permeable support cannot be taken over by any other material, even though coke can be substituted by oil, coal and gas as a fuel and reducing agent. Furthermore, feeding high quality coke to a blast furnace will ensure lower coke rate, higher productivity and lower hot metal cost.

Good quality coke is generally made by carbonising coking coals that are of good quality in terms of rank, maceral composition, ash content and ash composition. It is evident that very few individual coals possess all the requisite properties for making a coke of good quality. However, the deficiency of any particular property of a coal can often be compensated by blending it with other compatible coals. In addition, the coke properties are also influenced by the carbonisation conditions such as bulk density, temperature, oven width and coking time. Even though the coke quality is determined by both the carbonisation conditions and the coal blend composition, it has been reported1 that the coal blend composition plays a more predominant role (∼70%) than the carbonisation conditions in fixing the coke properties.

At present, coke quality is widely assessed through the hot strength parameters, coke strength after reaction (CSR) and coke reactivity index (CRI), introduced by Nippon Steel Corporation in the 1970s. In this method, a 200 g coke sample with a size range from −21 mm to +19 mm is heated at 1100°C under 1 atm pressure of CO₂ for 2 h. The percentage weight loss is known as the CRI. The reacted coke is placed in an ‘I’ drum and subjected to 600 revolutions. The material removed from the drum is then screened on a 10 mm square hole sieve. The percentage of material remaining on top of the +10 mm screen is known as the CSR.
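The two indices defined above reduce to simple mass ratios. The following is a minimal sketch, assuming that CSR is expressed relative to the mass of coke left after reaction (the usual convention, but not stated explicitly in the text); the function name and the sample weighings are hypothetical.

```python
def cri_csr(mass_initial_g, mass_after_reaction_g, mass_plus10mm_g):
    """Return (CRI, CSR) in percent from the three weighings of the test."""
    if not 0 < mass_plus10mm_g <= mass_after_reaction_g <= mass_initial_g:
        raise ValueError("expected 0 < +10 mm mass <= reacted mass <= initial mass")
    # CRI: percentage weight loss during the 2 h reaction with CO2.
    cri = 100.0 * (mass_initial_g - mass_after_reaction_g) / mass_initial_g
    # CSR: percentage of reacted coke retained on the 10 mm screen after
    # tumbling (assumed denominator: mass after reaction).
    csr = 100.0 * mass_plus10mm_g / mass_after_reaction_g
    return cri, csr


# Hypothetical weighings, for illustration only.
cri, csr = cri_csr(200.0, 150.0, 90.0)
print(f"CRI = {cri:.1f} %, CSR = {csr:.1f} %")
```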

In order to optimise the coal blend composition and to incorporate a new source of coal into a blend that produces coke of the required quality (CSR/CRI), it is important to have a mathematical model. Given the strong correlation between CSR and CRI, the coke CRI can usually be estimated once the CSR is known.2 Since not all the properties of coal are additive in nature and carbonisation itself is a very complex and highly non-linear phenomenon, most of the existing models are data driven regression models.3–5 In general, these models map coal blend properties to coke strength parameters.

Owing to the emergence of computational power and the development of artificial intelligence based algorithms, knowledge extraction from data has become a distinct possibility. In this connection, the coke CSR has been predicted from coal blend properties using artificial neural networks (ANNs).6–8

However, since ANN models have poor interpretability due to their ‘black box’ structure, they are often merged with the transparent and more interpretable fuzzy logic to form neurofuzzy inference systems. Among the various neurofuzzy models existing in the literature, the adaptive neurofuzzy inference system (ANFIS)9 has been found to possess an excellent ability to learn from the available information.10–13

The adaptive network based fuzzy inference system is a mathematical representation of a fuzzy rule base, which maps the relationship between the input and output variables utilising the excellent learning ability of ANNs. Owing to these advantages, ANFIS based models are presently gaining importance. The ANFIS model has been found successful in predicting the properties of polymer blends made at different operating conditions with fixed blend composition.14 Similarly, Khorami et al.15 employed ANFIS to predict the free swelling index of coal.

In the present work, an ANFIS based structure has been selected to relate the coal blend properties with the coke CSR. The training algorithm used is based on the Levenberg–Marquardt method.16 A schematic representation of the present work is illustrated in Fig. 1.

Figure 1.

Schematic representation of coke property prediction using ANFIS model

Experimental

Materials used

In this work, a total of 13 coals from different sources have been used. These coals are blended in different combinations and compositions in order to formulate 67 coal blends. Each blend is characterised using standard methods for properties such as ash, volatile matter (VM), average vitrinite reflectance Ro, total reactives, total inerts, vitrinite distribution V ₉–V ₁₃, crucible swelling number (CSN) and basicity index (BI).

Carbonisation process

The carbonisation tests have been performed under stamp charging conditions in a 7 kg electrically heated laboratory scale test oven.7 The construction and operation of the 7 kg electrically heated test oven are based on the recommendations of the British Carbonisation Research Association. The operating parameters, like bulk density, oven temperature, moisture, granulometry and carbonisation time, are maintained constant for all the tests. For each experiment, the coke CSR and CRI have been recorded. In this manner, the total number of data sets generated is 67.

Model formulation

Input selection

It is considered that coal properties such as ash content, VM, ash composition, maceral composition and coal rank influence the coke properties. The average vitrinite reflectance and the V ₉–V ₁₃ distribution are incorporated to represent the coal rank. The ash composition is represented by the BI. Maceral composition is considered through total reactives, total inerts and vitrinite distribution. Similarly, CSN captures the caking and swelling properties of the coal for coke making application. The total reactives and total inerts are linearly dependent on each other as their sum is always equal to 100. Hence, the variable total inerts is not included in the model input set. The data range of the coal blend properties generated is tabulated in Table 1.

Table 1.

Minimum and maximum ranges of coal blend properties

	Ash/%	Volatile matter	Average vitrinite reflectance	Total reactives	V ₉–V ₁₃	CSN	Basicity index
Min.	7·73	19·04	0·928	49·02	39	1·5	0·046
Max.	17·52	28·47	1·329	63·52	90	6·5	0·296

Data preprocessing and clustering

All the input and output variables are normalised between 0 and 1 using the ‘min.–max.’ normalisation technique. Outlier data sets have been identified using statistical indices such as mean and standard deviation and removed, which brought the total number of data sets down to 62. The performance of data driven models like ANFIS depends not only on the quality of data but also on the distribution of data between training and test sets. The presence of data covering the entire range of variables in both the training set and the test set further improves the prediction capability of the model. In order to achieve this, the available data are classified into different groups using a clustering technique.17 In the present work, the fuzzy C-means program of the fuzzy toolbox of Matlab 7·0 has been used for clustering. After clustering the entire data set into eight groups, 17 data sets were picked uniformly from the eight groups as a test set, and the remaining 45 constitute the training set.
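The preprocessing steps described above (min.–max. scaling, fuzzy C-means clustering and a cluster-wise train/test split) can be sketched as follows. This is an illustrative NumPy implementation, not the Matlab toolbox routine used in the work; the fuzzifier `m = 2`, the round-robin picking rule, the function names and the synthetic data are all assumptions.

```python
import numpy as np


def min_max_normalise(X):
    """Scale each column of X to the range [0, 1]."""
    lo, hi = X.min(axis=0), X.max(axis=0)
    return (X - lo) / np.where(hi > lo, hi - lo, 1.0)


def fuzzy_c_means(X, n_clusters, m=2.0, n_iter=100, seed=0):
    """Plain fuzzy C-means; returns (centres, membership matrix U)."""
    rng = np.random.default_rng(seed)
    U = rng.random((X.shape[0], n_clusters))
    U /= U.sum(axis=1, keepdims=True)        # memberships sum to 1 per sample
    for _ in range(n_iter):
        W = U ** m
        centres = (W.T @ X) / W.sum(axis=0)[:, None]
        d = np.linalg.norm(X[:, None, :] - centres[None, :, :], axis=2)
        d = np.fmax(d, 1e-12)                # avoid division by zero
        inv = d ** (-2.0 / (m - 1.0))
        U = inv / inv.sum(axis=1, keepdims=True)
    return centres, U


def stratified_split(labels, n_test, seed=0):
    """Draw test indices from the clusters in round-robin order."""
    if n_test > len(labels):
        raise ValueError("n_test exceeds the number of samples")
    rng = np.random.default_rng(seed)
    pools = [list(rng.permutation(np.flatnonzero(labels == k)))
             for k in np.unique(labels)]
    test = []
    while len(test) < n_test:
        for pool in pools:
            if pool and len(test) < n_test:
                test.append(int(pool.pop()))
    train = sorted(set(range(len(labels))) - set(test))
    return train, sorted(test)


# Synthetic stand-in for 62 blends x 7 properties (not the data of this work).
X = np.random.default_rng(1).random((62, 7)) * 10.0 + 5.0
Xn = min_max_normalise(X)
centres, U = fuzzy_c_means(Xn, n_clusters=8)
labels = U.argmax(axis=1)                    # hard label = strongest membership
train_idx, test_idx = stratified_split(labels, n_test=17)
print(len(train_idx), len(test_idx))
```

Drawing the test samples cluster by cluster is one simple way of making both subsets span the full range of the variables.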

Model configuration

In the present work, a first order Sugeno type ANFIS structure has been used to predict the CSR. Figure 2 illustrates the ANFIS model architecture for two fuzzy rules. The numbers of neurons in the input and output layers are fixed by the number of input (i.e. 7) and output (i.e. 1) variables respectively. The number of neurons in the intermediate layers is fixed by the number of fuzzy rules used in the rule base. The structure consists of six layers, in which each node performs a particular function on incoming signals.

Figure 2.

Adaptive neurofuzzy inference system model structure for prediction of CSR

On the basis of earlier studies, the coal blend properties, such as ash, VM, average vitrinite reflectance, total reactives, vitrinite distribution V ₉–V ₁₃, CSN and BI, have been considered as input variables, and the coke CSR has been fixed as an output variable. These input variables are related to the output variable through fuzzy rules.

Fuzzy rules

The following fuzzy rules are framed to relate the input and output variables:

Rule 1: if (Ah is A ₁) and (VM is B ₁) and (Ro is C ₁) and (TR is D ₁) and (VD is E ₁) and (CN is F ₁) and (BI is G ₁), then f ₁ = p ₁ Ah+q ₁ VM+r ₁ Ro+s ₁ TR+t ₁ VD+u ₁ CN+v ₁ BI+z ₁

Rule 2: if (Ah is A ₂) and (VM is B ₂) and (Ro is C ₂) and (TR is D ₂) and (VD is E ₂) and (CN is F ₂) and (BI is G ₂), then f ₂ = p ₂ Ah+q ₂ VM+r ₂ Ro+s ₂ TR+t ₂ VD+u ₂ CN+v ₂ BI+z ₂

Rule 3: if (Ah is A ₃) and (VM is B ₃) and (Ro is C ₃) and (TR is D ₃) and (VD is E ₃) and (CN is F ₃) and (BI is G ₃), then f ₃ = p ₃ Ah+q ₃ VM+r ₃ Ro+s ₃ TR+t ₃ VD+u ₃ CN+v ₃ BI+z ₃

Rule 4: if (Ah is A ₄) and (VM is B ₄) and (Ro is C ₄) and (TR is D ₄) and (VD is E ₄) and (CN is F ₄) and (BI is G ₄), then f ₄ = p ₄ Ah+q ₄ VM+r ₄ Ro+s ₄ TR+t ₄ VD+u ₄ CN+v ₄ BI+z ₄

Rule 5: if (Ah is A ₅) and (VM is B ₅) and (Ro is C ₅) and (TR is D ₅) and (VD is E ₅) and (CN is F ₅) and (BI is G ₅), then f ₅ = p ₅ Ah+q ₅ VM+r ₅ Ro+s ₅ TR+t ₅ VD+u ₅ CN+v ₅ BI+z ₅

where Ah, VM, Ro, TR, VD, CN and BI represent percentage ash content, VM, average vitrinite reflectance, total reactives, vitrinite distribution V ₉–V ₁₃, CSN and BI respectively. A_i, B_i, C_i, D_i, E_i, F_i and G_i are the fuzzy sets of the ith fuzzy rule, each characterised by a membership function of sigmoidal type. In addition, p, q, r, s, t, u, v and z are termed consequent parameters.

The computational procedure of each layer for two inputs (x, y) and two fuzzy rules is explained as follows.

Layer 1

Calculation of membership value using sigmoid membership function (1) where x is the input variable, and A is the fuzzy set. a and b are called antecedent parameters.

Layer 2

Calculation of the firing strength of a fuzzy rule (2) where M is the number of fuzzy rules.

Layer 3

Calculation of the normalised firing strength of a fuzzy rule (3)

Layer 4

(4) where p, q and r are called consequent parameters.

Layer 5

Calculation of overall output (5)

The antecedent and consequent parameters of the ANFIS structure have been tuned using the Levenberg–Marquardt method, which is an effective non-linear least squares approach to non-linear regression problems, including neural networks and fuzzy modelling.
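The layer by layer computation for two inputs and two fuzzy rules can be sketched as below. The sketch assumes a common parametrisation of the sigmoidal membership function, 1/(1 + exp(−a(x − b))), and the product as the ‘and’ operator, as in the standard ANFIS formulation; neither detail is confirmed by the text, and all parameter values are hypothetical.

```python
import math


def sigmoid_mf(x, a, b):
    """Sigmoidal membership value; a (slope) and b (centre) are the
    antecedent parameters. This parametrisation is an assumption."""
    return 1.0 / (1.0 + math.exp(-a * (x - b)))


def anfis_forward(x, y, antecedents, consequents):
    """First order Sugeno ANFIS output for two inputs.

    antecedents[i] = ((a_x, b_x), (a_y, b_y)) for rule i
    consequents[i] = (p, q, r), so that f_i = p*x + q*y + r
    """
    # Layer 1: membership of each input in the fuzzy sets of each rule.
    mu = [(sigmoid_mf(x, *ax), sigmoid_mf(y, *ay)) for ax, ay in antecedents]
    # Layer 2: firing strength of each rule (product, assumed 'and' operator).
    w = [mx * my for mx, my in mu]
    # Layer 3: normalised firing strengths.
    total = sum(w)
    w_bar = [wi / total for wi in w]
    # Layer 4: rule consequents weighted by normalised firing strength.
    weighted = [wb * (p * x + q * y + r)
                for wb, (p, q, r) in zip(w_bar, consequents)]
    # Layer 5: overall output is the sum of the weighted consequents.
    return sum(weighted), w_bar


# Hypothetical parameters: rule 1 fires for low inputs, rule 2 for high inputs.
antecedents = [((-8.0, 0.5), (-8.0, 0.5)), ((8.0, 0.5), (8.0, 0.5))]
consequents = [(0.2, 0.1, 0.3), (0.5, -0.2, 0.6)]
output, w_bar = anfis_forward(0.5, 0.5, antecedents, consequents)
print(output, w_bar)
```

With seven inputs, as in the rules listed earlier, the only changes are that each rule multiplies seven membership values and each consequent carries eight parameters.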

Model reduction using singular value decomposition (SVD)

An increase in the number of fuzzy rules in the rule base increases the model complexity, and the model becomes more specific to the training data set. As a result, the model becomes incapable of predicting the untrained (test) data set due to poor generalisation. In this work, the SVD technique18, 19 has been employed to select the optimum number of fuzzy rules from a given rule base. Subsequently, a QR factorisation algorithm with column pivoting has been applied to select the most important fuzzy rules, i.e. those that contribute most to the model prediction.
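A minimal sketch of an SVD plus pivoted QR rule selection is given below. It follows the general idea described above but is not necessarily the exact procedure of refs. 18 and 19: the firing strength matrix is decomposed, the leading right singular vectors are retained, and a greedy column pivoted orthogonalisation (the pivoting rule used in QR with column pivoting) ranks the rules. The function names and the synthetic firing matrix, in which rules 3 and 4 (0-based) nearly duplicate rules 0 and 1, are hypothetical.

```python
import numpy as np


def pivoted_column_order(A):
    """Return column indices of A in the order chosen by column pivoting."""
    R = np.array(A, dtype=float)
    chosen = np.zeros(R.shape[1], dtype=bool)
    order = []
    for _ in range(min(R.shape)):
        norms = np.linalg.norm(R, axis=0)
        norms[chosen] = -1.0                 # never pick a column twice
        j = int(np.argmax(norms))
        if norms[j] <= 1e-12:                # nothing left to select
            break
        order.append(j)
        chosen[j] = True
        q = R[:, j] / norms[j]
        R = R - np.outer(q, q @ R)           # remove the selected direction
    return order


def select_rules(firing, n_keep):
    """Pick n_keep rules from an (n_samples x n_rules) firing matrix."""
    _, s, Vt = np.linalg.svd(firing, full_matrices=False)
    # The first n_keep rows of Vt span the dominant subspace; pivoting over
    # their columns ranks the individual rules within that subspace.
    order = pivoted_column_order(Vt[:n_keep, :])
    return s, sorted(order[:n_keep])


# Synthetic firing strengths: 45 samples, 5 rules, two near-duplicate rules.
rng = np.random.default_rng(0)
base = rng.random((45, 3))
noise = 1e-3 * rng.standard_normal((45, 2))
firing = np.column_stack([base, base[:, :2] + noise])
singular_values, kept = select_rules(firing, n_keep=3)
print(np.round(singular_values, 4))
print("retained rules (0-based):", kept)
```

In this synthetic case the two smallest singular values are close to zero, which signals two redundant rules, and the pivoting step keeps only one rule from each near-duplicate pair.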

Results and discussion

The first part of this section involves the multivariable linear regression of the coal blend properties with coke CSR. The next section shows the ANFIS model performance in CSR prediction before and after model reduction using SVD. In addition, simulation experiments are conducted to study the effect of various combinations and numbers of input variables (i.e. coal blend properties) on the coke CSR using ANFIS. Statistical indices like correlation coefficient R ², root mean square error (RMSE) and standard deviation have been used to quantify the model performance.
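For reference, the three statistical indices can be computed as in the sketch below. It assumes that R ² denotes the square of Pearson's correlation coefficient between measured and predicted CSR and that the standard deviation refers to the residuals (measured minus predicted); the text does not define either explicitly. The CSR values are hypothetical.

```python
import math


def rmse(measured, predicted):
    """Root mean square error between two equally long sequences."""
    n = len(measured)
    return math.sqrt(sum((m - p) ** 2 for m, p in zip(measured, predicted)) / n)


def r_squared(measured, predicted):
    """Square of Pearson's correlation coefficient (assumed definition)."""
    n = len(measured)
    mean_m = sum(measured) / n
    mean_p = sum(predicted) / n
    cov = sum((m - mean_m) * (p - mean_p) for m, p in zip(measured, predicted))
    var_m = sum((m - mean_m) ** 2 for m in measured)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov ** 2 / (var_m * var_p)


def residual_mean_std(measured, predicted):
    """Mean and population standard deviation of measured - predicted."""
    res = [m - p for m, p in zip(measured, predicted)]
    mean = sum(res) / len(res)
    std = math.sqrt(sum((r - mean) ** 2 for r in res) / len(res))
    return mean, std


# Hypothetical measured and predicted CSR values, for illustration only.
measured = [60.0, 62.0, 58.0, 65.0, 55.0]
predicted = [61.0, 61.0, 59.0, 63.0, 56.0]
print(rmse(measured, predicted), r_squared(measured, predicted))
print(residual_mean_std(measured, predicted))
```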

Multivariable linear regression

In this section, the seven input variables (ash, VM, Ro, TR, V ₉–V ₁₃, CSN and BI) have been correlated with coke CSR using the Levenberg–Marquardt method. The normalised data sets have been used in order to study the extent of contribution of each input variable in predicting the CSR. The resultant linear relation between the input and output variables is shown in the following equation (6)

From the magnitude of the coefficient of each input variable in equation (6), the variables are sorted in descending order of contribution to the modelled coke CSR, as tabulated in Table 2. The degree of influence of the input variables on the CSR in descending order is VM>CSN>(V ₉–V ₁₃)>ash>Ro>BI>TR. Furthermore, it can be inferred from equation (6) that the coke CSR decreases when VM, average vitrinite reflectance or BI increases. In contrast, the ash content shows a positive contribution to CSR. This may be because, within the range of data included in this work, CSR happens to increase with ash content. It is also true that a minimum range of ash content is essential for obtaining a better CSR.

Table 2.

Input variables sorted in descending order of contribution towards CSR prediction

Input variable	Coefficient of linear relation with CSR
Volatile matter/%	−0·255
Crucible swelling number	0·231
V ₉–V ₁₃ distribution	0·188
Ash content/%	0·176
Average vitrinite reflectance	−0·069
Basicity index	−0·057
Total reactives	0·045
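The ordering in Table 2 follows directly from the absolute values of the coefficients, as the short sketch below shows. The coefficients are copied from Table 2; the abbreviated variable names are used only for illustration.

```python
# Coefficients of the normalised linear relation, taken from Table 2.
coefficients = {
    "VM": -0.255,
    "CSN": 0.231,
    "V9-V13": 0.188,
    "Ash": 0.176,
    "Ro": -0.069,
    "BI": -0.057,
    "TR": 0.045,
}

# Rank the inputs by the magnitude of their coefficient (largest first).
ranking = sorted(coefficients, key=lambda k: abs(coefficients[k]), reverse=True)
# Inputs whose increase lowers the modelled CSR (negative coefficient).
lowering = [k for k in ranking if coefficients[k] < 0]

print(" > ".join(ranking))   # VM > CSN > V9-V13 > Ash > Ro > BI > TR
print(lowering)              # ['VM', 'Ro', 'BI']
```

Because the data were normalised to the same range before regression, the coefficient magnitudes can be compared across variables in this way.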

Furthermore, equation (7) shows the correlation of CSR with the input variables for the original (i.e. non-normalised) data set (7) where R ² = 0·8735 and RMSE = 3·6897.

The scatter plot in Fig. 3 presents the comparison of model CSR estimated from equation (7) against the experimentally measured CSR. The normal distribution of difference between experimental CSR and predicted CSR from equation (7) is shown in Fig. 4.

Figure 3.

Comparison of CSR measured against CSR estimated through multivariable regression equation (7)

Figure 4.

Normal distribution of difference between experimentally measured CSR and estimated CSR from multivariable regression analysis

Adaptive neurofuzzy inference system

The ANFIS model is inherently capable of mapping the highly non-linear functionality between input and output variables. However, special care needs to be taken in fixing the number of fuzzy rules and the number of input variables. First, the SVD and QR factorisation methods have been employed to remove the redundant fuzzy rules and to retain the minimum number of fuzzy rules that contribute most to estimating the output. Moreover, it is also important to have as few input variables as possible in the model in order to reduce the model complexity and to increase the generalisation capability of the model. In this connection, based on the multivariable regression results (see Table 2) and intuition, three different input sets [model a: VM, CSN, V ₉–V ₁₃ distribution and ash content (i.e. the four most influential input variables); model b: VM, CSN, V ₉–V ₁₃ distribution, ash content and Ro (i.e. the five most influential input variables); and model c: all seven coal blend properties] have been formulated to predict coke CSR (Table 3). In this work, the sigmoidal function has been chosen as the membership function after studying the performance of the model with bell shaped and Gaussian membership functions.

Table 3.

Input sets for ANFIS model prediction

Model	Input variables	Output Variable
Model a	Volatile matter, crucible swelling number, V ₉–V ₁₃ distribution and Ash content	Coke CSR
Model b	Volatile matter, crucible swelling number, V ₉–V ₁₃ distribution, Ash content and Ro
Model c	All seven coal blend properties

Selection of fuzzy rules

In this section, model a is trained and tested with five fuzzy rules. Subsequently, the singular values of the firing strength matrix are estimated using the SVD method, as tabulated in Table 4. From the singular values, it is identified that three rules alone can constitute a model with significant accuracy. Since the SVD method always gives the singular values in descending order and therefore does not identify the rules themselves, the QR method is further employed to locate the best three rules in the rule base. The permutation matrix generated using the QR method is shown in Table 5. From the matrix, it is observed that fuzzy rules 2, 3 and 4 occupy the first three places. Then, the model is formulated with this reduced number of rules and further fine tuned for best test set performance. The performance of model a in the prediction of CSR before and after model reduction is shown in Table 6. It is interesting to note from Table 6 that after model reduction, the test set performance (in terms of RMSE and R ²) improves, whereas the training set performance declines compared to the original rule base. This clearly shows that the generalisation capability of the model improves after model reduction. A similar model reduction procedure is also applied to models b and c.

Table 4.

Singular values of firing strength matrix for model a

Fuzzy rules	Singular values of the firing strength matrix
1	3·5808
2	0·3218
3	0·1248
4	0·1089
5	0·0147

Table 5.

Permutation matrix formulated using QR method for model a

Rule 1	Rule 2	Rule 3	Rule 4	Rule 5
0	1	0	0	0
0	0	0	1	0
0	0	1	0	0
1	0	0	0	0
0	0	0	0	1
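The retained rules can be read off the permutation matrix of Table 5 as sketched below. The sketch assumes a row-wise reading, in which the kth row marks the rule placed at the kth position; this convention is inferred from the statement that rules 2, 3 and 4 occupy the first three places.

```python
# Permutation matrix from Table 5 (columns correspond to rules 1 to 5).
P = [
    [0, 1, 0, 0, 0],
    [0, 0, 0, 1, 0],
    [0, 0, 1, 0, 0],
    [1, 0, 0, 0, 0],
    [0, 0, 0, 0, 1],
]

# Assumption: row k marks the rule placed at position k.
order = [row.index(1) + 1 for row in P]   # -> [2, 4, 3, 1, 5]
retained = sorted(order[:3])              # -> [2, 3, 4]
print(order, retained)
```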

Table 6.

Effect of model reduction on model performance for CSR prediction

Model reduction	Fuzzy rules	Epoch	Training set RMSE	Training set R ²	Test set RMSE	Test set R ²
Before	5	100	3·0583	0·8397	2·3164	0·9218
After	3	7	3·1720	0·8266	2·1077	0·9349

Prediction of CSR

After applying model reduction, the ANFIS performance in the prediction of CSR for models a–c is presented in Fig. 5, and the corresponding statistical performance indices are listed in Table 7. The training of ANFIS is stopped at the epoch at which the best test set performance is achieved. The test set RMSEs for models a–c are 2·1077, 2·6902 and 2·3653 respectively, and the corresponding R ² values are 0·9349, 0·8896 and 0·9222. It is observed from the results that model a, which includes only four inputs, performs better than model c, which includes all seven input variables. It can be inferred that, owing to its smaller number of input variables and hence less complex structure, model a performs better than models b and c. However, the same trend is not observed when comparing models b and c: model b, with five inputs, performs worse than model c, with seven. This may be because the average reflectance Ro provides the same information as the vitrinite distribution V ₉–V ₁₃ and hence becomes a redundant input variable. Therefore, the addition of Ro only adds to the model complexity without increasing the prediction capability of the model.

Figure 5.

Prediction of CSR for (a) model a, (b) model b and (c) model c

Table 7.

Effect of input variables on model performance for CSR prediction

Model	Fuzzy rules	Epoch	Training set RMSE	Training set R ²	Test set RMSE	Test set R ²
a	3	7	3·1720	0·8266	2·1077	0·9349
b	3	98	3·0175	0·8443	2·6902	0·8896
c	3	6	3·0752	0·8381	2·3653	0·9222

The normal distribution of difference between experimental and predicted CSR from the ANFIS model is shown in Fig. 6, and the corresponding statistical parameters, such as mean and standard deviation, are shown in Table 8. It can be concluded from the results that model a performs better than models b and c. Figure 7 shows the graphical comparison of ANFIS model performance in CSR prediction among models a–c.

Figure 6.

Normal distribution of difference between experimentally measured CSR and predicted CSR from ANFIS

Figure 7.

Comparison between predictions of CSR from models a–c

Table 8.

Statistical parameters for normal distribution of difference between experimentally measured CSR and predicted CSR by ANFIS

Model	Mean	Standard deviation
a	−0·6562	2·0030
b	−0·6228	2·6171
c	−0·7512	2·2428
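The values in Table 8 can be cross-checked against the test set RMSEs in Table 7, since for any set of residuals RMSE² = mean² + (standard deviation)², provided that the population form of the standard deviation is used. The sketch below performs this check with the tabulated values; the close agreement suggests, although the text does not state it, that Table 8 summarises the test set residuals.

```python
import math

# (mean, standard deviation) of the residuals, from Table 8.
table8 = {"a": (-0.6562, 2.0030), "b": (-0.6228, 2.6171), "c": (-0.7512, 2.2428)}
# Test set RMSE, from Table 7.
table7_test_rmse = {"a": 2.1077, "b": 2.6902, "c": 2.3653}

# RMSE recomputed from the mean and standard deviation of the residuals.
recomputed = {
    model: math.sqrt(mean ** 2 + std ** 2)
    for model, (mean, std) in table8.items()
}

for model in sorted(recomputed):
    print(model, round(recomputed[model], 4), table7_test_rmse[model])
```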

Conclusions

The developed ANFIS model predicts the coke CSR from the coal blend properties with reasonable accuracy when compared to the multivariable linear regression method. The ANFIS model with VM, CSN, V ₉–V ₁₃ distribution and ash content as input variables (i.e. model a) exhibits better prediction capability than models b and c. This model can be utilised in selecting different types of coal sources for coal blending in coke making applications. In addition, the optimisation of coal blend properties for maximising the coke CSR can be achieved with this model. However, the application of the developed model is limited to coals from the same geographical origin and to similar carbonisation conditions. Efforts can be made to extend the model to include various other coal properties and different carbonisation conditions, such as bulk density, oven temperature and carbonisation time, in order to improve its predictive capability.

References

1. Nakamura, Togino and Tateoka: ‘Coal, coke and blast furnace’, 1–18; 1977, London, The Metals Society.

2. Diez, Alvarez, Sirgado and Marsh: ISIJ Int., 1991, 31, 449–457.

3. Zhang, Feng and Shi: Fuel Process. Technol., 2004, 86, 1–11.

4. Diez, Alvarez and Barriocanal: Int. J. Coal Geol., 2002, 50, 389–412.

5. Alvarez, Diez, Barriocanal, Diaz-Faes and Cimadevilla JLG: Fuel, 2007, 86, 2159–2166.

6. Bhattacharjee, Dash and Das: Tata Search, 1999, 163–166.

7. Dash, Guha, Chakraborty, Krishnan and Banerjee: Tata Search, 2005, 89–100.

8. Maharana, Biswas, Ganguly and Kumar: World Acad. Sci. Eng. Technol., 2010, 69, 652–656.

9. Jang JSR: IEEE Trans. Syst. Man Cybern., 1993, 23, 665–685.

10. Tsai and Wang: J. Mater. Process. Technol., 2001, 117, 111–124.

11. J. Mater. Process. Technol., 2003, 142, 665–675.

12. Yeh: J. Mater. Process. Technol., 2006, 173, 136–144.

13. Khalifehzadeh, Forouzan, Arami and Sadrnezhaad: Comput. Mater. Sci., 2007, 40, 359–365.

14. Suresh, Sureshkumar, Kargupta, Ganguly, Naskar and Nando: Plast. Rubber Compos., 2009, 38, 173–182.

15. Khorami, Chelgani, Hower and Jorjani: Int. J. Coal Geol., 2011, 85, 65–71.

16. Jang JSR and Mizutani: Proc. Biennial Conf. of the North American Fuzzy Information Processing Society, Berkeley, CA, USA, June 1996, IEEE, 87–91.

17. Dennis: IEE Proc. Vis. Image Signal Process., 1994, 141, 413–421.

18. Yen and Langari: ‘Fuzzy logic: intelligence, control and information’; 2003, Singapore, Pearson Education.

19. Prasad and Bequette: Comput. Chem. Eng., 2003, 27, 1741–1754.