Abstract
The Bayesian network (BN) structure learning from the observational data has been proved to be a NP-hard problem. The expert knowledge is beneficial to determine the BN structure, especially when the data are scarce and the related variables are huge in the researched domain. In this paper, we propose a new BN structure learning method by integrating expert knowledge. On the one hand, to improve the performance of expert knowledge usage, the intuitionistic fuzzy set (IFS) is introduced to express and integrate the expert knowledge. The determination of BN priori structure is transformed into the group decision making problem. On the other hand, the improved Bayesian information criterion score function and the Genetic Algorithm search algorithm are used to obtain the most suitable structure under the constraints from the priori structure. Some experiments demonstrate the validity of proposed scheme and compare the performance with the existing research results. The obtained BN structure owns better performance. The more the quantity of expert knowledge is, the better the performance of BN structure learning would be. Finally, the proposed method is applied to the thickening process of gold hydrometallurgy to solve the practical problem.
Introduction
Bayesian network (BN) is an effective intelligent method to represent the dependent relationship and joint probability distribution among variables in the form of a directed acyclic graph (DAG), which has been widely applied to all kinds of fields, such as, genomics [2], industry [13], risk analysis [4], manufacturing [29], ecology social sciences [26] and so on. BN is the uncertain knowledge representing and reasoning model, which consists of two components: structure and parameter. In this paper, we focus on the structure learning.
Automatic structure learning of BNs from observational data has been developed in a great deal of researches, which mainly include two types: constraint-based methods and score-based methods. The constraint-based methods identify the dependent and independent relationships among variables by conditional independence tests. The score-based methods use a search algorithm to find the structure with the highest score based on the score function. In this paper, we mainly focus on the latter one. For the score-based methods, first of all, the score function needs be determined. Various score functions have been proposed including Bayesian Dirichlet equivalence [2], minimum description length [21, 36] and Bayesian information criterion (BIC) [30]. Furthermore, a search algorithm needs to be specified to find the BN structure with the highest score. When the number of random variables increases in the problem domain, the number of possible structures will grow exponentially. When the sample size is small, it is more difficult to determine the structure. It has been proved to be a NP-hard problem for the determination of structure which best adapts to the observational data. A variety of search algorithms have been proposed, such as heuristic search techniques [1, 11], evolutionary algorithms [16, 19, 20]. Among the search algorithms, as the global optimization algorithms, the evolutionary algorithms own strengths to determine the optimal structure.
For the BN structure learning, the introduction of expert knowledge is beneficial to determine the relationships among the variables. Especially when the data are scarce and the related variables are huge in the researched domain, the introduction of expert knowledge provides an effective way to reduce the inherent uncertainty of structure. Under the restrictions from the expert knowledge, the search spaces of possible structures are reduced greatly. Only the unknown relationships need to be learned, and the best BN structure can be obtained as soon as possible. This is an excellent approach to boost the performance of structure learning methods. The learned results will fit the practical domain better and reduce the search cost. How to integrate the expert knowledge and the automatic learning algorithm is an important problem to solve.
In this paper, we propose a new method of BN structure learning by integrating expert knowledge and observational data based on the score-based method. This method includes two parts: the determination of BN priori structure based on the expert knowledge and the structure searching by the observational data under the constraints from the priori structure. For the former one, the intuitionistic fuzzy set (IFS) method is applied to express and integrate the opinions of group experts. The BN priori structure can be determined by the group decision making (GDM) based on IFS. For the latter one, the score-based method is used to search for the most suitable structure under the constraints from the priori structure. For the score-based method, the BIC score function and Genetic Algorithm (GA) are improved to satisfy the expert knowledge in this paper.
The contribution of this paper reflects on two aspects. On the one hand, in the BN structure learning framework, the expert knowledge is introduced to increase the performance of structure learning. To improve the performance of expert knowledge usage, the IFS is introduced to express and integrate the expert knowledge. The determination of BN priori structure is dealt with in the intuitionistic fuzzy environment, which is transformed into the GDM problem. On the other hand, the score-based method is used to search for the most suitable structure under the constraints from the priori structure. In this process, the BIC score function and the GA are improved to integrate the expert knowledge, which provides a new angle to improve the performance of BN structure learning based on GA.
The remaining sections of this paper are organized as follows. The related work is discussed in Section 2. Section 3 discusses the determination of BN priori structure. The corresponding decision scheme is analyzed in detail. In Section 4, the score-based method is used to search for the most suitable structure under the constraints from the priori structure. The BIC score function and GA are improved to integrate the expert knowledge. In Section 5, some experiments are provided to demonstrate the validity of proposed scheme and compare the performance with the existing research results. The influence of quantity of expert knowledge also is discussed. The proposed method is applied to the thickening process of gold hydrometallurgy to solve the practical problem. Finally, the conclusions are presented in Section 6.
Related work
Many researches have been proposed to integrate the expert knowledge and the automatic learning algorithm [6, 7, 12, 22, 25]. In the paper [12], the structural restrictions can be represented as the following forms: the existence or absence of arcs and the causal ordering restrictions. If the expert holds the view that the presence possibility of an edge is high, the probability of BN structure including this edge is high. The priori probability distributions of BN structure can be obtained from expert knowledge in a certain degree. The paper [25] proposes an interactive approach to integrate domain/expert knowledge. The information from the data learning provides the support for obtaining the domain/expert knowledge. The active interaction plays its role at the different stages of learning process. However, the above researches are not enough, and two problems need to be discussed further. On the one hand, in the BN structure learning environment, the problem of extraction, expression and integration of expert knowledge needs to be improved. In the specific researched background, when the expert knowledge on some relationships among the variables is relatively obvious, the extraction of expert knowledge is easy. However, the expert knowledge is ambiguous, imprecise and subjective in nature. When the specific relationship is not expressed, the extraction of expert knowledge changes to be difficult. As the strengths and knowledge degrees of every expert are different, when the group experts express their opinions on the same problem, the determination of relationship between two variables (i.e. two nodes in the BN) is more difficult. Therefore, it is necessary to search for the method to obtain more reasonable relationship between two variables by integrating the opinions of group experts. The obtained relationship will be more explicit and reliable. In such cases, it is not proper to use the crisp numbers to express the expert knowledge. The IFS theory proposed by Atanassov [3] can express and integrate the expert knowledge in a proper way. As a generalization of FS theory, it can deal with vagueness, ambiguity and hesitation. Except the membership function, the IFS are associated with the non-membership function and the hesitancy function. The hesitancy degree of IFS can express the ignorance, i.e. the lack of knowledge for the membership degree. In the process of decision making or evaluation, the hesitancy degree plays an important role when the experts cannot express their opinions in an accurate way. On the other hand, the way of integrating expert knowledge and observational data for the BN structure learning methods needs to be researched further. Taking the score-based methods using the GA search method as the example, GA is the common global search method, which is effective to search for the most suitable BN structure from the population by the score function and evolution operators. The BN structure learning method based on the GA has been developed in the paper [14]. It uses GA as the search method and analyzes the behaviour of algorithm by the different parameters. Since then, many researches have been proposed to improve the BN structure learning method based on the GA from the different angles [8, 18, 32]. A cooperative coevolutionary GA is proposed to learn BN structure in the paper [8]. The paper [32] proposes a new hybrid BN structure learning algorithm by reducing the search space using mutual dependencies and applying the GA to search over the reduced space. The paper [18] proposes a chain-model GA approach to reduce the complexity of BN structure learning. However, few researches are developed to improve the BN structure learning methods based on the GA from the angle of integrating the expert knowledge. Therefore, how to integrate the expert knowledge and GA to improve the performance of BN structure learning is an important problem to solve. In this process, two problems need to be considered: the determination of initial population based on the expert knowledge; the improvement of fitness function satisfying the constraints from the expert knowledge.
BN priori structure determination
Problem description
For the specific problem of complex industrial environment, for example, when monitoring and diagnosing whether an abnormal event appears or not, or predicting some key parameters related with the performance of system, the relevant variables are limited and the expert knowledge about the relationships is available. It makes the collection and integration of expert knowledge possible. The extraction of expert knowledge can be realized by inquiring the experts about the relationship between two variables. The specific process is as follows. First of all, the related variables need to be determined as the nodes of BN in the problem domain, and even the variables can be allocated at different levels of causality. The knowledge about the relationships among the nodes needs to be collected from the experts. Taking the relationship between two nodes
In a word, the four conditions are equivalent to four alternatives, and the group experts express their opinions on the four alternatives. The most suitable relationship between two nodes needs to be determined by integrating the opinions from the group experts. Therefore, the determination of BN priori structure based on the expert knowledge can be transformed into the GDM problem to solve based on IFS. A GDM problem can be concisely expressed in the matrix format as follow:
which
To select the most appropriate relationships among the BN nodes, the scheme for the BN priori structure determination is described by the following framework. Taking the relationship between two nodes as the example, in Fig. 1, for the step 1, the group expert knowledge is obtained in the form of intuitionistic fuzzy decision matrix shown in Eq. (1).
The scheme for the BN priori structure determination.
For the step 2, we use the knowledge measure [17] to determine the weight of every expert. The knowledge measure of every expert can be obtained by the following equation
The weights of experts are determined by following equation
For the step 3, the aggregation operator from the Dempster-Shafer (D-S) theory is chosen to aggregate the opinions from different experts, which is shown in Eq. (4) [34].
Let
The hesitancy degree of
Before aggregation operation, the IFVs need be handled by the discounting operation. We use the weights of experts as the discount coefficient. The specific aggregation process can refer to the related research results [10, 34].
For the step 4, the technique for order preference by similarity to ideal solution (TOPSIS) method is used to select the most suitable relationships among the nodes. Firstly, the distances from the alternative relationship to the ideal solutions should be calculated. The ideal solutions include the positive-ideal solution and negative-ideal solution, which are represented as
The relative closeness coefficient
where
Repeating the above steps, the BN priori structure can be obtained, which is represented by matrix
where
For the determination of matrix
If the If the If the If the If
Improved score function
In this paper, we use the BIC as the score function which is proposed in the paper [28]. When the expert knowledge is available, the BIC score function should be improved to satisfy it. The traditional BIC score function consists of two parts, an accuracy term and a complexity term, which is shown as follow
where
In the score-based method, the ways of producing new structures include adding arcs, deleting arcs and reversing arcs. The reversing arcs is equivalent to the process that adding arcs after deleting arcs. Therefore, the ways can be summarized into two kinds: adding arcs and deleting arcs. Taking adding the arc
The criterion which the IBIC needs to satisfy
The transform operation needs to be carried out to satisfy the criterion in Table 1 for the IBIC. For convenience of calculation, considering that the BIC score is negative, when
The following results can be obtained:
When When When
The IBIC can be obtained by the way that the BIC score multiplies the corresponding transform coefficient
The GA is a heuristic search method, which is inspired by the evolution theory. By simulating the natural evolution process, the selection, crossover and mutation operators are proposed. Using these operators, the global optimization problem is solved by searching the best individual from the search space until a termination criterion is satisfied. The fitness function needs be determined to measure the quality of individual. For the BN structure learning, the individuals refer to the alternative BN structures. The IBIC score function in Section 4.1 is used as the fitness function. The search process based on GA can be described by the following figure.
The flow diagram of BN structure learning based on GA.
Because of the introduction of expert knowledge, some parts of flow diagram in Fig. 2 are different from the traditional BN structure learning based on GA [14]. Firstly, the BN priori structure
where
A problem needs to be focused. When
In addition, the relative closeness coefficient
The element can be obtained by the following criterion: when
It can obtain that the bigger the value of
where
For the specific domain problem, especially when the researched process can be divided into sub-processes, the relevant variables are relatively small and the collection of expert knowledge is relatively easy [15, 23, 24, 35]. In the paper [23], for the abnormal conditions in the thickening process of gold hydrometallurgy, the established BN for the thickening sub-process only includes the relevant variables. The paper [24] proposes the diagnostic BN based on operation procedures, the variables are divided into the different layers. Above BNs are all not large networks. Therefore, to solve the similar small BN construction and verify the effectiveness of proposed approach, in this paper, the cancer and the Asia networks are chosen to verify the proposed method. They are classic BNs, which have been widely used for comparative purposes in BN structure learning [18, 20, 32]. Based on the proposed method, the BN structure is determined by the datasets generated from the true BNs. The performance is evaluated by comparing the learned BNs and the true ones. This paper improves the performance of score-based method based on GA by integrating the expert knowledge. Therefore, in the following examples, at first, the proposed method is verified and compared with the traditional GA used in the BN structure learning [14, 27], and the influence of expert knowledge is shown by the Examples 1–3. What’s more, in the Example 4, the proposed method is compared with the existing research results. In the end, the proposed method is applied to the thickening process of gold hydrometallurgy in the Example 5. The established BN is analyzed and compared with the results in the paper [23].
The structure of cancer network.
From the matrix
The termination criterion is that the algorithm reaches to the maximum generation, or the quality of population doesn’t change for the successive generations. From the table, we can obtain that the algorithm is convergent before the generation reaches to the set maximum. Comparing the network structures, we can find that the obtained network structure by the proposed method is much closer to the true cancer network than the traditional method that no expert knowledge is used. Therefore, when integrating the expert knowledge into the score-based structure learning method based on GA, the obtained BN structure owns the better performance. This proofs that the proposed method is valid to increase the performance of BN structure learning.
The results of structure learning by integrating the different quantities of expert knowledge for the cancer network
From the results in Table 3, we can conclude that if the collected quantity of expert knowledge increases, the performance of structure learning will improve further. Therefore, the expert knowledge plays the important role in the improvement of performance for the BN structure learning.
The results of structure learning by integrating the different expert knowledge for the Asia network
The structure of Asia network.
From the results in Tables 4 and 5, the same conclusions can be obtained with the Examples 1 and 2. We can find that when the BN nodes increase and the structure is more complex, the difference of learned structure with the true structure will become bigger for the traditional method without expert knowledge. It is very difficult to search for the suitable structure. However, when the expert knowledge for the relationships among the nodes is available, the performance of structure learning is increased gradually. The more the nodes in the BN are, the greater the role of expert knowledge is.
The running results of different algorithms for the Asia network
Except the running result of proposed method, the else running results of different algorithms in Table 6 are from the paper [32]. The specific simulation process can refer to the section three in the paper [32]. By the comparison in Table 6, it can conclude that the score of proposed method is better than the methods in the paper [32].
In the paper [33], the method is carried out to learn the Asia network structure under the different data sizes 200, 500 and 800. To compare our method with the paper [33], similarly, the proposed method in this paper is used to learn the Asia network structure under the different data sizes. The obtained optimal Asia network structures under the different data sizes are shown in Fig. 5. It can obtain that the proposed method can obtain good structure learning result when the data size is 500. However, the method in the paper [33] can obtain good structure learning result when the data size is 800. The proposed method in this paper owns better performance than the method in the paper [33].
The obtained optimal Asia network structures under the different data sizes (a) data size is 200; (b) data size is 500.
(a) The learned BN structure by the proposed method. (b) The established BN in the paper [23] for the thickening process of gold hydrometallurgy.
From Fig. 6, we can obtain that the learned BN structure by the proposed method has two different arcs compared with the structure in the paper [23]. The difference is that the learned structure adds one arc
In this paper, we propose a new method of BN structure learning by integrating expert knowledge and the observational data based on the score-based method. In the BN structure learning framework, to improve the performance of expert knowledge usage, the representation and integration of expert knowledge are implemented by the IFS method. The determination of BN priori structure is transformed into the GDM based on IFS. To search for the most suitable structure under the constraints from priori structure, the score-based method based on the GA is used. The score function BIC and GA are improved to satisfy the constraints from expert knowledge. Some experiments are provided to demonstrate the validity of proposed scheme and compare the performance with the existing research results. The obtained BN structure owns better performance than the score-based method without expert knowledge. The more the quantity of expert knowledge is, the better the performance of BN structure learning would be. Finally, the proposed method is applied to the thickening process of gold hydrometallurgy to solve the practical problem. For the further research, we will consider more uncertain information when using the expert knowledge to determine the BN priori structure.
Footnotes
Acknowledgments
This work was supported by the National Nature Science Foundation of China [grant numbers 61533007; 61873049], the National Key Research and Development Program of China [grant numbers 2017YFB0304205].
