Abstract
Since the intuitionistic fuzzy set (IFS) was proposed by Atanassov, many explorations of this particular fuzzy set were conducted. One of the most important areas is the study of similarity and distance between IFSs, which can measure the degree of deviation of objects with uncertain and vague features, and this technique has great value and potential to solve the fuzzy and uncertain problems in the real world. Based on our previous similarity/distance measure model D JJ (α, β), a new method is proposed for improving the performance of similarity/distance measure model of IFSs, which is derived from the sum of the areas of two triangles constructed by the transformed isosceles triangles of two IFSs. A great effort is made to prove the validity of the proposed method by mathematical derivation. In order to further demonstrate the performance of the proposed method, we apply this method to solve some practical problems such as pattern recognition, medical diagnosis, and cluster analysis. In addition, we also list a series of the existing methods which are used to compare with the proposed method to prove the effectiveness and superiority. The experimental results confirm that the performance of the proposed method exceeds most of the existing methods.
Keywords
Introduction
In traditional set theory, there are only two possible relationships (that are belong and not belong to) between each element and a set. A large number of uncertain and fuzzy information exist extensively in the real world causing that the membership between an element and a set is imprecise and vague, so traditional set theory cannot handle these problems effectively [1, 2]. In order to overcome this problem, many theories were explored, and fuzzy set theory is the most representative one [3].
In 1965, Zadeh [4] proposed fuzzy set (FS) theory based on the traditional set theory, this work laid the foundation of the FS theory and fuzzy mathematics. In FS theory, the concept of membership is defined, and a mathematical function is used to represent the degree of membership that an element belongs to a specific fuzzy set. Thus, FS theory can effectively handle the uncertain or vague problems that traditional set theory cannot. However, the concept of membership degree in original FS theory is obviously unable to perfectly solve the uncertain and vague problems in the real world because of the limitation of the membership function. In 1986, Atanassov [5, 6] introduced the concept of IFS and used three indicators including membership, non-membership and hesitation to indicate the relationship between an element and a particular set, which can improve the ability of IFS for describing the fuzzy relationship.
For IFS theory, the research of similarity and distance measure is a hot topic recently [7, 8]. Distance and similarity measure of IFSs is a pair of complementary concept that can be used to measure the degree of deviation and similarity between IFSs [7]. The two techniques have been widely-used in many fields of computer science, such as pattern recognition [7–10], image processing [11–13], cluster analysis [13–17], medical diagnosis [8, 18], and decision-making [8, 18–20]. As the core theory of IFSs, similarity and distance measures have been intensively studied by researchers.
In 1997, Chen [21] proposed a distance measure between vague sets and elements, and then this method was applied to the behavior analysis problem by a practical example. However, the method proposed by Chen [21] may produce non-ideal results in some cases which were demonstrated by Hong and Kim in their work published in 1999 [22]. At the same time, Hong and Kim proposed an improved method for measuring the degree of similarity, and then the effectiveness of their method was verified as well. In 2002, Dengfeng and Chuntian [23] gave a definition of the similarity between IFSs, and then proposed several methods of similarity measure for IFSs. In 2003, Mitchell [24] pointed out that there were some counter-intuitive cases of Dengfeng and Chuntian¡¯s measures, a new method was proposed to improve the measure performance, and he also applied it to pattern recognition problem; in the meantime, Liang and Shi [25] summarized several similarity measures and proposed a new similarity measure method for IFSs. Based on Hausdorff distance measure, Hung and Yang [26] proposed a method to measure the degree of similarity between IFSs based on the proof of the properties of similarity measure, and then the effectiveness of this method was proved through some experiments. The similarity measure methods before 2007 were reviewed by Li et al. [2] who made a comprehensive comparison and detailed analysis of these methods. Hung and Yang [27] analyzed several similarity methods between traditional FSs, which inspired two new similarity measure between IFSs, and the new methods were employed to evaluated the studentsąŕ answer-scripts. Ye [28] used the two functions of membership and non-membership to form a vector to represent two elements, and he defined the cosine similarity measure between IFSs and introduced a method to medical diagnosis and pattern recognition.
In addition, Zhang and Yu [29] proposed a distance measure between IFSs and interval-valued fuzzy sets based on the transformed triangle in 2013. Beliakov et al. [30] defined a novel similarity measure that was utilized to the problems of decision making and image segmentation. By introducing the intuitionistic fuzzy number of the centroid point of the right triangle of the IFSs, Chen et al. [8] proposed a method for measuring the relationship between IFSs, this work improved the performance of the distance measure and solved some pattern recognition problems. Jiang et al. [7] proposed a new geometric model based similarity/distance measure method that constructed by a diagonal line and two isosceles triangles transformed by IFSs, and this method was also employed to some practical problems, such as medical diagnosis, cluster analysis, and decision-making. Generally, geometric model-based similarity measure method has favorable interpretability because of the perceptible description.
At present, the most existing methods can be divided into three strategies. (1) Some of these are put forward originally [23, 28]. (2) Some methods are improved based on the previous models [8, 26]. (3) The rest are proposed by geometric modeling methods [7, 30]. Unfortunately, these existing models are not perfect with some unavoidable defects, for example, the intelligibility and unreasonable decision-making. Therefore, we propose a geometric model-based measure model to improve the performance of similarity/distance measure in this work.
In this work, we first introduce the related concepts of IFS theory, and then some existing methods for measuring the similarity/distance between IFSs are reviewed. Second, we present a new similarity measure between IFSs based on the transformation of isosceles triangle fuzzy numbers that is constructed previous work, and a large number of mathematical derivation is used to prove the validity of the proposed method. Some experiments are implemented to compare the performance of the proposed measure with the existing methods.
This paper is structured as follows. In Section 2, the related knowledge of FS and IFS is briefly introduced. Section 3 lists sixteen existing methods for measuring the distance and similarity between IFSs. In Section 4, the proposed model is introduced and verified. Section 5 contains all the experiments and applications. In Section 6, we summarize this work and present our next plan.
Basic knowledge related to fuzzy set theory
To help readers understand this paper easily, we introduce some basic concepts related to IFS theory in this section.
For IFSs, Atanassov [5] introduced three intuitionistic fuzzy numbers to indicate the relationship between an element and a special set. IFS α is defined as
α ={ 〈x i , μ α (x i ) , ν α (x i ) 〉|x i ∈ U } where μ α (x i ) represents the membership degree of x i belonging to IFS α. ν α (x i ) indicates the non-membership degree of x i belonging to IFS α. In addition, π α (x i ) = 1 - μ α (x i ) - ν α (x i ), and π α (x i ) indicates the hesitation degree of x i belonging to IFS α. μ α (x i ), ν α (x i ) and π α (x i ) are satisfied with 0 ≤ μ α (x i ) ≤ 1, 0 ≤ ν α (x i ) ≤ 1, 0 ≤ π α (x i ) ≤ 1, and μ α (x i ) + ν α (x i ) + π α (x i ) = 1.
In 1975, Zadeh [31] proposed a new extended FS called interval-valued fuzzy set (IvFS) that is defined as the following
There is a conversion relationship between IFS and IvFS as follows.
T : IFS (U) → IvFS (U) where IFS (U) and IvFS (U) are respectively denoted by α and α#, and α# ={ 〈x i , μ α (x i ) 1 - ν α (x i ) 〉|x i ∈ U }.
1) α = β ⇔ ∀ x i ∈ U, μ α (x i ) = μ β (x i ) and ν α (x i ) = ν β (x i );
2) α ≤ β ⇔ ∀ x i ∈ U, μ α (x i ) ≤ μ β (x i ) and ν α (x i ) ≥ ν β (x i )
3) α ≥ β ⇔ ∀ x i ∈ U, μ α (x i ) ≥ μ β (x i ) and ν α (x i ) ≤ ν β (x i )
4) α C ={ 〈x, ν α (x i ) , μ α (x i ) 〉|x i ∈ U }
The distance can be used to represent the deviation between two elements (objects) in the real world. Similarly, the degree of deviation between IFSs can also be measured by distance. The distance between IFSs has the following relationship.
D1 : 0 ≤ D (α, β) ≤ 1;
D2 : D (α, β) = 0 if and only if α = β;
D3 : D (α, β) = D (β, α), for all α, β ∈ U are established;
D4 : For α, β, γ ∈ U, if α ⊆ β ⊆ γ, then D (α, β) ≤ D (α, γ), D (β, γ) ≤ D (α, γ).
The similarity measure can characterize the degree of similarity between IFSs. Like distance, a reasonable similarity measure must satisfy the following four conditions.
S1 : 0 ≤ S (α, β) ≤ 1;
S2 : S (α, β) = 1 if and only if α = β;
S3 : S (α, β) = S (β, α), for all α, β ∈ U are established;
S4 : For α, β, γ ∈ U, if α ⊆ β ⊆ γ, then S (α, β) ≥ S (α, γ), S (β, γ) ≥ S (α, γ).
S (α, β) = 1 - D (α, β) or D (α, β) = 1 - S (α, β).
Existing distance and similarity measures between IFSs
Since the introduction of IFS by Atanassov, various measure methods have been proposed successively. In this section, sixteen classic similarity/distance measures are listed. α and β denote two IFSs, and the methods are shown as follows.
1) Boran and Akay [10] proposed an IFS similarity measure with two parameters in 2014:
where t=2, 3, 4, ⋯, and p=1, 2, 3, ⋯.
2) In 1995, Chen [36] proposed a method S
C
(α, β) of similarity measure. The theoretical basis of this method is that IFS and Vague Set are equivalent.
3) Chen made a valuable contribution in measuring the distance between IFSs. In 2016, Chen [8] proposed a model to derive an effective similarity measure S
CCL
(α, β):
4) The similarity measure S DC (α, β) proposed by Dengfeng and Chuntian [23] is a highly representative method.
5) Hong and Kim [22] improved Chen’s method in 1999 to represent a new method S
H
(α, β):
6) Hung and Yang [26] proposed a similarity measure d
HY
(α, β) based on Hausdorff distance:
Based on this distance measure between IFSs, Hung and Yang further proposed a series of corresponding similarity measure methods, as:
7) After a deeply study of L
P
measure, Hung [37] proposed a series of similarity measures based on L
P
measure in 2007.
8) Afterwards, Hung [27] proposed a series of methods based on the model proposed by Pappis [38]. The specific method is as follows:
9) In 2018, Hwang [13] proposed a new similarity measure S
HW
(α, β) using the Jaccard-factor:
10) In 2019, Jiang [7] derived a new distance measure D
JJ
(α, β) based on a diagonal line and the intersection of isosceles triangles transformed IFSs.
11) Liang [25] proposed three similarity measures
where φα1 (x
i
) = (μ
α
(x
i
) - φ
α
(x
i
))/2, φβ1 (x
i
) = (μ
β
(x
i
) - φ
β
(x
i
))/2, φα2 (x
i
) = (φ
α
(x
i
) +1 - ν
α
(x
i
))/2, φβ2 (x
i
) = (φ
β
(x
i
) +1 - ν
β
(x
i
))/2, φ
α
(x
i
) = (μ
α
(x
i
) +1 - ν
α
(x
i
))/2, φ
β
(x
i
) = (μ
β
(x
i
) +1 - ν
β
(x
i
))/2.
12) Mitchell [24] pointed out that Dengfeng and Chuntian’s [23] similarity measure had some unavoidable problems, so he proposed a new method S M (α, β) that was expected to overcome the shortcomings of S DC (α, β).
where g (x i ) = μ α (x i ) - μ β (x i ), q (x i ) = ν α (x i ) - ν β (x i )
13) Song [39] proposed a similarity measure S S (α, β).
where g (x i ) =1 - μ α (x i ) - ν α (x i ), q (x i ) =1 - μ β (x i ) - ν β (x i )
In 2019, Song proposed [20] an improved measure of similarity S
SN
(α, β) after he presented the shortcomings of the previous method S
S
(α, β).
14) In 2005, Wang and Liu [33, 40] proposed two methods, one was the similarity measure S
L
(α, β) and the other was the distance measure D
W
(α, β).
where φ (x i ) = |μ α (x i ) - μ β (x i ) |, φ (x i ) = |ν α (x i ) - ν β (x i ) |, g (x i ) = μ β (x i ) - μ α (x i ), q (x i ) = ν β (x i ) - ν α (x i )
15) Ye [28] explored similarity measure from a new perspective, and proposed a similarity measure S
Y
(α, β) based on the cosine theorem.
16) Vlachos [11] proposed a distance measure method D
V
(α, β) using cross-entropy, as:
The existing measure methods have some defects in some cases, which may lead to unsatisfactory results in its applications. To improve the performance of similarity/distance measure, we therefore propose an effective distance/similarity measure method of IFSs based on the area of the transformed isosceles right triangle fuzzy number, which is modified on the basis of our previous work [7].
Firstly, we transform IFSs α and β into IvFSs which are expressed as:
For convenience, the transformed intuitionistic fuzzy number (μ α (x i ) , 1 - ν α (x i )) and (μ β (x i ) , 1 - ν β (x i ) is expressed by (Z1, Z2) and (H1, H2), and 0 ≤ H1 ≤ H2 ≤ 1, 0 ≤ Z1 ≤ Z2 ≤ 1. As shown in Fig. 1, triangle H1H2H3 and triangle Z1Z2Z3 are two isosceles triangles obtained by the transformation of IFSs α and β, respectively. H3 and Z3 are the vertices of the two isosceles triangles.

The isosceles right triangles mentioned in the proposed method
The intersection points of two isosceles triangles transformed by IFSs α and β are defined as P1 and P2. Lines
The area of triangle P1O1Q1 is:
The area of triangle P2O2Q2 is:
Point P1 is the intersection point of line
The equation of line
The equation of line
According to Eq. (30) = Eq. (31), the coordinate of point P1 on the X-axis is:
The coordinate of point P1 on Y-axis is:
Point P2 is the intersection of line Z2Z3 and line H2H3. The coordinates of point P2 can be calculated as follows.
The equation of line Z2Z3 is:
According to Eq. (32) = Eq. (33), the coordinate of point P2 on X-axis is:
The coordinate of point P2 on Y-axis is:
The length of
The equation for the diagonal is: x - y = 0. According to the distance equation
The proposed distance measure is denoted by D (α, β) that is the distance between IFSs α and β. D (α, β) can be calculated as:
So, the distance D (α, β) between α and β can be expressed as:
Because of S (α, β) = 1 - D (α, β), the corresponding similarity measure S (α, β) is:
Because of the complementary characteristic of distance measure and similarity measure, the distance measure method must satisfy properties (D1-D4) while the similarity measure method satisfies properties (S1-S4). Therefore, we only need to prove that the similarity measure method satisfies properties (S1-S4).
The proof as follows:
(a) Because of
According to
The maximum values of
(b) When
When μ
α
(x
i
) = μ
β
(x
i
) andν
α
(x
i
) = ν
β
(x
i
) , we can find that P1 and O1 become a coincide point, P2 and O2 become a coincide point. So
The above analysis can get this conclusion:
0 ≤ S (α, β) ≤1.
The proof is completed.
(a) S (α, β) =1 ⇒ α = β;
According to
Because 0 ≤ 1 - μ α (x i ) - ν α (x i ) ≤1, 0 ≤ 1 - μ β (x i ) - ν β (x i ) ≤1, and 4 - (1 - μ α (x i ) - ν α (x i )) (1 - μ β (x i ) - ν β (x i )) > 0, we only need to prove the next equation set:
Take Eq. (37) into Eq. (38), we can get μ β (x i ) - μ α (x i ) = ν β (x i ) - ν α (x i ) which is then took into Eq. (37), and an item μ β (x i ) ν β (x i ) is also took into Eq. (37). Then, we can get follows:
Thus, we can infer that ν β (x i ) - ν α (x i ) =0 or 1 - μ β (x i ) + ν β (x i ) =0.
For the first case: ν β (x i ) - ν α (x i ) =0 ⇒ ν β (x i ) = ν α (x i ), and then we take it into Eq. (37) and Eq. (38) to get μ α (x i ) = μ β (x i ). By this way, we can obtain α = β.
For the second case: 1 - μ β (x i ) + ν β (x i ) =0 ⇒ μ β (x i ) - ν β (x i ) =1. Because 0 ≤ μ β (x i ) ≤1 and 0 ≤ ν β (x i ) ≤1, we can infer that μ β (x i ) - ν β (x i ) =1 is established if and only if μ β (x i ) =1 and ν β (x i ) =0. We take μ β (x i ) =1 and ν β (x i ) =0 into Eq. (38) to get μ α (x i ) =1 and ν α (x i ) =0. We can obtain α = β.
To sum up, we can get: μ α (x i ) = μ β (x i ) , ν α (x i ) = ν β (x i ) ⇒ α = β.
(b) α = β ⇒ S (α, β) = 1;
We have known that α = β is established, so μ α (x i ) = μ β (x i ) and ν α (x i ) = ν β (x i ) can be inferred. Take μ β (x i ) = μ α (x i ) and ν β (x i ) = ν α (x i ) into S (α, β), we can obtain:
Hence, we can prove that S (α, β) = 1 if and only if α = β.
The proof is completed.
According the feature of square operation and Eq. (35), we can get:
Hence, we can get:
To summarize the above, S (α, β) = S (β, α).
The proof is completed.
Let
Hence, we can prove that S (α, β) = 1 if and only if α = β.
The proof is completed.
So, we can rewrite Eq. (35) as S (α, β) =1 - 2 [(S1 (α, β)) 2 + (S2 (α, β)) 2], and we can further get: S (α, γ) =1 - 2 [(S1 (α, γ)) 2 + (S2 (α, γ)) 2]. If S1 (α, β) ≤ S1 (α, γ), and S2 (α, β) ≤ S2 (α, γ), we can get S (α, β) ≥ S (α, γ). Therefore, we divide S (α, β) ≥ S (α, γ) into two steps to prove it, and the proof as follows:
(a) First step: we should prove S1 (α, β) ≤ S1 (α, γ). So we can get:
Because 0 ≤ 1 - μ α (x i ) - ν α (x i ) ≤1, 0 ≤ 1 - μ β (x i ) - ν β (x i ) ≤1, 0 ≤ 1 - μ γ (x i ) - ν γ (x i ) ≤1, 4 - (1 - μ α (x i ) - ν α (x i )) (1 - μ β (x i ) - ν β (x i )) ≥ 0, and 4 - (1 - μ α (x i ) - ν α (x i )) (1 - μ γ (x i ) - ν γ (x i )) ≥ 0. We can get:
We only need to prove that the numerator part of S1 (α, γ) - S1 (α, β) is larger than 0, as:
For convenience, we divide Eq. (39) into two parts as:
Because α ⊆ β ⊆ γ that is 0 ≤ μ α (x i ) ≤ μ β (x i ) ≤ μ γ (x i ) ≤1 and 0 ≤ ν γ (x i ) ≤ ν β (x i ) ≤ ν α (x i ) ≤1, then we can get:
Thus, we can get that Eq. (40)- Eq. (41) ≥ 0 is always satisfied.
From the above analysis, we can know that the numerator of S1 (α, γ) - S1 (α, β) is always larger than 0, which means S1 (α, β) ≤ S1 (α, γ).
(b) Second step: we proof S2 (α, β) ≤ S2 (α, γ). So we can get:
From the proof of the first step, we know that:
We only need to prove that the numerator of S2 (α, γ) - S2 (α, β) is larger than 0, as:
For convenience, we divided Eq. (42) into two parts, as:
Then, we can get:
Because α ⊆ β ⊆ γ (that is 0 ≤ μ α (x i ) ≤ μ β (x i ) ≤ μ γ (x i ) ≤1 and 0 ≤ ν γ (x i ) ≤ ν β (x i ) ≤ ν α (x i ) ≤1), we can get:
Hence, Eq. (43)- Eq. (44) ≥ 0 is always satisfied.
From the above analysis, we can know that the numerator of S2 (α, γ) - S2 (α, β) is always larger than 0, which means that S1 (α, β) ≤ S1 (α, γ).
Summarize the above analysis, we can get S (α, β) ≥ S (α, γ). Correspondingly, S (β, γ) ≥ S (α, γ) can be proved in the same way.
Overall, we can draw a conclusion that if α ⊆ β ⊆ γ, then S (α, β) ≥ S (α, γ) and S (β, γ) ≥ S (α, γ).
The proof is completed.
Numerical experiment is the most basic task of verifying the effectiveness of similarity/distance measure method. In this section, two sets of numerical experiments are performed, and two results are shown in Table 1 and Table 2. Each numerical experiment contains six pairs of IFSs α and β. If the similarity of the six pairs of IFSs are different, we can infer that the measure method is reasonable in these cases; otherwise, the method is unreasonable. Some classical similarity/distance measure methods from the
The first set of numerical results of the proposed method compared with other methods
The first set of numerical results of the proposed method compared with other methods
The second set of numerical results of the proposed method compared with other methods
The result of the first experiment is shown in Table 1, the data were also used in [8, 41]. Most results of contrast methods are unreasonable, two of the twenty-five IFS pairs are reasonable, and one of them is the proposed method (expressed as S
HJF
), and another is S
JJ
. The unreasonable methods are S
C
, S
HK
, S
DC
,
The similarity of the second experiment are shown in Table 2, the data were also used in [41]. Although S JJ obtained reasonable results in the first numerical experiment, the results produced by S JJ are unreasonable in the second experiment. Some unreasonable results are produced in S Y and D V because the denominator is zero(this situation is represented in the table as N/A). Expect the proposed method, other comparison methods are all unreasonable because the same similarity are appeared in different IFSspairs.
Overall, none of the comparison methods satisfy both the two numerical experiments. Fortunately, the results which are produced by the proposed method are reasonable in both numerical experiments, which also confirms that our proposed method is more effective than other methods.
In this section, we introduce three kinds of applications, such as pattern recognition, medical diagnosis, and cluster analysis, to test the performance of the proposed method. In this section, the parameter p is set as 1 in S
DC
,
Applications for pattern recognition
We tend to classify two most similar object into one class in the real world. The similarity measure method can be used to measure the degree of similarity between IFSs from the real world, such as pattern recognition. In this sub-section, we give four pattern recognition problems. Several different patterns are given in each experiment, and then the degree of similarity between a test pattern and the given patterns are calculated. There is a basic rule: the test pattern should be classified into the class (one of the given pattern) that is most similar with the test pattern. However, if the similarity between a test pattern and the given patterns are all same, it will be a failed case, which indicates that the decision result of the method is unreasonable.
α1 ={ 〈x1, 0.1, 0.1〉, 〈x2, 0.5, 0.1〉, 〈x3, 0.1, 0.9〉 }
α2 ={ 〈x1, 0.5, 0.5〉, 〈x2, 0.7, 0.3〉, 〈x3, 0.0, 0.8〉 }
α3 ={ 〈x1, 0.7, 0.2〉, 〈x2, 0.1, 0.8〉, 〈x3, 0.4, 0.4〉 }
The test pattern is denoted by β in U ={ x1, x2, x3 }, as:
β ={ 〈x1, 0.4, 0.4〉, 〈x2, 0.6, 0.2〉, 〈x3, 0.0, 0.8〉 }
The results of different measure methods are shown as Table 3. Obviously, there are two similarity measure methods, as S C and S DC cannot classify the unknown pattern into one given class because S1 (α1, β) = S2 (α2, β) and S1 (α1, β) = S2 (α2, β) > S3 (α3, β). D V cannot classify the test pattern because the problem of dividing by zero in this method(this situation is represented in the table as N/A). However, other methods exclusively classify β into S2, and the result is the same as the proposed method.
The first experiment of different similarity measure methods on pattern recognition problem
The first experiment of different similarity measure methods on pattern recognition problem
α1=< x1,0.5,0.3>,< x2,0.7,0.0>,< x3,0.4,0.5>,< x4,0.7,0.3>
α2=< x1,0.5,0.2>,< x2,0.6,0.1>,< x3,0.2,0.7>,< x4,0.7,0.3>
α3=< x1,0.5,0.4>,< x2,0.7,0.1>,< x3,0.4,0.6>,< x4,0.7,0.2>
The test pattern is denoted by β in U ={ x1, x2, x3 }, as:
β ={ 〈x1, 0.4, 0.3〉, 〈x2, 0.7, 0.1〉
〈x3, 0.3, 0.6〉, 〈x4, 0.7, 0.3〉 }
The results of different similarity measure methods are shown in Table 4. S
HK
,
The second experiment of different similarity measure methods on pattern recognition problem
α1 ={ 〈x1, 0.34, 0.34〉, 〈x2, 0.19, 0.48〉, 〈x3, 0.02, 0.12〉 }
α2 ={ 〈x1, 0.35, 0.33〉, 〈x2, 0.20, 0.47〉, 〈x3, 0.0, 0.14〉 }
α3 ={ 〈x1, 0.33, 0.35〉, 〈x2, 0.21, 0.46〉, 〈x3, 0.01, 0.13〉 }
The test pattern is denoted by β in U ={ x1, x2, x3 }, as:
β ={ 〈x1, 0.37, 0.31〉, 〈x2, 0.23, 0.44〉,
〈x3, 0.04, 0.10〉 }
The results of different similarity measure methods are shown in Table 5. There are only seven methods that can classify the unknown pattern, including the proposed method,
The third experiment of different similarity measure methods on pattern recognition problem
α1=< x1,0.94,0.00>,< x2,0.88,0.00>,< x3,0.82,0.00>,
< x4,0.78,0.02>,< x5,0.75,0.05>,< x6,0.72,0.08>
α2=< x1,0.86,0.07>,< x2,0.92,0.04>,< x3,0.98,0.01>,
< x4,0.98,0.00>,< x5,0.95,0.00>,< x6,0.92,0.00>
α3=< x1,0.66,0.14>,< x2,0.72,0.08>,< x3,0.78,0.02>,
< x4,0.84,0.00>,< x5,0.90,0.00>,< x6,0.96,0.00>
The test pattern is denoted by β in U = x1,x2,x3,x4,
x5,{x6, as:
β=< x1,0.53,0.27>,< x2,0.56,0.24>,< x3,0.59,0.21>,
< x4,0.64,0.18>,< x5,0.70,0.15>,< x6,0.76,0.12>
The results of different similarity measure methods are shown in Table 6. There is only one method (D V ) cannot classify the unknown pattern because the problem of dividing by zero(this situation is represented in the table as N/A). Other methods (including the proposed method) can make a decision that classifies the unknown pattern β into S3.
The fourth experiment of different similarity measure methods on pattern recognition problem
In this sub-section, the effectiveness of the proposed method is proved because obtains more reasonable classification results compared with most of the existing methods.
Medical diagnosis is one of the most common problems in the real world, the idea of classifying patients is similar to that of pattern recognition. Hence, we apply the idea of pattern recognition to medical diagnosis. After analyzing the fuzzy numbers of an unknown patient, the obtained similarity can be used to diagnose the unknown patient according to the given patterns of disease. In this sub-section, the value of the special parameter in comparison methods is the same as that of the sub-section 6.1.
{α1, α2, α3, α4, α5}, the five elements are represented by the IFSs S1, S2, S3, S4, and S5 in the universe of discourse U ={ x1, x2, x3, x4, x5 } and they were used in [30,43,44, 30,43,44], where x1 means Temperature, x2 means Headache, x3 means StomachPain, x4 means Cough, and x5 means ChestPain. The experimental results are shown in Table 7. The IFSs S1, S2, S3, S4, and S5 are expressed as follows:
The result of the comparison between the proposed method and the existing method
The result of the comparison between the proposed method and the existing method
S1(Viral fever)=< x1,0.4,0.0>,< x2,0.3,0.5>,
< x3,0.1,0.7>,< x4,0.4,0.3>,< x5,0.1,0.7>
S2(Malaria)=< x1,0.7,0.0>,< x2,0.2,0.6>,
< x3,0.0,0.9>,< x4,0.7,0.0>,< x5,0.1,0.8>
S3(Typhoid)=< x1,0.3,0.03,< x2,0.6,0.1>,
< x3,0.2,0.7>,< x4,0.2,0.6>,< x5,0.1,0.9>
S4(Stomach problem)=< x1,0.1,0.7>,< x2,0.2,0.4>,
< x3,0.8,0.0>,< x4,0.2,0.7>,< x5,0.2,0.7>
S5(Chest problem)=< x1,0.1,0.8>,< x2,0.0,0.8>,
< x3,0.2,0.8>,< x4,0.2,0.8>,< x5,0.8,0.1>
Suppose the diagnosis of a patient is unknown, but the patient’s symptoms are expressed by IFS β in the universe of discourse U ={ x1, x2, x3, x4, x5 } as follows:
β(Patient)=< x1,0.8,0.1>,< x2,0.6,0.1>,
< x3,0.2,0.8>,< x4,0.6,0.1>,< x5,0.1,0.6>
In this experiment, twenty-four methods are selected to compare with the proposed method. As shown in Table 7, D
V
cannot make a decision because the problem of dividing by zero(this situation is represented in the table as N/A). The classification results that obtained by S
C
, S
DC
,
{α1, α2, α3, α4, α5}, the four elements are represented by the IFSs S1, S2, S3, and S4 in the universe of discourse U ={ x1, x2, x3, x4, x5 } and they were used in [7] where x1 means characterofstool, x2 means bellyache, x3 means ictussileus, x4 means chronicsileus, and x5 means anemia. IFSs S1, S2, S3, and S4 are represented as:
S1(metastasis)=< x1,0.4,0.4>,< x2,0.3,0.3>,
< x3,0.5,0.1>,< x4,0.5,0.2>,< x5,0.6,0.2>
S2(recurrence)=< x1,0.2,0.6>,< x2,0.3,0.5>,
< x3,0.2,0.3>,< x4,0.7,0.1>,< x5,0.8,0.0>
S3(bad)=< x1,0.1,0.9,< x2,0.0,0.1>,
< x3,0.2,0.7>,< x4,0.1,0.8>,< x5,0.2,0.8>
S4(well)=< x1,0.8,0.2>,< x2,0.9,0.0>,
< x3,1.0,0.0>,< x4,0.7,0.2>,< x5,0.6,0.4>
Suppose the diagnosis of a patient is unknown, but the patient’s symptoms are expressed as IFS β in the universe of discourse U ={ x1, x2, x3, x4, x5 } as follows:
β(Patient)=< x1,0.3,0.5>,< x2,0.4,0.4>,
< x3,0.6,0.2>,< x4,0.5,0.1>,< x5,0.9,0.0>
The experimental results of different methods are shown in Table 8. As it can be seen from Table 8, S
HK
,
The result of the comparison between the proposed method and the existing method
The proposed method obtains reasonable results that are analyzed in these experiments. However, some of comparison methods cannot make a decision in above experiments, which reveal the proposed method has competitiveness and potential to medical diagnosis.
Fuzzy theory-based clustering methods is suitable for handling the uncertain and vague problem in the real world. In 1969, Ruspini [18] introduced the concept of fuzzy partition and then applied it for handling cluster analysis problems. In 2008, the clustering method based on intuitionistic fuzzy was proposed by Xu [15]. We employed the proposed similarity measure method to produce an association matrix according to the work from [15], so the proposed method can be used for dealing with cluster problems. In this sub-section, two clustering experiments are used to test the performance of the similarity measure-based clustering method.
The car data set
The car data set
Firstly, according to the intuitionistic fuzzy numbers in Table 9, a similarity matrix
Secondly, according to the operation method in
Obviously,
By comparing
Thirdly, we get
The clustering results of the proposed method
According to different confidence levels, a variety of clustering results are obtained. In Table 10, the confidence level of No. 2, 3, 5, 6, 7, and 8 respectively classify these cars into two, three, four, five, seven, and eight categories, which are the same as that of [7]. In addition, the clustering result of No. 2 classifies the data set into two categories: {A1, A6} and {A2, A3, A4, A5, A7, A8, A9, A10}, and the same results can be found in [9, 15]. The clustering result of three categories: {A1, A6}, {A4, A9}, and {A2, A3, A5, A7, A8, A10} can be also found in [9][13]. The results of the four categories: {A1, A6}, {A4, A9}, {A2, A3, A7, A8} and {A5, A10} can be verified by [9, 10]. Overall, the proposed method can be used for clustering.
A1 ={ x, (0.910, 0.080) } A2 ={ x, (0.930, 0.070) } A3 ={ x, (0.870, 0.120) } A4 ={ x, (0.850, 0.140) } A5 ={ x, (0.790, 0.200) } A6 ={ x, (0.190, 0.800) } A7 ={ x, (0.100, 0.820) } A8 ={ x, (0.450, 0.550) } A9 ={ x, (0.030, 0.820) } A10 ={ x, (0.070, 0.730) } A11 ={ x, (0.500, 0.500) } A12 ={ x, (0.910, 0.080) } A13 ={ x, (0.400, 0.500) } A14 ={ x, (0.420, 0.480) } A15 ={ x, (0.460, 0.460) }
Similar to
The clustering results of the proposed method
The clustering results obtained according to different confidence levels are shown in Table 11. The results of two cluster categories: {A1,A2,A3,A4,A5} and {A6,A7,A8,A9,A10,A11,A12,A13,A14,A15} are confirmed by [7]. We can find that the results of the three cluster categories:{A1,A2,A3,A4,A5}, {A6,A7,A8,A9,A10} and {A11,A12,A13,A14,A15} are the same as [7][9][11][14]. The results are supported by many previous works, which shows the effectiveness of the proposed method.
In this paper, we propose a new method for measuring the degree of distance and similarity between IFSs. Different from our previous work, this method is derived from the sum of areas of two right triangles that are constructed by the diagonal line in square area and the transformed isosceles triangles of two IFSs. The new similarity measure method is proposed based on a geometrical model in a rectangular coordinate system with unit one, which has a favorable interpretability. A lot of mathematical derivation are conducted to prove the validity of the structured similarity measure model.
In addition, we also compare many the existing methods with the proposed method through numerical experiments. In terms of the measure of degree of similarity and distance, the proposed method has a strong superiority on our previous work and other existing methods. Subsequently, we apply the proposed method to pattern recognition, medical diagnosis, and cluster analysis to further illustrate the practicality of the proposed method. Our next research is to explore how to extend the similarity/distance measure method between IFSs to more applications, and we will also try to introduce the ideas of geometrical modeling into other fuzzy measure model fields.
Footnotes
Acknowledgments
This study was supported in part by the National Natural Science Foundation of China (No. 62002313), Key Laboratory in Software Engineering of Yunnan Province (No. 2020SE408), Postdoctoral Science Foundation ofYunnan Province in China, Doctoral Candidate Academic Award of Yunnan Province in China, and Yunnan University’s Research Innovation Fund for Graduate Students (No. 2019164).
