Abstract
Uncertainty measure is one of the key issues in the study of rough set theory, however, the existing studies on uncertainty measure are restricted to set-theoretic rough set model(crisp or fuzzy). This paper extends the uncertainty measure of formulae in rough logic to probabilistic environments. By employing the probability measure theory, a new notion of probabilistic rough truth degree (P-rough truth degree for short) is proposed. This notion is demonstrated to be adequate for measuring the extent to which any formula is roughly true in probabilistic environments. Then based upon the fundamental notion, the notions of P-rough similarity degree, P-accuracy degree and P-roughness degree of formulae in rough logic are also proposed. The properties of these concepts are investigated in detail. Moreover, the notion of P-rough similarity degree can also induce, in a natural way, three kinds of pseudo-metrics on the set of rough formulae, which can be used to develop a kind of approximate reasoning in rough logic.
Introduction
Rough set theory [1, 2] is proposed to account for the definability of a concept in terms of some elementary ones in an approximation space. It captures and formalizes the basic phenomenon of information granulation. The finer the granulation is, the more concepts are defined in it. For those concepts not definable in an approximation space, the lower and upper approximation (See Definition 2.2) can be defined. As an effective tool in extracting knowledge from data tables, rough set theory has been widely applied in intelligent data analysis, decision making, machine learning and other related fields [3–5].
As one of the most important issues in rough set theory, uncertainty of a set has been widely studied. Pawlak presented several numerical measures in [2], which are accuracy and roughness of a set and approximation accuracy of a rough classification. Although these measures are effective, they have some restrictions. The applications of rough set theory in some fields are hence limited. To solve this problem, an improved accuracy measure for rough sets was given in [6], which calculates the imprecision of a set using an excess entropy. However, this measure has a complex mathematical form. To overcome the shortcomings of the existing measures, Liang address the issues of uncertainty of a set in an information system and approximation accuracy of a rough classification in a decision table in [7]. There are, of course, some other research works along this research line, for instance, Mi [8] extended the uncertainty measure of Pawlak’s rough set to partition-based fuzzy rough sets, Hu [9] studied uncertainty measure of covering generated rough Set, etc. However, all the above mentioned studies are restricted to set-theoretic rough set model (crisp or fuzzy). The notion of uncertainty measure is seldom explored from the viewpoint of rough logic theory, which, however, will be helpful in knowledge discovery and data-based reasoning in the framework of rough logic. In [10], She et al. proposed the notion of rough truth degree for formulae in rough logic for the first time, owing to the main tool he uses, i.e., even probability measures, the rough (upper, lower) truth degrees of each atomic proposition are all the same.
Motivated by such a consideration, we proceed to study the uncertainty measure of formulae in rough logic in a more reasonable probabilistic environment. Specifically, by employing the probability measure theory, a new notion of P-rough truth degree is proposed. This notion is demonstrated to be adequate for measuring the extent to which any formula is roughly true. Then, based upon the fundamental notion, the notions of P-rough similarity degree, P-accuracy degree and P-roughness degree are also proposed.
Pre-rough algebra and Pre-rough logic
Let’s briefly review the basic notions of rough set theory initially proposed by Pawlak [1, 2].
Then we call the rough upper(lower) approximation of X. Note that X is a definable set if and only if , and therefore, we also treat definable sets as special cases of rough sets.
(P, ≤ , ⊓ , ⊔ , ⇁ , L, → , 0, 1) is a bounded distributive lattice with least element 0 and largest element 1, ⇁⇁ a = a, ⇁ (a ⊔ b) = ⇁ a ⊓ ⇁ b, La ≤ a, L (a ⊓ b) = La ⊓ Lb, LLa = La, L1 = 1, MLa = La, ⇁La ⊔ La = 1, L (a ⊔ b) = La ⊔ Lb, La ≤ Lb and Ma ≤ Mb imply a ≤ b, a → b = (⇁ La ⊔ Lb) ⊓ (⇁ Ma ⊔ Mb),
where ∀a ∈ P, Ma = ⇁ L ⇁ a.
It can be easily checked that P is closed under the above operations, and moreover, (P, ≤ , ⊔ , ⊓ , ⇁ , → L, (∅ , ∅) , (U, U)) forms a pre-rough algebra.
In this section, by employing product probability measure on the set of valuations Ω3, we introduce the notion of probabilistic truth degree (P-rough truth degree) for rough formulae in PRL, with the intention of measuring the extent to which formulae in PRL are roughly true. As shown below, we adopt the integrated approach, that is, the valuations are taken collectively but not individually.
Then φ A is called the A-induced function.
Some fundamental results concerning the infinite dimensional product of measure spaces are stated int he following theorem.
Especially, in case E = A1 × ⋯ × A
n
,
In the subsequent discussion, unless stated otherwise, for any probability measure space , will always denote the largest σ-algebra on X k and μ k denotes the general probability measure on X k .
We are now to define the notion of P-rough truth degree for rough formulae in PRL, before we go, it is necessary to show that each formula-induced function is integrable in the product probability measure space .
Note that for each valuation v : F (S) →
We are now ready to define the notion of randomized truth degree for rough formula in PRL.
Then we call the randomized rough (upper, lower) truth degree of A, respectively, and call the randomized rough (upper, lower) truth degree mapping, respectively.
Also, since the value of A under each valuation v is uniquely determined by its segment v (n) = (v (p1) , ⋯ , v (p n )), enjoy the form of (8). Denote
Then it follows from the fact μ is the infinite product measure of the probability measures that
For , since ∀v ∈ Ω3, v (MA) ∈ {0, 1} and v (MA) =1 if and only if , we have that
Similarly, we can obtain
(2) In case are even probability measures on X
k
, then (14 – 16) reduce to the following form:
Similarly, we can obtain that
(3) Note that the idea of applying probability method to mathematical logic has already been introduced in some literatures. The idea was initiated by Adams E.’s influential work “probability logic” [14]. In probability logic, the probability p (A) of a proposition A, which is supposed to measure the credibility of A, is given by means of a probability distribution p on a set of finite state-descriptions. This idea was further brought to great height of development by Wang [15], who initiated a kind of “quantitative logic” by combining mathematical logic with probability computation. The key notion of quantitative logic is the truth degree of a proposition in many- valued propositional logic systems. Note that the notion of truth degree is thoroughly different from the probability of a proposition in probability logic. The probability of a proposition in probability logic depends on a probability distribution on a finite set, and hence is arbitrarily given. However, in quantitative logic, the truth degree assigned to each proposition A is not arbitrarily given, but is completely determined by the logic structure of A. Hence it is intrinsic but not arbitrary. However, the above analysis are all restricted to some commonly used propositional logic systems, they are seldom explored from the viewpoint of rough logic theory, consequently, the proposed notions such as the probability of a proposition in probability logic, the notion of truth degree in quantitative logic can not reflect the idea of rough approximations, however, the combination of probability and rough set theory will be helpful in the knowledge discovery, as stated in [15].
Compute the P-rough truth degrees of A = p1, B = p1 ⊓ p2.
Similarly, we have that
The notions of P-rough (upper, lower) truth degrees obey the following properties.
,
, , If ⊢A, i.e., A is a theorem in PRL, then , If ⊢MA, i.e., MA is a theorem in PRL, then , If ⊢A, i.e., LA is a theorem in PRL, then , If ⊢MA → MB, then ; If ⊢LA → LB, then ; If ⊢A → B, then τ (A) ≤ τ (B), and , If A and B are logically equivalent, i.e., both ⊢A → B and ⊢B → A are theorems in PRL, then τ (A) = τ (B), If A and B are rough-upperly equivalent, i.e., ⊢MA ↔ MB, then , If A and B are rough-lowerly equivalent, i.e., ⊢LA ↔ LB, then , τ (⇁ A) =1 - τ (A), , , τ (A ⊔ B) = τ (A) + τ (B) - τ (A ⊓ B);
;
.
(3) Suppose that A contains n atomic propositions p1, ⋯ , p n , then it follows from Theorem 2.8 that A is a valid formula, which in turn entails that . Hence, we have , . Moreover, follows immediately from Proposition 3.8(1).
It can be shown in a similar way that if ⊢MA, then , and if ⊢A, then .
(4) We assume, without any loss of generality, that both A and B contain the same atomic formulae p1, ⋯ , p n . If ⊢A → B, v (A → B) =1 for any valuation v ∈ Ω3, which immediately entails , . Hence .
If ⊢MA → MB, then it can be proved in a similar manner as above (by letting A = MA, B = MB in the above argument) that τ (MA) ≤ τ (MB), applying Proposition 3.8 (2) here, we obtain immediately that .
Similarly, we can conclude under the premise of ⊢LA → LB.
(5) It follows immediately from Proposition 3.8 (4).
(6) It follows from v (⇁ A) =1 - v (A) that
Then, we have from (14) that
It can be easily checked that M (⇁ A) is provably equivalent to ⇁LA, applying τ (⇁ A) =1 - τ (A), Propositions 3.8(2) and 3.8(5) here, we conclude immediately that .
Similarly, we can show that .
(7) Assume that both A and B contain n atomic formulae p1, ⋯ , p
n
. Then it is clear that
Hence by (14), we have that
The other two equations can be proved in an analogous way and hence are omitted here.
Let μ be the infinite product measure of probability measures , where each μ
k
satisfies , μ
k
(1) =1. Then for each atomic proposition p
k
;
Nonetheless, we can obtain a weak result as follows.
In this section, based upon the proposed notion of P-rough truth degree, we extent the uncertain measure for set-theoretic rough set model to rough logic theory for the first time. Concretely, we introduce the notions of P-accuracy degree and P-roughness degree, with the aim of measuring the degree to which any rough formula in PRL is accurate and rough, respectively.
Then we call Acc(A), Rou(A) the accuracy degree and the roughness degree of A, respectively.
(2) Similarly, , then by definition, Acc( and Rou(.
The notions of accuracy degree and roughness degree obey the following properties.
Acc(LA) = Acc(MA) = 1, Rou(LA) = Rou(MA) = 0; If A ∼ MA, or equivalently, A ∼ LA, then Acc(A) = 1 and Rou(A) = 0. A ∼ B implies that Acc(A) = Acc(B) and Rou(A) = Rou(B), Acc(A) = 1 if and only if Rou(A) = 0, if and only if ; For a fixed , Acc(A) strictly monotonically increase with ;
For a fixed , Acc(A) strictly monotonically decrease with .
(2) It is clear that A ∼ MA if and only if A ∼ LA. If one of them holds, then by Proposition 3.8(5), we have Acc(A) = τ (MA → LA) = τ (A → A) =1, Rou(A) = 1 - 1 =0.
(3) If A ∼ B, then it can be easily checked that MA → LA is logically equivalent to MB → LB. Hence, by Proposition 3.8(5), τ (MA → LA) = τ (MB → LB), then by Definition 4.1, Acc(A) = Acc(B) and Rou(A) = Rou(B) immediately follow.
Both (4) and (5) follow immediately from Proposition 4.3.
P-rough similarity between formulae in PRL
Recall first that two formulae A and B in PRL are said to be roughly equal if and only if ⊢MA ↔ MB and ⊢LA ↔ LB, they are said to be rough-upperly equal if and only if ⊢MA ↔ MB and rough-lowerly equal if and only if ⊢LA ↔ LB, these notions can be graded to introduce the notions of rough similarity degree, rough upper similarity degree and rough lower similarity degree, respectively. As will be shown below, such notions can be introduced by the notion of rough truth degree in a natural way.
Then we call ξ (A, B) probabilistic rough (upper, lower) similarity degree (P- rough (upper, lower) similarity degree for short) between A and B.
, where B = (Mp1 → Mp2) ⊓ (Mp2 → Mp1).
Similarly, we have that .
P-rough (upper, lower) similarity degree enjoys the following properties.
, ξ (A, B) = ξ (B, A), , ,
, , If ⊢A ↔ B, then ξ (A, B) =1; If ⊢MA ↔ MB, then ; If ⊢LA ↔ LB, then ,
; ξ (A, B) + ξ (B, C) ≤ ξ (A, C) +1;
, If ⊢MA ↔ ⇁ MB, then ,
if ⊢LA ↔ ⇁ LB, then .
(4) It follows immediately from (17–19) and Proposition 3.8(3).
(5) Assume that A, B and C contain the same atomic formulae p1 ⋯ , p n . In what follows, for the sake of convenience, we will use D, E and F to denote (MA → MB) ⊓ (MB → MA), (MB → MC) ⊓ (MC → MB) and (MA → MC) ⊓ (MC → MA), respectively.
Let
Then by (14) and (18),
Trivally, . Then, it follows immediately from M1 ∩ M2 ⊆ M3 that .
The proof of the other inequalities can be given in a similar way, and hence is omitted here.
(6) Assume, without any loss of generality, that A, B and C contain the same atomic formulae p1, ⋯ p n . If ⊢MA ↔ ⇁ MB, we have that v (MA ↔ ⇁ MB) =1 for each v ∈ Ω3, which shows that v (MA) =0, v (MB) =1 or v (MA) =1, v (MB) =0. In either case, one can not obtain v ((MA → MB) ⊓ (MB → MA)) =1, that is, , and hence .
Similarly, we can prove that ⊢LA ↔ ⇁ LB implies .
However, we have the following weak results.
If we can induce a pseudo-metric on the set F (S) of all formulae in PRL which is compatible with the connectives on F (S), then it is convenient for us to study approximate reasoning problem under the framework of rough logic PRL. As we will see, such pseudo-metrics can be induced by the notions of P-rough similarity degree, P-rough upper similarity degree and P-rough lower similarity degree, respectively.
The following proposition states that ρ, and are indeed the pseudo-metrics on the set of rough formulae in PRL.
, ρ (A, B) = ρ (B, A); ; , ρ (A, C) ≤ ρ (A, B) + ρ (B, C);
;
.
By virtue of Proposition 5.7, in what follows, we call ρ, and the rough pseudo-metric, the rough upper pseudo-metric and the rough lower pseudo-metric, respectively.
In this paper, we combine rough logic and probability theory together by means of the infinite product measure on the valuation set 3 in PRL.We initially introduce the notion of P-rough truth degree for formulae in PRL, then, based upon the fundamental notion of P-rough truth degree, the uncertainty measure such as P-accuracy measure, P-roughness measure and P-rough similarity degrees are also proposed and their properties are investigated in detail. It is important to note that it has been shown in [12] that pre-rough logic is equivalent to 3-valued Lukasiewicz logic, then an interesting topic is to extend the present study to multiple-valued logic liking rough sets, we will report it in our forthcoming papers.
