Training Session 5
Assignment 1
Question 1: Write out $\mathbf{U}$, $\mathbf{C}$, $\mathbf{D}$, and $\mathbf{V}$ for this example. Note: the last two attributes are the decision attributes.
$$\mathbf{U} = \{x_1, x_2, \dots, x_7\}$$
$$\mathbf{C} = \{\mathrm{Headache}, \mathrm{Temperature}, \mathrm{Lymphocyte}, \mathrm{Leukocyte}, \mathrm{Eosinophil}\}$$
$$\mathbf{D} = \{\mathrm{Heartbeat}, \mathrm{Flu}\}$$
$$\mathbf{V} = \{\mathrm{Yes}, \mathrm{No}, \mathrm{Normal}, \mathrm{Abnormal}, \mathrm{High}, \mathrm{Low}\}$$
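As a quick sanity check, the four components can be written down as plain Python sets. This is only a sketch: the variable names are illustrative, and the decision table itself is not reproduced here, so the code checks only properties that follow from the listed sets.

```python
# Universe of objects x1..x7 (the strings are just illustrative identifiers).
U = {f"x{i}" for i in range(1, 8)}

# Condition attributes and decision attributes, as listed in the answer.
C = {"Headache", "Temperature", "Lymphocyte", "Leukocyte", "Eosinophil"}
D = {"Heartbeat", "Flu"}

# Union of all attribute value domains.
V = {"Yes", "No", "Normal", "Abnormal", "High", "Low"}

# Condition and decision attributes must not overlap.
assert C.isdisjoint(D)

# 7 objects, 7 attributes in total (5 condition + 2 decision), 6 values.
print(len(U), len(C | D), len(V))
```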
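Question 2: Define a label distribution system, i.e., one in which each label value is not 0/1 but a real number in the interval $[0, 1]$, and the labels of each object sum to 1.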
Definition 3. A label distribution system is a tuple $\mathbf{S} = (\mathbf{X}, \mathbf{Y})$, where
- $\mathbf{X} = [x_{ij}]_{n \times m} \in \mathbb{R}^{n \times m}$ is the data matrix,
- $\mathbf{Y} = [y_{ik}]_{n \times l} \in [0, 1]^{n \times l}$ is the label matrix satisfying
- $\forall i \in \{1, \dots, n\}$: $\sum_{k = 1}^{l} y_{ik} = 1$,
- $n$ is the number of objects,
- $m$ is the number of features,
- $l$ is the number of distribution labels.
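The two conditions on the label matrix (every entry in $[0, 1]$, every row summing to 1) can be checked mechanically. The following is a minimal sketch in plain Python; the function name `is_label_distribution`, the tolerance `tol`, and the sample matrix are all hypothetical and used only for illustration.

```python
import math

def is_label_distribution(Y, tol=1e-9):
    """Return True if every entry of Y lies in [0, 1] and every row sums to 1."""
    for row in Y:
        # Each label value must be a real number in [0, 1].
        if any(y < 0.0 or y > 1.0 for y in row):
            return False
        # The labels of one object must sum to 1 (up to rounding error).
        if not math.isclose(sum(row), 1.0, abs_tol=tol):
            return False
    return True

# Hypothetical label matrix with n = 3 objects and l = 3 labels.
Y = [
    [0.5, 0.25, 0.25],
    [1.0, 0.0, 0.0],
    [0.2, 0.3, 0.5],
]

print(is_label_distribution(Y))             # True
print(is_label_distribution([[0.5, 0.6]]))  # False: the row sums to 1.1
print(is_label_distribution([[1.0, 1.0]]))  # False: a 0/1 multi-label row sums to 2
```

The last call shows how this differs from an ordinary multi-label system: a row with two labels set to 1 has entries in $[0, 1]$ but violates the sum-to-one constraint.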