实际的分词结果有四种:
1. True -> True, TP — true positive (真正, TP)表示被模型预测为正的正样本
2. True -> False,FP — false positive (假正, FP)被模型预测为正的负样本
3. False -> True,TN — true negative (真负 , TN)被模型预测为负的负样本
4. False -> False,FN — false negative (假负 , FN)被模型预测为负的正样本
分词结果评测:
1. 准确率(Precision) — the measure of the proportion of selected items that the system got right:P = TP / ( TP + FP)
2. 召回率(Recall) — the measure of the target items that the system selected:
R = TP / ( TP + FN )
3. F值(F-measure):
F = 2 ∗ PR / (R + P)
NOTE:
Word precision (P) is the percentage of proposed brackets that match word-level brackets in the annotation;
Word recall (R) is the percentage of word-level brackets that are proposed by the algorithm.