文章目录 Building a spam classier Recommended approach Error Analysis Building a spam classier 1.选择样本中出现频率最高的n(10000)个单词作为特征值,构成特征向量。 x x x = features of email, y y y = spam(1) or not spam (0). Featu