资源分享:
1、po一篇免费的停用字下载:
https://blog.csdn.net/u010533386/article/details/51458591
复制以后,粘贴保存到txt文件。然后利用python读取该txt文件时注意使用语句:
stpwrdlst = open(stopword_path).read().replace('\n', ' ').split()
来调整格式,否则程序会出现警告:
UserWarning: Your stop_words may be inconsistent with your preprocessing. Tokenizing the stop words generated tokens [·····] not in stop_words. sorted(inconsistent))
博客笔记:
一、机器学习相关:
1、关于矩阵求协方差详细讲解
http://www.elecfans.com/dianzichangshi/20171205594693.html
2、马氏距离与欧氏距离讲解:
https://blog.csdn.net/sinat_27652257/article/details/80483325
3、关于读取的文件永久转化为对象