本文为PU-Learning/文本分类/文本聚类/情感分析相关研究提供部分常用数据集下载地址
(所有数据集都有大量文献使用,暂时只列一篇代表性文章)
- Lang K . NewsWeeder : Learning to filter net-news[C]// Twelfth International Conference on International Conference on Machine Learning. Morgan Kaufmann Publishers Inc. 1995.
Sources:http://qwone.com/~jason/20Newsgroups/
- Craven M, Freitag D, Mccallum A, et al. Learning to Extract Symbolic Knowledge from the World Wide Web[C]// Proc of the National Conference on Artificial Intelligence. 1998.
Sources:http://www.cs.cmu.edu/~webkb/
- M. Ott, Y. Choi, C. Cardie, and J.T. Hancock. 2011. Finding Deceptive Opinion Spam by Any Stretch of the Imagination. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies.
Sources:https://myleott.com/op-spam.html
- Reuters-21578<