对于简单的例子不用使用hadoop也可运行相关例子,也不用安装maven到eclipse,在Windows即可运行。
1、数据准备
GroupLens_MovieLens数据集
去官网下载很慢,有人放在CSDN上了,http://download.csdn.net/download/zhongwen7710/8510549,
2、使用mathout
mahout 下载,网上给的很多网址都不能用,目前能用的:http://archive.apache.org/dist/mahout/有各个版本。
mahout-distribution-0.6.zip解压后可以看到几个jar文件,
只需要在eclipse创建工程时导入即可,
然后将例程复制进来即可,举个例子:
<span style="white-space:pre"> </span> RandomUtils.useTestSeed();
DataModel model = new GroupLensDataModel(new File("D:\\mahout学习资料\\ml-10m\\ml-10M100K\\ratings.dat"));
RecommenderIRStatsEvaluator evaluator =
new GenericRecommenderIRStatsEvaluator();
// Build the same recommender for testing that we did last time:
RecommenderBuilder recommenderBuilder = new RecommenderBuilder() {
@Override
public Recommender buildRecommender(DataModel model) throws TasteException {
UserSimilarity similarity = new PearsonCorrelationSimilarity(model);
UserNeighborhood neighborhood =
new NearestNUserNeighborhood(2, similarity, model);
return new GenericUserBasedRecommender(model, neighborhood, similarity);
}
};
// Evaluate precision and recall "at 2":
IRStatistics stats = evaluator.evaluate(recommenderBuilder,
null, model, null, 2,
GenericRecommenderIRStatsEvaluator.CHOOSE_THRESHOLD,
1.0);
System.out.println(stats.getPrecision());
System.out.println(stats.getRecall());
放进主函数即可运行。