Recommender Systems: The Slope One Algorithm

License: CC 4.0 BY-SA

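Slope One is a simple yet effective item-based recommendation algorithm proposed by Daniel Lemire. It aims to be easy to understand, updatable in real time, efficient at query time, friendly to new users, and reasonably accurate. Implemented in Python, it can quickly predict a user's ratings for items they have not rated yet, which makes it suitable for online rating-based collaborative filtering. This article introduces the idea behind the algorithm and provides a Python code example.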

Slope One is an item-based recommendation algorithm proposed by Professor Daniel Lemire in 2005.
It tries to meet the following five goals at the same time:

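   1. Easy to implement and maintain: an average engineer can easily interpret all of the aggregated data, and the algorithm is easy to implement and test.
   2. Updatable on the fly: adding a new rating should immediately affect the predictions.
   3. Efficient at query time: queries should execute quickly, possibly at the cost of additional storage.
   4. Expects little from first-time visitors: a user who has rated only a few items should still receive useful recommendations.
   5. Reasonably accurate: the method should be competitive with the most accurate approaches, and a small gain in accuracy is not worth a large sacrifice in simplicity or scalability.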

The following example, taken from the original figure, explains the Slope One algorithm in a nutshell.

   1. User A rates Item I as 1 and Item J as 1.5.
   2. User B rates Item I as 2.
   3. The question: how would User B rate Item J?
   4. With Slope One, the answer is 2.5: 2 + (1.5 - 1) = 2.5.
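
The four steps above can be reproduced with a few lines of Python. This is only a minimal sketch of the single-pair case; the function name `predict_from_one_pair` and the dictionary layout are assumptions made for illustration, not part of any published implementation.

```python
# Ratings from the example: user -> {item: rating}
ratings = {
    "A": {"I": 1.0, "J": 1.5},
    "B": {"I": 2.0},
}


def predict_from_one_pair(ratings, user, known_item, target_item):
    """Predict `user`'s rating of `target_item` from their rating of `known_item`."""
    # Rating differences (target - known) from every user who rated both items.
    diffs = [
        r[target_item] - r[known_item]
        for r in ratings.values()
        if known_item in r and target_item in r
    ]
    if not diffs:
        raise ValueError("no user has rated both items")
    average_diff = sum(diffs) / len(diffs)
    # Shift the user's known rating by the average difference.
    return ratings[user][known_item] + average_diff


print(predict_from_one_pair(ratings, "B", "I", "J"))  # 2.5
```

Only User A has rated both items, so the average difference is 1.5 - 1 = 0.5, and User B's known rating of 2 is shifted by that amount.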

Simple, isn't it? Slope One really is that simple, and yet it is surprisingly effective. For a detailed experimental analysis, see the paper “Slope One Predictors for Online Rating-Based Collaborative Filtering”.
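
For readers who prefer a formula, the basic (unweighted) scheme can be written roughly as follows. The notation is my summary of the scheme in the cited paper: $u_i$ is user $u$'s rating of item $i$, $S_{j,i}$ is the set of users who rated both items $j$ and $i$, and $R_j$ is the set of items rated by $u$, other than $j$, for which $S_{j,i}$ is non-empty.

$$
\mathrm{dev}_{j,i} = \sum_{u \in S_{j,i}} \frac{u_j - u_i}{|S_{j,i}|},
\qquad
P(u)_j = \frac{1}{|R_j|} \sum_{i \in R_j} \left( \mathrm{dev}_{j,i} + u_i \right)
$$

Applied to the example, only User A has rated both items, so $\mathrm{dev}_{J,I} = 1.5 - 1 = 0.5$ and $P(B)_J = 0.5 + 2 = 2.5$.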

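If you like Python, see the blog post “tutorial about how to implement Slope One in Python”, which walks through an implementation of Slope One in Python in great detail. It is, of course, only a very simple implementation; you can use the MovieLens or EachMovie data sets to run some simple experiments with it. Putting it into a commercial environment, however, would require a good deal of additional work.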
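
To give a rough idea of what such a simple implementation might look like, here is a sketch of the weighted variant of Slope One, in which each item pair's average difference is weighted by the number of users who rated both items. It is not the code from the tutorial mentioned above; the class name `SlopeOne` and its `update` and `predict` methods are hypothetical names chosen for this illustration.

```python
from collections import defaultdict


class SlopeOne:
    """Illustrative weighted Slope One predictor (all names are hypothetical)."""

    def __init__(self):
        # diff_sums[j][i]: sum over users of (rating of j - rating of i)
        # counts[j][i]: number of users who rated both j and i
        self.diff_sums = defaultdict(lambda: defaultdict(float))
        self.counts = defaultdict(lambda: defaultdict(int))

    def update(self, user_ratings):
        """Fold one user's complete ratings ({item: rating}) into the aggregates."""
        for j, rating_j in user_ratings.items():
            for i, rating_i in user_ratings.items():
                if i == j:
                    continue
                self.diff_sums[j][i] += rating_j - rating_i
                self.counts[j][i] += 1

    def predict(self, user_ratings):
        """Return {item: predicted rating} for items the user has not rated."""
        numerators = defaultdict(float)
        denominators = defaultdict(int)
        for i, rating_i in user_ratings.items():
            for j in self.counts:
                n = self.counts[j].get(i, 0)
                if j in user_ratings or n == 0:
                    continue
                average_diff = self.diff_sums[j][i] / n
                # Weight each pair's estimate by how many users support it.
                numerators[j] += (average_diff + rating_i) * n
                denominators[j] += n
        return {j: numerators[j] / denominators[j] for j in numerators}


model = SlopeOne()
model.update({"I": 1.0, "J": 1.5})   # User A
print(model.predict({"I": 2.0}))     # User B -> {'J': 2.5}
```

Storing only the pairwise sums and counts is what makes the first three goals plausible: a new user's ratings are folded in with simple additions, and a prediction is a lookup plus a weighted average, at the cost of memory that grows with the number of item pairs. This sketch assumes each user's ratings arrive as one complete dictionary; handling a single new rating from an existing user would need an extra update path.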

Python implementation

During a lunchtime conversation the other day, a coworker mentioned that he was hacking in his spare time on an entry for the Netflix Prize. This got me to thinking about collaborative filtering: why had I never seen a good description of how to do it? I suspect that people who might ordinarily have a casual interest in the subject hear that there are some statistics involved, whereupon they immediately freeze in the mathematical headlights, and turn the conversation to something else, anything else.

In early 2005, a researcher named Daniel Lemire published, with Anna Maclachlan, a paper with the jazzy title of “Slope One Predictors for Online Rating-Based Collaborative Filtering”. This is an important paper, because it presents a family of really simple collaborative filtering schemes. I mean really simple: there are no statistics involved, just a little bit of linear algebra. Unfortunately, because the Lemire/Maclachlan paper is aimed at an academic audience, it’s not trivial to tell by merely reading it how simple the technique it presents is, so what follows is my attempt to translate its ideas into my native language of Attention Deficit Programmerese. To make things more concrete, I’m going to present an implementation in less than 40 lines of Python (and I’ll try to explain any obscure bits of Python as I go). Cool, huh?

Regardless of the underlying implementation, collaborative filters tend to try to solve the same broad problem using much
