ElasticSearch ​What is Learning to Rank?

Learning to Rank (LTR) 是一种机器学习技术,用于优化搜索结果的相关性排序。不同于回归预测数值或分类问题,LTR 的目标是构建一个函数,该函数能根据查询来接近用户理想中的文档排序。它不直接预测具体值,而是预测文档相对于查询的相对有用性。通过使用判断列表(评级)来表达理想排序,LTR 旨在提升搜索体验的准确性。
摘要由CSDN通过智能技术生成

What is Learning to Rank?

Learning to Rank (LTR) applies machine learning to search relevance ranking. How does relevance ranking differ from other machine learning problems? Regression is one classic machine learning problem. In regression, you’re attempting to predict a variable (such as a stock price) as a function of known information (such as number of company employees, the company’s revenue, etc). In these cases, you’re building a function, say f, that can take what’s known (numEmployees, revenue), and have f output an approximate stock price.

排序学习 (LTR) 将机器学习应用于搜索相关性排名。相关性排名与其他机器学习问题有何不同?回归是一种经典的机器学习问题。在回归中,您试图根据已知信息(例如公司员工人数、公司收入等)预测变量(例如股票价格)。在这些情况下,你正在构建一个函数,比如 f,它可以采用已知的(员工数,收入),并让 f 输出一个近似的股票价格。

Classification is another machine learning problem. With classification, our function f, would classify our company into several categories. For example, profitable or not profitable. Or perhaps whether or not the company is evading taxes.

In Learning to Rank, the function f we want to learn does not make a direct prediction. Rather it’s used for ranking documents. We want a function f that comes as close as possible to our user’s sense of the ideal ordering of documents dependent on a query. The value output by f itself has no meaning (it’s not a stock price or a category). It’s more a prediction of a users’ sense of the relative usefulnes of a document given a query.

Here, we’ll briefly walk through the 10,000 meter view of Learning to Rank. For more information, we recommend blog articles How is Search Different From Other Machine Learning Problems? and What is Learning to Rank?.

Judgments: expression of the ideal ordering

Judgment lists, sometimes referred to as “golden sets” grade individual search results for a keyword search. For example, our demo uses TheMovieDB. When users search for “Rambo” we can indicate which movies ought to come back for “Rambo” based on our user’s expectations of search.

For example, we know these movies are very relevant:

  • First Blood
  • Rambo

We know these sequels are fairly relevant, but not exactly relevant:

  • Rambo III
  • Rambo First Blood, Part II

Some movies that star Sylvester Stallone are only tangentially relevant:

  • Rocky
  • Cobra

And of course many movies are not even close:

  • Bambi
  • First Daughter

Judgment lists apply “grades” to documents for a keyword, this helps establish the ideal ordering for a given keyword. For example, if we grade documents from 0-4, where 4 is exactly relevant. The above would turn into the judgment list:

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包

打赏作者

AI生成曾小健

你的鼓励将是我创作的最大动力

¥1 ¥2 ¥4 ¥6 ¥10 ¥20
扫码支付:¥1
获取中
扫码支付

您的余额不足,请更换扫码支付或充值

打赏作者

实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值