信息检索的top-R准确率曲线（Precision@top-R Curve）作图

最新推荐文章于 2023-12-20 23:43:20 发布

HackerTom

最新推荐文章于 2023-12-20 23:43:20 发布

阅读量6.6k

点赞数 1

分类专栏：机器学习文章标签：多模态检索 Precision@top-R top-R准确率 python matlab

本文链接：https://blog.csdn.net/HackerTom/article/details/89576824

版权

机器学习专栏收录该内容

120 篇文章 16 订阅

订阅专栏

Notes

多模态检索中常用几种评价指标：

师兄的说法，只要将 P-R 曲线中的 R 从 Recall 改为 top-R 之 R（即第 R 个位置）就行，代码直接从 P-R 曲线作图代码修改而来，同师兄对拍过样例，是一样的。

Code

python

参照前作：信息检索的PR曲线（Precision-Recall Curve）作图

import matplotlib.pyplot as plt
import numpy as np
from scipy.spatial.distance import cdist


# 画 Precision@top-R 曲线
def p_at_topR(qF, rF, qL, rL, what=0, topK=-1):
    n_query = qF.shape[0]
    if topK == -1 or topK > rF.shape[0]:
        topK = rF.shape[0]
    P, R = [], []
    Gnd = (np.dot(qL, rL.transpose()) > 0).astype(np.float32)
    if what == 0:
        Rank = np.argsort(cdist(qF, rF, 'cosine'))
    else:
        Rank = np.argsort(cdist(qF, rF, 'hamming'))

    for k in range(1, topK+1):
        # ground-truth: 1 vs all
        p = np.zeros(n_query)
        # r = np.zeros(n_query)
        for it in range(n_query):
            gnd = Gnd[it]
            gnd_all = np.sum(gnd)
            if gnd_all == 0:
                continue
            # the id of sorted dis
            # (but left dis as it is)
            asc_id = Rank[it][:k]

            gnd = gnd[asc_id]
            gnd_r = np.sum(gnd)

            p[it] = gnd_r / k
            # r[it] = gnd_r / gnd_all

        P.append(np.mean(p))
        # R.append(np.mean(r))
        R.append(k)

    fig = plt.figure(figsize=(5, 5))
    plt.plot(R, P)
    plt.grid(True)
    # plt.xlim(0, 1)
    # plt.ylim(0, 1)
    plt.xlabel('recall')
    plt.ylabel('precision')
    plt.legend()
    plt.show()
    # return R, P

matlab

师兄给的这份代码好像是来自 CCQ 的，见引用[2]

function precision = precision_at_k(ids, Lbase, Lquery)

nquery = size(ids, 2);
K = 1000;
P = zeros(K, nquery);

for i = 1 : nquery
    label = Lquery(i, :);
    label(label == 0) = -1;
    idx = ids(:, i);
    imatch = sum(bsxfun(@eq, Lbase(idx(1:K), :), label), 2) > 0;
    Lk = cumsum(imatch);
    P(:, i) = Lk ./ (1:K)';
end
precision = mean(P, 2);

end