Scalable Recognition with a Vocabulary Tree

ZhiyiQian

Article link: https://blog.csdn.net/ZhiyiQian/article/details/26587397

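This article introduces a method for large-scale image recognition using a vocabulary tree, covering the principles of the text retrieval approach, the construction of visual words, hierarchical clustering for the vocabulary tree, and similarity measurement based on relevance scores. Representing each image as a histogram vector of visual words and applying TF-IDF weights enables fast image retrieval.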

Summary generated automatically by CSDN.

1. Text Retrieval Approach

The text retrieval approach proceeds as follows:

(1) Parse an article into words;

(2) Reduce words to their stems: variants such as "walk", "walking", and "walks" share the stem "walk";

(3) Exclude words such as "the" and "an", which are extremely common in articles and contribute almost nothing to retrieval;

(4) Represent each article as a histogram vector in which each element is the frequency of a word (more precisely, of a stem). The usual measure is TF (Term Frequency):

$$\mathrm{tf}_i = \frac{n_i}{\sum_{j=1}^{t} n_j}$$

where t is the total number of stems and n_i is the number of words in the article that share stem i;

(5) Weight the stems, because different words contribute differently to retrieval. The usual weight is IDF (Inverse Document Frequency):

$$w_i = \ln\frac{N}{N_i}$$

where w_i is the weight of stem i, N is the total number of articles, and N_i is the number of articles that contain stem i;

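(6) Finally, each article is represented as a vector of TF values weighted by IDF:

$$d = (\mathrm{tf}_1 w_1, \mathrm{tf}_2 w_2, \ldots, \mathrm{tf}_t w_t)$$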
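The six steps can be tried out on a toy corpus. The following is only a minimal sketch, not the method of any particular retrieval system: the stop list, the suffix-stripping `stem` function, the sample articles, and all function names are hypothetical choices made for illustration, and the TF and IDF forms used are the standard ones (count divided by article length, and natural log of N over N_i).

```python
import math
import re
from collections import Counter

# Hypothetical stop list and suffix rules, chosen only for illustration.
STOP_WORDS = {"the", "an", "a", "and", "of", "to", "in", "is"}
SUFFIXES = ("ing", "s")


def stem(word):
    """Crude suffix stripping; a stand-in for a real stemmer."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def parse(article):
    """Steps (1)-(3): split into words, drop stop words, reduce to stems."""
    words = re.findall(r"[a-z]+", article.lower())
    return [stem(w) for w in words if w not in STOP_WORDS]


def tf_idf_vectors(articles):
    """Steps (4)-(6): build one TF-IDF weighted vector per article."""
    parsed = [parse(a) for a in articles]
    vocabulary = sorted({s for stems in parsed for s in stems})
    n_articles = len(parsed)
    # N_i: number of articles that contain stem i at least once.
    doc_freq = Counter(s for stems in parsed for s in set(stems))
    idf = {s: math.log(n_articles / doc_freq[s]) for s in vocabulary}
    vectors = []
    for stems in parsed:
        counts = Counter(stems)
        total = len(stems) or 1  # guard against an empty article
        vectors.append([counts[s] / total * idf[s] for s in vocabulary])
    return vocabulary, vectors


articles = [
    "The dog walks in the park",
    "A dog is walking to the shop",
    "The cat sleeps",
]
vocabulary, vectors = tf_idf_vectors(articles)
for article, vector in zip(articles, vectors):
    print(article, [round(x, 3) for x in vector])
```

A stem that occurs in every article gets weight ln(1) = 0, so it drops out of the comparison automatically. In that sense the IDF weight generalizes the hard stop list of step (3).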