UIUC大学之Coursera课程Text Retrieval and Search Engines:Week 1 Quiz

Week 1 QuizHelp Center

Warning: The hard deadline has passed. You can attempt it, but you will not get credit for it. You are welcome to try it as a learning exercise.

Question 1

The sentence “A man saw a boy with a telescope” is syntactically ambiguous and has two distinct syntactic structures.

Question 2

Which of the following is false:

Question 3

Consider the instantiation of the vector space model where documents and queries are represented as  bit vectors. Assume we have the following query and two documents: 

Q = “healthy diet plans” 
D1 = “healthy plans for weight loss. Check out other healthy plans” 
D2 = “the presidential candidate plans to change the educational system.” 

Let V(X) = [b1 b2 b3] represent a part of the bit vector for document or query X, where b1, b2, and b3 are the bits corresponding to “healthy”, “diet”, and “plans”, respectively. Which of the following is true:

Question 4

Consider the same scenario as in question (3) with dot product as the similarity measure. Which of the following is true:

Question 5

When we use the Okapi/BM25 retrieval function to score documents for a query that has only one term, the ranking of documents is not affected by IDF weighting, i.e. if we remove the IDF weighting term from the ranking function, we will still get the same ranked list of documents.

Question 6

Which of the following is  not true about the Okapi/BM25 ranking function:

Question 7

Suppose we compute the term vector for a baseball sports news article in a collection of general news articles using  TF weighting only. Which of the following words do you expect to have the highest weight?

Question 8

Assume the same scenario as in (7) but with  TF-IDF weighting. Which of the following words do you expect to have the highest weight in this case?

Question 9

Consider the following retrieval formula: 



where c(w, D) is the count of word w in document D, dl is the document length, avdl is the average document length of the collection, N is the total number of documents in the collection, and df (w) is the number of documents containing word w. Which of the following is true about the given scoring function:

Question 10

When using the Okapi/BM25 retrieval function on a corpus where each document has exactly the same length, removing the document length normalization term from the retrieval function will change the ranking of documents.
  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值