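1. Course Description
This course covers basic and advanced techniques for building text-based information retrieval systems, including the following topics: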
- Efficient text indexing
- Boolean and vector-space retrieval models
- Evaluation and interface issues
- IR techniques for the web, including crawling, link-based algorithms, and metadata usage
- Document clustering and classification
- Traditional and machine learning-based ranking approaches
2. Textbooks
- (IIR) Introduction to Information Retrieval, by C. Manning, P. Raghavan, and H. Schütze (Cambridge University Press, 2008).
- (MG) Managing Gigabytes, by I. Witten, A. Moffat, and T. Bell.
- (IRAH) Information Retrieval: Algorithms and Heuristics, by D. Grossman and O. Frieder.
- (MIR) Modern Information Retrieval, by R. Baeza-Yates and B. Ribeiro-Neto.
- (FSNLP) Foundations o