Some Good Reference Materials on Information Retrieval (IR)

Information Retrieval Resources

Information on Information Retrieval (IR) books, courses, conferences and other resources.

Books on Information Retrieval (General)
Introduction to Information Retrieval. C.D. Manning, P. Raghavan, H. Schütze. Cambridge UP, 2008. Classical and web information retrieval systems: algorithms, mathematical foundations and practical issues.
Modern Information Retrieval. R. Baeza-Yates, B. Ribeiro-Neto. Addison-Wesley, 1999. Widely used and cited.
Information Retrieval: Algorithms and Heuristics. D.A. Grossman, O. Frieder. Springer, 2004. Excellent textbook.
Managing Gigabytes. I.H. Witten, A. Moffat, T.C. Bell. Morgan Kaufmann, 1999. The authority on index construction and compression.
Finding Out About. R. Belew. Cambridge UP, 2001. More suitable for undergraduate classes than other books listed here.
Information Retrieval: A Health and Biomedical Perspective. W.R. Hersh. Springer, 2002. As the title says: a health/biomedical perspective.
TREC: Experiment and Evaluation in Information Retrieval. E.M. Voorhees, D.K. Harman. MIT Press, 2005. A survey of recent research results.
Language Modeling for Information Retrieval. W.B. Croft, J. Lafferty. Springer, 2003. Language models are of increasing importance in IR.
Readings in Information Retrieval. K. Sparck Jones, P. Willett. Morgan Kaufmann, 1997. A collection of classical IR papers.
Recommended Reading for IR Research Students. A. Moffat, J. Zobel, D. Hawking. SIGIR Forum, 39(2), 2005. Not a book, but a collection of seminal papers, more up-to-date than Sparck-Jones et al.
Information Storage and Retrieval Systems. G. Kowalski, M.T. Maybury. Springer, 2005. "... takes a system approach, discussing all aspects of an Information Retrieval System."
The Geometry of Information Retrieval. C.J. van Rijsbergen. Cambridge UP, 2004. An ambitious attempt to develop quantum mechanics as a new foundation for IR.
Introduction to Modern Information Retrieval. G.G. Chowdhury. Neal-Schuman, 2003. Intended for students of library and information studies.
Text Information Retrieval Systems. C. Meadow, B. Boyce, D. Kraft. Academic Press, 2000. Also takes a library/information science perspective.
More Books

Books on Web Information Retrieval
Mining the Web: Analysis of Hypertext and Semi Structured Data. S. Chakrabarti. Morgan Kaufmann, 2002. The best introduction for web-centric IR.
Google's PageRank and Beyond: The Science of Search Engine Rankings. A.N. Langville, C.D. Meyer. Princeton University Press, 2006. More focused on the algorithms of PageRank, but also covers general web IR.
Modeling the Internet and the Web: Probabilistic Methods and Algorithms. P. Baldi, P. Frasconi, P. Smyth. Wiley, 2003. A bit terse. Recommended for those who have a good foundation in probability theory, but are new to IR.

Good books for implementing a search engine
Managing Gigabytes (see above)
Building Search Applications: Lucene, Lingpipe, and Gate. M. Konchady. Mustru Publishing, 2008.
Lucene in Action. O. Gospodnetic, E. Hatcher. Manning Publications, 2004.
Spidering Hacks. K. Hemenway, T. Calishain. O'Reilly, 2003.

Online Books - Browsable
Introduction to Information Retrieval (see above)
Finding Out About (see above)
Information Retrieval. C. J. van Rijsbergen. Butterworths, 1979. The classic. Almost 40 years old, but still worth reading.
Information Retrieval. T. van der Weide. 2004. Introduction to IR and hypertext.

Online Books - PDF
Introduction to Information Retrieval (see above)
Search Engines: Information Retrieval in Practice. B. Croft, D. Metzler, T. Strohman. Pearson Education, 2009. (two chapters)
Information Retrieval. C. J. van Rijsbergen. Butterworths, 1979.
Information Retrieval Interaction. P. Ingwersen. Taylor Graham, 1992. Focuses on user interaction in IR.
Information Retrieval: A Survey. Ed Greengrass. 2000. Good survey of "classical" IR, but little or no coverage of recent work (e.g., language models, PageRank, SVMs).
Various tutorials at Mi Islita

Research Centers
CMU (LTI)
Dublin CU
Geneva (Viper)
Glasgow
Helsinki Institute for Information Technology
IBM
Illinois Institute of Technology
Information Retrieval Facility (IRF)
Microsoft Research
NIST
Peking
Pittsburgh
Queen Mary
Sheffield
UIUC
UMASS

Courses
Berkeley (SIMS)
CMU
Cornell
DePaul
IIT
Johns Hopkins I
Johns Hopkins II
Maryland
MPI
Otago
Princeton
Stanford
Stuttgart
Texas
UMASS

Problem Sets / Assignments
Bilkent
DePaul
Georgetown
Minas Gerais
North Texas
Stuttgart
Tennessee

Web Information Retrieval
webir.org
Search Engine Watch
Users' Guide to Web Searching
PageRank

Subareas, Applications, Methods
Information Retrieval & Extraction
Information Retrieval & Machine Learning
Text Mining & Web Mining
INEX: XML retrieval
Geographic Information Retrieval
Music Information Retrieval
CLIR & Multilingual Information Retrieval
Cross-Language Information Retrieval (CLIR) Resources
N-Grams in Information Retrieval
Agent-based Information Retrieval
Audio Information Retrieval
Adversarial Information Retrieval

Conferences
TREC
Cross Language Evaluation Forum (CLEF)
SIGIR 2007 (last), SIGIR 2008 (next)
CIKM 2007, CIKM 2008
WWW 2008, WWW 2009
JCDL 2008, JCDL 2009
RIAO 2004, RIAO 2007
ECIR 2008, ECIR 2009
AIRS 2006, AIRS 2008
SPIRE 2007, SPIRE 2008
Norbert Fuhr's IR conference calendar

Journals
ACM Transactions on Information Systems (TOIS): dblp home
Information Processing and Management (IP&M): dblp home
Information Retrieval: dblp home
International Journal on Digital Libraries: dblp home
Journal of the American Society for Information Science and Technology (JASIST): dblp home
SIGIR Forum: dblp home
Journal of Documentation
D-Lib Magazine
Data & Knowledge Engineering: dblp home
Information Processing Letters: dblp home
Information Research
Information Systems: dblp home
Journal of Intelligent Information Systems: dblp home
Knowledge and Information Systems: dblp home
Foundations and Trends in Information Retrieval: home

Popular Articles
Wikipedia: Information Retrieval
A. Singhal: Modern Information Retrieval: A Brief Overview
S.E. Robertson, K. Sparck Jones: Simple, proven approaches to text retrieval
Bruce Croft: What Do People Want From IR
Information Retrieval on the World Wide Web
Michael Lesk: The Seven Ages of Information Retrieval

Software
C. Middleton, R. Baeza-Yates: A Comparison of Open Source Search Engines (contains an up-to-date list of available search engine software)
Doug Oard's list of available text retrieval systems
Avi Rappoport: open source search engines
MySQL full text search
Text to Matrix Generator, a MATLAB toolbox for indexing, retrieval and other text processing tasks

Collections
U. of Glasgow list of available text retrieval collections
NLP/IR corpus list at NUS
NLP/IR corpus list at Edinburgh
SMART at Cornell (downloads of a number of collections, stop lists, SMART retrieval system etc.)
Internet Archive (limited availability)
Linguistic Data Consortium

Professional Organizations
ACM SIGIR
BCS IRSG

Other Collections of Information Retrieval Links
ACM SIGIR
David Karger

Other Resources
Glossary (Modern Information Retrieval)
Information retrieval research links @ Search Tools
BUBL: Information Retrieval Links
LSU: Information Retrieval Systems
Open Directory: Information Retrieval Links
UBC: Indexing Resources
IR & Neural Networks, Symbolic Learning, Genetic Algorithms
A stop list (a list of stop words)
Chris Manning's NLP resources
Weiguo Patrick Fan's text mining links
