KDD 2011的关于topic modeling的Tutorial
首先,神马是topic model? wikipedia说是这个:
In machine learning and natural language processing, a topic model is a type of statistical model for discovering the abstract “topics” that occur in a collection of documents. An early topic model wasprobabilistic latent semantic indexing (PLSI), created by Thomas Hofmann in 1999.[1] Latent Dirichlet allocation (LDA), perhaps the most common topic model currently in use, is a generalization of PLSI developed by David Blei, Andrew Ng, and Michael Jordan in 2002。
然后这个David Blei在今年的KDD上做了一个Tutorial,有Slides,异常的新鲜。。。大家看看。。(表示我自己不是很懂这个,有懂这个的可以自告奋勇写篇Tutorial。。。)