Topic model:materials and tools

http://www.cs.princeton.edu/~blei/topicmodeling.html

Topic modeling

Topic models are a suite of algorithms that uncover the hidden thematic structure in document collections. These algorithms help us develop new ways to search, browse and summarize large archives of texts.

Below, you will find links to introductory materials, corpus browsers based on topic models, and open source software (from my research group) for topic modeling.

Introductory materials
Corpus browsers based on topic models

The structure uncovered by topic models can be used to explore an otherwise unorganized collection. The following are browsers of large collections of documents, built with topic models.

Also see Sean Gerrish's  discipline browser  for an interesting application of topic modeling at JSTOR.

To build your own browsers, see Allison Chaney's excellent Topic Model Visualization Engine (TMVE). For example, here is a browser of 100,000 Wikipedia articles that uses TMVE.

Topic modeling software

Our research group has released many open-source software packages for topic modeling. Please post questions, comments, and suggestions about this code to the topic models mailing list. 

LinkModel/AlgorithmLanguageAuthorNotes
lda-cLatent Dirichlet allocationCD. BleiThis implements variational inference for LDA.
class-sldaSupervised topic models for classifiationC++C. WangImplements supervised topic models with a categorical response.
ldaR package for Gibbs sampling in many modelsRJ. ChangImplements many models and is fast . Supports LDA, RTMs (for networked documents), MMSB (for network data), and sLDA (with a continuous response).
online ldaOnline inference for LDAPythonM. HoffmanFits topic models to massive data. The demo downloads random Wikipedia articles and fits a topic model to them.
online hdpOnline inference for the HDPPythonC. WangFits hierarchical Dirichlet process topic models to massive data. The algorithm determines the number of topics.
tmve(online)Topic Model Visualization EnginePythonA. ChaneyA package for creating corpus browsers. See, for example, Wikipedia .
ctrCollaborative modeling for recommendationC++C. WangImplements variational inference for a collaborative topic models. These models recommend items to users based on item content and other users' ratings.
dtmDynamic topic models and the influence modelC++S. GerrishThis implements topics that change over time and a model of how individual documents predict that change.
hdpHierarchical Dirichlet processesC++C. WangTopic models where the data determine the number of topics. This implements Gibbs sampling.
ctm-cCorrelated topic modelsCD. BleiThis implements variational inference for the CTM.
dilnDiscrete infinite logistic normalCJ. PaisleyThis implements the discrete infinite logistic normal, a Bayesian nonparametric topic model that finds correlated topics.
hldaHierarchical latent Dirichlet allocationCD. BleiThis implements a topic model that finds a hierarchy of topics. The structure of the hierarchy is determined by the data.
turbotopicsTurbo topicsPythonD. BleiTurbo topics find significant multiword phrases in topics.

  • 0
    点赞
  • 1
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
很抱歉,我无法回答关于"model:wj03"的问题。可以提供更多的上下文或者更具体的问题吗?这样我才能为您提供帮助。<span class="em">1</span><span class="em">2</span><span class="em">3</span> #### 引用[.reference_title] - *1* [ahpmatlab代码-TCrelay-Morris-Lecar-model:我们研究了带有多个扩展的Morris-Lecar模型中的尖峰序列](https://download.csdn.net/download/weixin_38602098/19140751)[target="_blank" data-report-click={"spm":"1018.2226.3001.9630","extra":{"utm_source":"vip_chatgpt_common_search_pc_result","utm_medium":"distribute.pc_search_result.none-task-cask-2~all~insert_cask~default-1-null.142^v93^chatsearchT3_2"}}] [.reference_item style="max-width: 33.333333333333336%"] - *2* [主题模型TopicModel:Unigram、LSA、PLSA模型](https://blog.csdn.net/chuange6363/article/details/100752944)[target="_blank" data-report-click={"spm":"1018.2226.3001.9630","extra":{"utm_source":"vip_chatgpt_common_search_pc_result","utm_medium":"distribute.pc_search_result.none-task-cask-2~all~insert_cask~default-1-null.142^v93^chatsearchT3_2"}}] [.reference_item style="max-width: 33.333333333333336%"] - *3* [2019.01.11 应用结构 --》model-》最佳实践](https://blog.csdn.net/tangerine_/article/details/86301568)[target="_blank" data-report-click={"spm":"1018.2226.3001.9630","extra":{"utm_source":"vip_chatgpt_common_search_pc_result","utm_medium":"distribute.pc_search_result.none-task-cask-2~all~insert_cask~default-1-null.142^v93^chatsearchT3_2"}}] [.reference_item style="max-width: 33.333333333333336%"] [ .reference_list ]
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值