python生成的词云没有图案,如何在Python中从LDA模型生成词云？

最新推荐文章于 2023-01-29 19:59:57 发布

多弗拉门戈

最新推荐文章于 2023-01-29 19:59:57 发布

阅读量207

点赞数

文章标签： python生成的词云没有图案

该博客介绍了如何使用Python的Gensim库进行LDA主题建模，并从每个主题中提取前20个关键词来创建词云。通过调用Gensim的`show_topic`方法，可以获取每个主题的关键词，并将其写入文件以供后续生成词云使用。

摘要由CSDN通过智能技术生成

I am doing some topic modeling on newspaper articles, and have implemented LDA using gensim in Python3. Now I want to create a word cloud for each topic, using the top 20 words for each topic. I know I can print the words, and save the LDA model, but is there any way to just save the top words for each topic which I can further use for generating word clouds?

I tried to google it, but could not find anything relevant. Any help is appreciated.

解决方案

You can get the topn words from an LDA model using Gensim's built-in method show_topic.

lda = models.LdaModel.load('lda.model')

for i in range(0, lda.num_topics):

with open('output_file.txt', 'w') as outfile:

outfile.write('{}\n'.format('Topic #' + str(i + 1) + ': '))

for word, prob in lda.show_topic(i, topn=20):

outfile.write('{}\n'.format(word.encode('utf-8')))

outfile.write('\n')

This will write a file with a format similar to this:

Topic #69: