Python keyword extraction: extracting keywords with Python RAKE

I am a novice user puzzling over what should be a simple loop problem. I have a local directory containing about 500 .txt files, and I would like to extract keywords from each file using RAKE for Python. I've reviewed the RAKE documentation, but the code suggested in the tutorial extracts keywords from a single document. Can someone please explain how to loop over all the files stored in my local directory? Here's the code from the tutorial, which works really well for a single document.

$ git clone https://github.com/zelandiya/RAKE-tutorial

import rake
import operator

# Arguments: stoplist path, min characters per word,
# max words per phrase, min keyword frequency
rake_object = rake.Rake("SmartStoplist.txt", 5, 3, 4)

with open("data/docs/fao_test/w2167e.txt", 'r') as sample_file:
    text = sample_file.read()

keywords = rake_object.run(text)
print("Keywords:", keywords)

Solution

Create a list of filenames you want to process:

filenames = [
    'data/docs/fao_test/w2167e.txt',
    'some/other/folder/filename.txt',
    # etc...
]

If you don't want to hardcode all the names, you can use the glob module to collect filenames by wildcards.
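As a minimal sketch of the glob approach, the snippet below collects every .txt file directly inside one folder. The helper name collect_txt_files is made up for illustration, and the folder path is simply the one from the question; substitute your own.

```python
import glob
import os


def collect_txt_files(directory):
    """Return a sorted list of paths to the .txt files directly inside `directory`."""
    pattern = os.path.join(directory, "*.txt")
    # glob.glob returns matches in arbitrary order, so sort for repeatable runs.
    return sorted(glob.glob(pattern))


# Folder taken from the question; a missing folder just yields an empty list.
filenames = collect_txt_files("data/docs/fao_test")
```

Note that the "*.txt" pattern does not descend into subfolders.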

Create a dictionary for storing the results:

results = {}

Loop through each filename, reading the contents and storing the Rake results in the dictionary, keyed by filename:

for filename in filenames:
    with open(filename, 'r') as fp:
        results[filename] = rake_object.run(fp.read())
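The loop can also be wrapped in a small function so the extractor is passed in rather than hardcoded. This is only a sketch: extract_keywords and top_keywords are hypothetical helper names, and top_keywords assumes each result is a list (the tutorial's run() appears to return a list of keyword and score pairs, but check that against your copy of rake.py).

```python
def extract_keywords(filenames, extract):
    """Map each filename to extract(text), where text is the file's contents.

    `extract` is any callable that takes a string; with the tutorial code
    it would be rake_object.run.
    """
    results = {}
    for filename in filenames:
        with open(filename, 'r') as fp:
            results[filename] = extract(fp.read())
    return results


def top_keywords(results, n=5):
    """Keep only the first n entries per file (assumes each result is a list)."""
    return {name: keywords[:n] for name, keywords in results.items()}
```

With the tutorial objects in scope, the call would look like results = extract_keywords(filenames, rake_object.run), followed by top_keywords(results) to trim each file's list.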
