python统计文本单词总数_python统计文本文件内单词数量的方法

本文实例讲述了python统计文本文件内单词数量的方法。分享给大家供大家参考。具体实现方法如下:

# count lines, sentences, and words of a text file

# set all the counters to zero

lines, blanklines, sentences, words = 0, 0, 0, 0

print '-' * 50

try:

# use a text file you have, or google for this one ...

filename = 'GettysburgAddress.txt'

textf = open(filename, 'r')

except IOError:

print 'Cannot open file %s for reading' % filename

import sys

sys.exit(0)

# reads one line at a time

for line in textf:

print line, # test

lines += 1

if line.startswith('\n'):

blanklines += 1

else:

# assume that each sentence ends with . or ! or ?

# so simply count these characters

sentences += line.count('.') + line.count('!') + line.count('?')

# create a list of words

# use None to split at any whitespace regardless of length

# so for instance double space counts as one space

tempwords = line.split(None)

print tempwords # test

# word total count

words += len(tempwords)

textf.close()

print '-' * 50

print "Lines : ", lines

print "Blank lines: ", blanklines

print "Sentences : ", sentences

print "Words : ", words

# optional console wait for keypress

from msvcrt import getch

getch()

希望本文所述对大家的python程序设计有所帮助。

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值