Python统计文本文件内单词数量的方法介绍
June 30, 2015
Python统计文本文件内单词数量是如何来实现的呢?下面的内容将会通过具体的实例来演示Python统计文本文件内单词数量的实现方法及相关技巧:
本文实例讲述了python统计文本文件内单词数量的方法。分享给大家供大家参考。具体实现方法如下:
# count lines, sentences, and words of a text file # set all the counters to zero lines, blanklines, sentences, words = 0, 0, 0, 0 print '-' * 50 try: # use a text file you have, or google for this one ... filename = 'GettysburgAddress.txt' textf = open(filename, 'r') except IOError: print 'Cannot open file %s for reading' % filename import sys sys.exit(0) # reads one line at a time for line in textf: print line, # test lines += 1 if line.startswith('n'): blanklines += 1 else: # assume that each sentence ends with . or ! or ? # so simply count these characters sentences += line.count('.') + line.count('!') + line.count('?') # create a list of words # use None to split at any whitespace regardless of length # so for instance double space counts as one space tempwords = line.split(None) print tempwords # test # word total count words += len(tempwords) textf.close() print '-' * 50 print "Lines : ", lines print "Blank lines: ", blanklines print "Sentences : ", sentences print "Words : ", words # optional console wait for keypress from msvcrt import getch getch()
Python统计文本文件内单词数量就是这样,欢迎大家参考。。。。
0 Comments