算法测试系列：Parsing Words

最新推荐文章于 2024-07-27 12:20:46 发布

Jason♠️

最新推荐文章于 2024-07-27 12:20:46 发布

阅读量154

点赞数

分类专栏：算法题文章标签： python 算法

本文链接：https://blog.csdn.net/weixin_45138266/article/details/105028431

版权

算法题专栏收录该内容

12 篇文章 0 订阅

订阅专栏

Parsing Words

We define a word as any sequence of one or more lower-case letters (no numbers, no punctuation) where words are separated by white space.

Write a function that takes a list of input lines and produces a string that contains the following :

the count of words in the input
the word “words”
each unique word, and the count of times it occurs in the input (listed in alphabetical order, each on its own line, with a space between the word and count)
the word “letters”
for every letter from a to z, the letter, and the count of times that letter occurred IN A WORD in the input (listed in alphabetical order, each on its own line, with a space between the letter and count).

There must be “whitespace” separating valid words in the input – actual spaces, and newlines. If your program finds something that is not whitespace, and not a word, it should skip until it comes to a valid word (or the end of the input). Finding a non-word character next to word-
characters makes the whole sequence a non-word.

import sys, re, string
from io import StringIO

data = sys.stdin.readlines()
tmp = []
findNonWord = False
wordList = []

for iLine in data:
    findNonWord = False
    for idx in range(len(iLine)):
        iChar = iLine[idx]
        if re.match("[a-z]", iChar) and not findNonWord:
            tmp.append(iChar)
        elif iChar == ' ':
            if not findNonWord and len(tmp)>0:
                wordList.append(''.join(tmp))
            tmp = []
            findNonWord = False
        else:
            findNonWord = True
            tmp = []

        if idx==len(iLine)-1:
            if not findNonWord and len(tmp)>0:
                wordList.append(''.join(tmp))
            tmp = []
print(wordList)
print('')

wordCount = len(wordList)
print(wordCount)
print('words')
wordSet = sorted(set(wordList))

for iWord in wordSet:
    print (iWord + " " + str(wordList.count(iWord)))

print('letters')

for iLetters in string.ascii_lowercase[:26]:
    count = 0
    for iWord in wordList:
        count += iWord.count(iLetters)
    print(iLetters + " " + str(count))
    count = 0

Jason♠️

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
算法测试系列：Parsing Words

Parsing WordsWe define a word as any sequence of one or more lower-case letters (no numbers, no punctuation) where words are separated by white space.Write a function that takes a list of input line...
复制链接

扫一扫