随机找一篇英文文档txt,统计其中词数、各个字符的数量和频率

先新建一个txt文件,往该文件写一点英文句子;

with open("./我有一个梦想.txt", "w", encoding = "utf-8") as f:
    f.write("""
    I Have a Dream


    I am happy to join with you today in what will go down in history as the greatest demonstration for freedom in the history of our nation.



    Five score years ago, a great American, in whose symbolic shadow we stand today, signed the Emancipation Proclamation. This momentous decree came as a great beacon light of hope to millions of Negro slaves who had been seared in the flames of withering injustice. It came as a joyous daybreak to end the long night of their captivity.



    But one hundred years later, the Negro still is not free. One hundred years later, the life of the Negro is still sadly crippled by the manacles of segregation and the chains of discrimination. One hundred years later, the Negro lives on a lonely island of poverty in the midst of a vast ocean of material prosperity. One hundred years later, the Negro is still languished in the corners of American society and finds himself an exile in his own land. And so we've come here today to dramatize a shameful condition.



    In a sense we've come to our nation's capital to cash a check. When the architects of our republic wrote the magnificent words of the Constitution and the Declaration of Independence, they were signing a promissory note to which every American was to fall heir. This note was a promise that all men, yes, black men as well as white men, would be guaranteed the "unalienable Rights" of "Life, Liberty and the pursuit of Happiness." It is obvious today that America has defaulted on this promissory note, insofar as her citizens of color are concerned. Instead of honoring this sacred obligation, America has given the Negro people a bad check, a check which has come back marked "insufficient funds."



    But we refuse to believe that the bank of justice is bankrupt. We refuse to believe that there are insufficient funds in the great vaults of opportunity of this nation. And so, we've come to cash this check, a check that will give us upon demand the riches of freedom and the security of justice.



    We have also come to this hallowed spot to remind America of the fierce urgency of Now. This is no time to engage in the luxury of cooling off or to take the tranquilizing drug of gradualism. Now is the time to make real the promises of democracy. Now is the time to rise from the dark and desolate valley of segregation to the sunlit path of racial justice. Now is the time to lift our nation from the quicksands of racial injustice to the solid rock of brotherhood. Now is the time to make justice a reality for all of God's children.



    It would be fatal for the nation to overlook the urgency of the moment. This sweltering summer of the Negro's legitimate discontent will not pass until there is an invigorating autumn of freedom and equality. Nineteen sixty-three is not an end, but a beginning. And those who hope that the Negro needed to blow off steam and will now be content will have a rude awakening if the nation returns to business as usual. And there will be neither rest nor tranquility in America until the Negro is granted his citizenship rights. The whirlwinds of revolt will continue to shake the foundations of our nation until the bright day of justice emerges.



    But there is something that I must say to my people, who stand on the warm threshold which leads into the palace of justice: In the process of gaining our rightful place, we must not be guilty of wrongful deeds. Let us not seek to satisfy our thirst for freedom by drinking from the cup of bitterness and hatred. We must forever conduct our struggle on the high plane of dignity and discipline. We must not allow our creative protest to degenerate into physical violence. Again and again, we must rise to the majestic heights of meeting physical force with soul force.



    The marvelous new militancy which has engulfed the Negro community must not lead us to a distrust of all white people, for many of our white brothers, as evidenced by their presence here today, have come to realize that their destiny is tied up with our destiny. And they have come to realize that their freedom is inextricably bound to our freedom.



    We cannot walk alone.



    And as we walk, we must make the pledge that we shall always march ahead.



    We cannot turn back.



    There are those who are asking the devotees of civil rights, "When will you be satisfied?" We can never be satisfied as long as the Negro is the victim of the unspeakable horrors of police brutality. We can never be satisfied as long as our bodies, heavy with the fatigue of travel, cannot gain lodging in the motels of the highways and the hotels of the cities. **We cannot be satisfied as long as the negro's basic mobility is from a smaller ghetto to a larger one. We can never be satisfied as long as our children are stripped of their self-hood and robbed of their dignity by signs stating: "For Whites Only."** We cannot be satisfied as long as a Negro in Mississippi cannot vote and a Negro in New York believes he has nothing for which to vote. No, no, we are not satisfied, and we will not be satisfied until "justice rolls down like waters, and righteousness like a mighty stream."



    I am not unmindful that some of you have come here out of great trials and tribulations. Some of you have come fresh from narrow jail cells. And some of you have come from areas where your quest -- quest for freedom left you battered by the storms of persecution and staggered by the winds of police brutality. You have been the veterans of creative suffering. Continue to work with the faith that unearned suffering is redemptive. Go back to Mississippi, go back to Alabama, go back to South Carolina, go back to Georgia, go back to Louisiana, go back to the slums and ghettos of our northern cities, knowing that somehow this situation can and will be changed.



    Let us not wallow in the valley of despair, I say to you today, my friends.



    And so even though we face the difficulties of today and tomorrow, I still have a dream. It is a dream deeply rooted in the American dream.



    I have a dream that one day this nation will rise up and live out the true meaning of its creed: "We hold these truths to be self-evident, that all men are created equal."



    I have a dream that one day on the red hills of Georgia, the sons of former slaves and the sons of former slave owners will be able to sit down together at the table of brotherhood.



    I have a dream that one day even the state of Mississippi, a state sweltering with the heat of injustice, sweltering with the heat of oppression, will be transformed into an oasis of freedom and justice.



    I have a dream that my four little children will one day live in a nation where they will not be judged by the color of their skin but by the content of their character.



    I have a dream today!



    I have a dream that one day, down in Alabama, with its vicious racists, with its governor having his lips dripping with the words of "interposition" and "nullification" -- one day right there in Alabama little black boys and black girls will be able to join hands with little white boys and white girls as sisters and brothers.



    I have a dream today!



    I have a dream that one day every valley shall be exalted, and every hill and mountain shall be made low, the rough places will be made plain, and the crooked places will be made straight; "and the glory of the Lord shall be revealed and all flesh shall see it together."2



    This is our hope, and this is the faith that I go back to the South with.



    With this faith, we will be able to hew out of the mountain of despair a stone of hope. With this faith, we will be able to transform the jangling discords of our nation into a beautiful symphony of brotherhood. With this faith, we will be able to work together, to pray together, to struggle together, to go to jail together, to stand up for freedom together, knowing that we will be free one day.



    And this will be the day -- this will be the day when all of God's children will be able to sing with new meaning:



    My country 'tis of thee, sweet land of liberty, of thee I sing. Land where my fathers died, land of the Pilgrim's pride, From every mountainside, let freedom ring!



    And if America is to be a great nation, this must become true.



    And so let freedom ring from the prodigious hilltops of New Hampshire.



    Let freedom ring from the mighty mountains of New York.



    Let freedom ring from the heightening Alleghenies of Pennsylvania.



    Let freedom ring from the snow-capped Rockies of Colorado.



    Let freedom ring from the curvaceous slopes of California.



    But not only that:



    Let freedom ring from Stone Mountain of Georgia.



    Let freedom ring from Lookout Mountain of Tennessee.



    Let freedom ring from every hill and molehill of Mississippi.



    From every mountainside, let freedom ring.



    And when this happens, and when we allow freedom ring, when we let it ring from every village and every hamlet, from every state and every city, we will be able to speed up that day when all of God's children, black men and white men, Jews and Gentiles, Protestants and Catholics, will be able to join hands and sing in the words of the old Negro spiritual:



    Free at last! Free at last!



    Thank God Almighty, we are free at last!
""")
    
    
    

 处理文件代码:

class DealText:
    def __init__(self, filename):
        self.filename = filename 
        # 文件名
        self.contents = "" 
        # 文本
        self.words_num = 0
        # 单词数
        self.chars_num = 0
        # 字符数
        self.words_list = self.contents.split()
        # 文本以空格为分隔符,将单词存储在列表中
        
    def count_words(self):
        try: 
        # 有可能产生错误的代码
            with open(self.filename, "r", encoding = "utf-8") as f:
                self.contents = f.read()
        except FileNotFoundError:
        # 发生找不到文件错误时运行
            print(f"Sorry, the file {self.filename} does not exist.")

        # 中间还可以加上其他可能的错误类型:except 错误类型 

        except: 
        # 产生其他错误会运行
            print("是什么错误呢")
        else:
        # 没有错误时会运行
            self.words_list = self.contents.split()
            # 方法spilt()以空格为分隔符将字符串拆分成多个部分,并将这些部分都存储在一个列表中;

            self.words_num = len(self.words_list)

            for word in self.words_list:
                for c in word:
                    if c != ' ':
                        self.chars_num += 1
            # 统计文本字符数
            
            seq = (f"The file {self.filename} has about {self.words_num} words.")
            print(seq)
            with open("./统计结果.txt", "a", encoding="utf-8") as f:
                f.write(f"{seq}\n")
                
        # 后面还可以加上:
        finally:
        # 不管发生错误与否都会运行
            print("不管发生错误与否都会运行")
            
            
    def count_word_num(self, word):
        
        cnt = 0
        for ele in self.words_list:
            if(ele == word):
                cnt += 1
                print(ele)
        return cnt
    
    def count_char_num(self, char):
        cnt = 0
        for word in self.words_list:
            for ele in word:
                if(ele == char and ele != ' '):
                    cnt += 1
        return cnt
    
    def count_word_frequence(self, word):
        cnt = self.count_word_num(word)
        if(sum != 0):
            frequence = cnt / self.words_num
            frequence *= 100 
            # 转换为百分数
        else:
            frequence = 0
        
        seq = (f"{word} 在此篇文章中共有 {cnt} 个词, 占比比率为 %.2f%%" %frequence)
        print(seq)
        with open("./统计结果.txt", "a", encoding="utf-8") as f:
            f.write(seq + '\n')
        
    def count_char_frequence(self, char):
        cnt = self.count_char_num(char)
        
        if(self.chars_num != 0):
            frequence = cnt / self.chars_num
            frequence *= 100 
            # 转换为百分数
        else:
            frequence = 0
        
        seq = (f"{char} 在此篇文章中共有 {cnt} 个字符, 占比比率为 %.2f%%" %frequence)
        print(seq)
        with open("./统计结果.txt", "a", encoding="utf-8") as f:
            f.write(seq + '\n')

while(1):
    text = input("请输入您想要处理的文章名字,并以为结尾, 若不想查询,以按q以结束 ")
    
    if text != "q":
        with open("./统计结果.txt", "w", encoding="utf-8") as f:
            pass
    
        deal_text = DealText(f"{text}.txt")
        deal_text.count_words()
        while(1):
            if(deal_text.chars_num == 0):
                break
            op = input("您想查询字符还是单词,查询字符请按 c,查询单词请按 w,不想查询请按 q  ")
            if op == 'c':
                char= input("请输入您想要查询的字符 ")
                deal_text.count_char_frequence(char)
            elif op == 'w':
                word = input("请输入您想要查询的单词 ")
                deal_text.count_word_frequence(word)
            elif op == 'q': 
                break
    else:
        break


  • 9
    点赞
  • 10
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
可以使用Python中的collections模块中的Counter类来完成这个任务。具体步骤如下: 1. 读取txt文件内容 2. 将文件内容转换为小写字母,去除标点符号和换行符 3. 使用split()方法将字符串分割成单词列表 4. 使用Counter类统计每个单词出现的次数 5. 输出结果 下面是代码示例: ```python from collections import Counter # 读取txt文件内容 with open('example.txt', 'r') as f: text = f.read() # 将文件内容转换为小写字母,去除标点符号和换行符 text = text.lower().replace('\n', ' ').replace(',', '').replace('.', '').replace('!', '').replace('?', '') # 使用split()方法将字符串分割成单词列表 words = text.split() # 使用Counter类统计每个单词出现的次数 word_counts = Counter(words) # 输出结果 for word, count in word_counts.most_common(): print(f'{word}: {count}') ``` 解释一下代码: 1. 首先使用open()函数打开txt文件,使用with语句保证文件在使用后正确关闭。 2. 读取文件内容并赋值给变量text。 3. 对text进行一系列字符串操作,将其转换为小写字母,去除标点符号和换行符。 4. 使用split()方法将text分割成单词列表,并赋值给变量words。 5. 使用Counter类对words中的单词进行统计,得到每个单词出现的次数,并赋值给变量word_counts。 6. 使用most_common()方法对word_counts进行排序,得到出现次数最多的单词列表,并遍历输出每个单词及其出现次数。

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值