分析并统计一个文本文件中各个词出现的频率

最新推荐文章于 2024-06-11 00:15:00 发布

Tab-妳

最新推荐文章于 2024-06-11 00:15:00 发布

阅读量1.9k

点赞数 3

本文链接：https://blog.csdn.net/yangshoujian/article/details/42713381

版权

该程序用于读取30KB至300KB文本文件，统计单词频率并输出前10高频词。初版使用输入输出函数，Java版增加了性能分析，使用Java VisualVM监控CPU、内存和线程。

摘要由CSDN通过智能技术生成

作业要求：

写一个程序，分析一个文本文件中各个词出现的频率，并且把频率最高的10个词打印出来。文本文件大约是30KB～300KB大小。

程序实现思路：

首先要读取一个文本文件中的内容，需要输入输出函数，然后统计单词出现频率，在这步需要先统计一共有多少单词分别是哪些，这些单词分别有多少个，再计算出其频率即可，对出现频率进行排序，最后是打印出频率最高的10个单词。

源代码：

#include<stdio.h>
#include<stdlib.h>
#include<string.h>
struct wordsdata {
	char words[30];
	int m;
	struct wordsdata *next;
};
int main()
{
	struct wordsdata *head=NULL;
	struct wordsdata *q;
	FILE *fp;
	int i;
	int a[10];
	char b;
	for (i=0;i<10;i++)
	{
		a[i]=0;
	}
	fp=fopen("E://zuoye.txt","r");//读取本地文本文件
	while (!feof(fp))
	{
		char *p=(char*)malloc(30*sizeof(char));
		fscanf(fp,"%s",p);
		if (head==NULL)//分析单词频率
		{
			struct wordsdata *temp =(struct wordsdata*)malloc(sizeof(struct wordsdata));
			strcpy(temp->words,p);
			temp->m=1;
			temp->next=NULL;
			head=temp;
		}
		else
		{
			struct wordsdata *l=head;
			while(l!=NULL)
			{
				if (strcmp(l->words,p)==0)
				{
					int count =l->m;
					count++;
					l->m=count;
					break;
				}
				l=l->next;
			}
			if (l==NULL)
			{
				struct wordsdata *temp=(struct wordsdata*)malloc(sizeof(struct wordsdata));
				strcpy(temp->words,p);
				temp->m=1;
				temp->next=head;
				head=temp;
			}
		}
	}
	printf("单词出现频率由高到低：\n");//排序出现频率最高的10个单词并输出
	for (i=0;i<10;i++)
	{
		q=head;
		while (q!=NULL)
		{
			if(q->m>a[i])
				a[i]=q->m;
			else
				q=q->next;
		}
		q=head;
		while (q!=NULL)
		{
			if (a[i]==q->m)
			{
				q->m=0;
				printf("%s\t\n",q->words);
				printf("单词出现频率：%d\t\

最低0.47元/天解锁文章

Tab-妳

关注

3
点赞
踩
10

收藏

觉得还不错? 一键收藏
0
评论
分析并统计一个文本文件中各个词出现的频率

作业要求：写一个程序，分析一个文本文件中各个词出现的频率，并且把频率最高的10个词打印出来。文本文件大约是30KB～300KB大小。程序实现思路：首先要读取一个文本文件中的内容，需要输入输出函数，然后统计单词出现频率，在这步需要先统计一共有多少单词分别是哪些，这些单词分别有多少个，再计算出其频率即可，对出现频率进行排序，最后是打印出频率最高的10个单
复制链接

扫一扫