C语言:文本中不同单词频率

最新推荐文章于 2021-07-05 12:26:17 发布

ejdjdk

最新推荐文章于 2021-07-05 12:26:17 发布

阅读量2.5k

点赞数 2

分类专栏： C语言文章标签： c语言 c算法

本文链接：https://blog.csdn.net/weixin_42281802/article/details/104092340

版权

本文介绍了如何使用C语言统计文本中不同单词的出现频率。通过利用英文单词间的空格间隔，采用fscanf函数逐个读取单词，并用链表存储，实现对文本单词计数的改进方法。

摘要由CSDN通过智能技术生成

c语言统计文本中不同单词频率

标准英文文章中两个单词间即使有标点符号，也会存在空格，所以可用空格区分单词，可用fscanf函数特性，一次读取一个单词

此为改进版（运用链表存储）

#include<stdio.h>
#include<ctype.h>
#include<string.h>
#include<stdlib.h>
typedef struct word{
	char wrd[20];
	struct word *next;
	int num;
}word;
static int total_words=0;   //单词总数
static int diff_words=0;	//不同单词个数
void insert(word * const head,char *s,int size)
{
	word *cur;
	word *newwrd;
	total_words++;  
	if (!isalpha(s[size-1]))   //因为最后一个字母可能是标点符号，所以去掉
		s[size-1]='\0';
	cur=head->next;
	while (cur!=NULL)    //遍历链表
	{
		if (!strcmp(cur->wrd,s))   //若链表中已存在，num++,并结束此函数
		{
			cur->num++;
			return;
		}
		cur=cur->next;
	}
	newwrd=(word*)malloc(sizeof(word));   //执行到这里说明没有找到相同单词，执行头插法
	newwrd->num=1;
	strcpy(newwrd->wrd,s);
	newwr

最低0.47元/天解锁文章

ejdjdk

关注

2
点赞
踩
18

收藏

觉得还不错? 一键收藏
0
评论
C语言:文本中不同单词频率

c语言读取一个文件，统计文件中英文单词的总数目，以及不同单词出现的次数#include<stdio.h>#include<ctype.h>#include<string.h>typedef struct Word{ char s[20]; int num;} Word; //s存储一个单词，num表示单词个数（"单词表成员"...
复制链接

扫一扫

专栏目录