字典树Trie的简单理解

最新推荐文章于 2023-12-08 19:47:16 发布

weixin_41157881

最新推荐文章于 2023-12-08 19:47:16 发布

阅读量193

点赞数

分类专栏： C++ 文章标签：字典树 C++

C++ 专栏收录该内容

200 篇文章 1 订阅

订阅专栏

字典树又叫Trie树，是单词储存的一种树形结构。最典型的的应用是统计、排序和保存大量字符串。可以插入单词、统计单词的个数、公共前缀的单词数量，单词的查找功能等。
我们先定义字典树节点的数据结构：
typedef struct Node
{
char value; //表示该节点储存的英文字母
Node *child[26];//用孩子表示下个字母的
bool isWorld; //表示该节点是否为单词
Node()
{
for (int i = 0; i < 26; i++)
{
this->child[i] = nullptr;//初始化默认为nullptr
this->isWorld = false;//初始化默认节点为非单词。
}
}
}TrieNode;
TrieNode *root = nullptr;//定义根节点，字典树中根节点是不存放字母的（也不知道存放哪个啊）
下面是插入单词的代码，通过对判断节点是否存在字母孩子，来进行节点的是否创建。
void insertWorld(char *str)
{
if (root == nullptr)
{
root = new TrieNode();
}
char *str1 = str;
TrieNode *proot = root;
while (*str1 != ‘\0’)
{
char temp = *str1;
if (proot->child[temp - ‘a’] != nullptr)
{
proot = proot->child[temp - ‘a’];
}
else
{
TrieNode *tempNode = new TrieNode();
tempNode->value = temp;
proot->child[temp - ‘a’] = tempNode;
proot = proot->child[temp - ‘a’];
}
str1++;
}
proot->isWorld = true;//当str1遍历完毕后，把该节点表示单词的变量改为true。这是我们进行单词统计、查找的依据。
}
单词个数统计代码，我们采用递归的办法来进行单词的统计。
int calcCountOfWorld(TrieNode *node)//该函数在统计单词公有前缀的时候还会用到，单独写出来。
{
if (node == nullptr)
{
return 0;
}
int sum = 0;
if (node->isWorld == true)//判读当前节点是否为单词节点。
{
sum = 1;
}
for (int i = 0; i < 26; i++)
{
sum += calcCountOfWorld(node->child[i]);//遍历所有的孩子进行单词统计。
}
return sum;
}
int sumOfWorld()
{
TrieNode *proot = root;
if (proot == nullptr)
{
return 0;
}
int sum = 0;
for (int i = 0; i < 26; i++)
{
sum += calcCountOfWorld(proot->child[i]);
}
return sum;
}
单词查找，下面是单词查找的代码：
bool searchWorld(char *str)
{
TrieNode *proot = root;
if (proot == nullptr)
{
return false;
}
char *str1 = str;
while (*str1 != ‘\0’)
{
char temp = *str1;
if (proot->child[temp - ‘a’] == nullptr)
{
return false;//如果某个单词的字母不存在，直接返回false
}
else
{
proot = proot->child[temp - ‘a’];
}
str1++;
}
return proot->isWorld == true;//即使该字符串的所有单词都存在也要判断该节点是否为单词节点。
}
统计相同的单词前缀的数量，代码如下：
int sumOfPrefix(char *str)
{
TrieNode *proot = root;
if (proot == nullptr)
{
return 0;
}
char *str1 = str;
while (*str1 != ‘\0’)
{
char temp = *str1;
if (proot->child[temp - ‘a’] == nullptr) //为nllptr，说明不存在该单词前缀
{
return 0;
}
else
{
proot = proot->child[temp - ‘a’];
}
str1++;
}
return calcCountOfWorld(proot);//直接利用这个函数calcCountOfWorld计算该节点下的单词数量。
}
主函数以及验证代码
int main()
{
insertWorld(“internet”);
insertWorld(“apple”);
insertWorld(“bee”);
insertWorld(“teacher”);
insertWorld(“student”);
insertWorld(“interesting”);
insertWorld(“interested”);
insertWorld(“interest”);
insertWorld(“monkey”);
insertWorld(“panda”);
insertWorld(“rabbit”);
cout << sumOfWorld() << endl;
cout << searchWorld(“beee”) << endl;
cout << searchWorld(“bee”) << endl;
cout << searchWorld(“interest”) << endl;
cout << sumOfPrefix(“interest”) << endl;
cout << sumOfPrefix(“inter”) << endl;
cout << sumOfPrefix(“bee”) << endl;
system(“pause”);
return 0;
}

验证结果截图：
在这里插入图片描述
结果执行还是很正确的啊
以上是字典树和基本功能的简单实现。实际上字典树的优化点还可以在单词的存储上做文章。我没没必要一下子高26个孩子。我可以按照单词的前缀按需，读单词进行分解，来进行存储。这样的存储空间更少。利用率更高。

weixin_41157881

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

专栏目录

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。