机器学习-数学基础2：字母表，二叉树

最新推荐文章于 2023-06-30 14:19:52 发布

梁东东的成长日记

最新推荐文章于 2023-06-30 14:19:52 发布

阅读量226

点赞数

分类专栏：魔鬼训练营文章标签： matlab 算法机器学习

本文链接：https://blog.csdn.net/qq_42281173/article/details/116649775

版权

魔鬼训练营专栏收录该内容

7 篇文章 0 订阅

订阅专栏

本贴是对闵老师的博客的理解

一.字母表

1.1字母表

常见的字母表包括:
$\Sigma= \{0, 1\}$
$\Sigma = \{\mathrm{a}, \dots, \mathrm{z}\}$

我们需要的字母表：
$\Sigma = \{\mathrm{l},\mathrm{r}\}$

1.2正闭包

The positive closure of alphabet is given by $\Sigma^+ = \Sigma^1 \cup \Sigma^2 \cup ...$

$\Sigma^1=\{\mathrm{l},\mathrm{r}\}$
$\Sigma^2=\{\mathrm{ll},\mathrm{lr},\mathrm{rr},\mathrm{rl}\}$
$\dots$
$\Sigma^+ = \{\mathrm{l}, \mathrm{r}, \mathrm{ll}, \mathrm{lr}, \mathrm{rl}, \mathrm{rr}, \mathrm{lll}, \dots \}$

1.3克林闭包

空串 $\varepsilon$ (varepsilon)
The Cling closure of alphabet $\Sigma^* = \Sigma^0 \cup \Sigma^+ = \{\varepsilon\} \cup \Sigma^+$
字母表克林闭包的元素, 就称为字符串

1.4跳转函数

为了描述在某个状态接受字符串的跳转, 可定义跳转函数为:
Let $\bm{S}$ denote the set of states. The positive state transition function is given by $\bm{S} \times \Sigma^+ \to \bm{S}$ , where $\forall s \in \bm{S}$ and $a_1 a_2 \dots a_k \in \Sigma^+$ ,
$a_1 a_2 \dots a_k) = f(f(s, a_1), a_2 a_3 \dots a_k)$

二.二叉树

2.1初始版本

Let $\Sigma = \{\mathrm{l}, \mathrm{r}\}$ be the alphbet and $\phi$ be a null node. A binary tree is a triple $(\bm{V}, r, c)$ , where $\bm{V} = \{v_1, \dots, v_n\}$ is the set of nodes, $\in \bm{V}$ is the root, and $\bm{V} \cup \{\phi\} \times \Sigma^+ \to \bm{V} \cup \{\phi\}$ satisfying
a) $c(\phi, \mathrm{l}) = c(\phi, \mathrm{r}) = \phi$ ;
b) $\forall v \in \bm{V} \setminus \{r\}$ , $\exists1$ $\in \Sigma^+$ st. $(\mathrm{r},\mathrm{s})=v$ ;
c) $\forall v \in \bm{V}$ , $\in \Sigma$ , $c(\mathrm{v},\mathrm{a}) \neq r$

这是一个三元组。
b）代表从根节点到任意节点的路径仅存在唯一的路径
c) 代表根节点没有父节点

注：由b) 和 c)我们推出了一个性质 $\bm V$ 中节点没有环
在这里插入图片描述

2.2打磨版本

2.2.1定义

Let $\Sigma = \{\mathrm{l}, \mathrm{r}\}$ be the alphbet and $\phi$ be a null node. A binary tree is a triple $(\bm{V}, r, c)$ , where $\bm{V} = \{v_1, \dots, v_n\}$ is the set of nodes, $\in \bm{V}$ is the root, and $\bm{V} \cup \{\phi\} \times \Sigma^* \to \bm{V} \cup \{\phi\}$ satisfying
$\forall v \in \bm{V}$ , $\exists1$ $\in \Sigma^*$ st. $(\mathrm{r},\mathrm{s})=v$ ;

a)和c)都是冗余的，但是只保留b),会出现一种特殊情况
在这里插入图片描述
所以我们引入了空串， $\Sigma^* = \Sigma^0 \cup \Sigma^+ = \{\varepsilon\} \cup \Sigma^+$
$\varepsilon) = r$ . 即从 $r$ 读入空串到自己

2.2.2性质

①：二叉树的任何节点 (空节点除外) 不会有到自己的环.

Property 1. $\forall v \in \bm{V}$ , $\exists s \in \Sigma^+$ ,st. $c (v, s) = v$ .
Proof. Suppose that $\exists v_i \in \bm{V}$ and $\in \Sigma^+$ ,st. $c(v_i, s') = v_i$
According to Definition 17, $\exists s_1 \in \Sigma^*$ st. $c(r, s_1) = v_i$ .
Consequentyly $c ( r , s_1 s ′ ) = c ( c ( r , s_1 ) , s ′ ) = c ( v i , s ′ ) = v i$ , and $s$ takes at least two values $(s 1$ and $s 1 s')$ , making it not unique.
This contradition shows that the assumption does not hold.
The proof is finished.
在这里插入图片描述

②：空节点的左右孩子都是自己

Property 1. $c(\phi, \mathrm{l}) = c(\phi, \mathrm{r}) = \phi$ .
Proof. $a_1 a_2 \dots a_{n+1} \in \Sigma^*$ , we consider $c (r, s)$ . Let the path corresponding to the calculation of $c (r, s)$ be $v_0' v_1' \dots v_{n+1}'$ where $v_0' = r$ . Since $|\bm{V} \cup \{\phi\}| = n + 1$ , according to the Pigeon Cage Principle (鸽笼原理), there must $\exists0 \leq i < j \leq n + 1$ st. $v_i' = v_j'$ . In other words, $v_i' \dots v_j'$ is a loop.
Now assume that $\exists i < k < j$ st. $v_k' \in \bm{V}$ . We have $a_1 a_2 \dots a_k) = v_k'$ , and $a_1 a_2 \dots a_j a_{i+1} a_{i + 2} \dots a_k) = v_k'$ , making the path from $r$ to $v_k'$ not unique.
Hence the assumption does not hold, and $v_i' = v_{i + 1}' = \dots = v_j' = \phi$ .
In other words, any character takes $\phi$ to itself.
This completes the proof.
说明:
a) 鸽笼原理: $n + 1$ 只鸽子飞进 $n$ 个鸽笼, 至少有两个鸽子在同一个笼子里. 这是组合数学中重要的定理.
b) 这里用到了有穷状态自动机 (Finite state automata) 的知识. 从任一节点 (状态), 读入一个字符, 到达下一个节点. 这里的 $\phi$ 被称为陷井状态.

1)开始在一个节点，读入n+1个字符，就会经过n+2个节点。有效集合加上{ $\phi$ }一共n+1个节点。由鸽笼原理。得知肯定由两个节点相同。两个节点相同由性质1可知二叉树的任何节点 (空节点除外) 不会有到自己的环，所以这两个相同的点肯定是 $\phi$ .
2)接着我们探讨 $v_i,v_j$ 是否存在有效节点。我们假设在i，j中存在有效节点k。由于 $v_k'$ 属于有效节点，再由b)可知 $a_1 a_2 \dots a_k) = v_k'$ 。 $a_1 a_2 \dots a_j a_{j+1} a_{j + 2} \dots a_k) = v_k'$ = $a_1 a_2 \dots a_i a_{i+1} a_{i + 2} \dots a_k) = v_k'$ ，这就导致路径不唯一与定义相矛盾。为了防止不冲突，我们只有当读取到 $a_i$ 或者 $a_j$ 时是 $\phi$ ，后面 $a_{i+1}\dots$ 或者 $a_{j+1}\dots$ 都是 $\phi$ 节点是才可以。即 $v_i' = v_{i + 1}' = \dots = v_j' = \phi$ .证明了空节点自成一个环路

梁东东的成长日记

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
机器学习-数学基础2：字母表，二叉树

本贴是对闵老师的博客的理解一.字母表1.1字母表常见的字母表包括:Σ={0,1}\Sigma= \{0, 1\}Σ={0,1}Σ={a,…,z}\Sigma = \{\mathrm{a}, \dots, \mathrm{z}\}Σ={a,…,z}我们需要的字母表：Σ={l,r}\Sigma = \{\mathrm{l},\mathrm{r}\}Σ={l,r}1.2正闭包The positive closure of alphabet is given by Σ+=Σ1∪Σ2∪.
复制链接

扫一扫

专栏目录