Sum-Product Networks: A New Deep Architecture
H. Poon, P. Domingos, Sum-Product Networks: A New Deep Architecture, UAI (2011), Best Paper
Abstract
A key limiting factor in inference and learning for graphical models is the complexity of the partition function.
This paper proposes sum-product networks (SPNs): directed acyclic graphs with variables as leaves, sums and products as internal nodes, and weighted edges.
If an SPN is complete and consistent, it represents the partition function and all marginals of some graphical model, and its nodes have well-defined semantics.
The paper derives learning algorithms for SPNs based on backpropagation and EM.
SPNs are faster and more accurate than conventional deep networks in both learning and inference.
1 Introduction
Graphical models compactly represent distributions as normalized products of factors: $P(X = x) = \frac{1}{Z} \prod_{k} \phi_{k}(x_{\{k\}})$, where

- $x \in \mathcal{X}$ is a $d$-dimensional vector;
- each potential $\phi_{k}$ is a function of a subset $x_{\{k\}}$ of the variables (its scope);
- $Z = \sum_{x \in \mathcal{X}} \prod_{k} \phi_{k}(x_{\{k\}})$ is the partition function.
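The definitions above can be made concrete with a brute-force sketch; the two potential tables and their scopes below are made up for illustration and are not from the paper:

```python
import itertools

# Illustrative potentials over three Boolean variables x0, x1, x2:
# phi0 has scope {x0, x1}, phi1 has scope {x1, x2}.
phi0 = {(0, 0): 1.0, (0, 1): 2.0, (1, 0): 2.0, (1, 1): 4.0}
phi1 = {(0, 0): 3.0, (0, 1): 1.0, (1, 0): 1.0, (1, 1): 3.0}
factors = [(phi0, (0, 1)), (phi1, (1, 2))]  # (table, scope indices)

def unnormalized(x):
    """Product of all potentials, each evaluated on its scope."""
    p = 1.0
    for table, scope in factors:
        p *= table[tuple(x[i] for i in scope)]
    return p

# The partition function sums over all 2^d joint states -- exponential in d.
Z = sum(unnormalized(x) for x in itertools.product([0, 1], repeat=3))

def prob(x):
    """Normalized probability P(X = x)."""
    return unnormalized(x) / Z
```

Dividing by $Z$ makes the probabilities over all $2^{d}$ states sum to 1, which is exactly why computing $Z$ is the bottleneck.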
Drawbacks of graphical models:

- some distributions cannot be represented compactly in this form;
- in the worst case, inference takes exponential time;
- in the worst case, the sample size required for accurate learning grows exponentially with scope size;
- because learning requires inference as a subroutine, it can take exponential time even with fixed scopes.
Introducing hidden variables $y$ can greatly improve the compactness of a graphical model: $P(X = x) = \frac{1}{Z} \sum_{y} \prod_{k} \phi_{k}((x, y)_{\{k\}})$.
Models with multiple layers of hidden variables allow efficient inference in a much larger class of distributions.
The partition function $Z$ can be computed efficiently if $\sum_{x \in \mathcal{X}} \prod_{k} \phi_{k}(x_{\{k\}})$ can be reorganized, using the distributive law, into a computation involving only a polynomial number of sums and products.
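The distributive-law reorganization can be sketched on a tiny two-factor chain $\phi_{0}(x_0, x_1)\,\phi_{1}(x_1, x_2)$; the tables here are made up for illustration:

```python
# Illustrative potential tables (not from the paper).
phi0 = {(0, 0): 1.0, (0, 1): 2.0, (1, 0): 2.0, (1, 1): 4.0}
phi1 = {(0, 0): 3.0, (0, 1): 1.0, (1, 0): 1.0, (1, 1): 3.0}

# Brute force: one term per joint state, 2^3 products in total.
Z_brute = sum(phi0[x0, x1] * phi1[x1, x2]
              for x0 in (0, 1) for x1 in (0, 1) for x2 in (0, 1))

# Distributive law: push the sums over x0 and x2 inside the product,
#   Z = sum_{x1} ( sum_{x0} phi0(x0,x1) ) * ( sum_{x2} phi1(x1,x2) ),
# which uses only a polynomial number of sums and products.
Z_fast = sum(sum(phi0[x0, x1] for x0 in (0, 1)) *
             sum(phi1[x1, x2] for x2 in (0, 1))
             for x1 in (0, 1))
```

On a chain of $d$ variables the reorganized form needs $O(d)$ local sums instead of $2^{d}$ terms; an SPN is precisely a reusable encoding of such a reorganized computation.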
This paper proposes sum-product networks (SPNs). SPNs can be viewed as generalized directed acyclic graphs of mixture models, with sum nodes corresponding to mixtures over subsets of variables and product nodes corresponding to features or mixture components. SPNs admit efficient learning by backpropagation or EM.
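As a minimal sketch of this mixture view, the network below is a root sum node over two product nodes, each multiplying per-variable sum nodes over indicator leaves; the weights and leaf probabilities are made up:

```python
def leaf(p, x):
    """Sum node over the indicators [X] and [X-bar]: p*[X=1] + (1-p)*[X=0]."""
    return p if x == 1 else (1.0 - p)

def spn(x1, x2):
    """Evaluate a tiny SPN over two Boolean variables (illustrative weights)."""
    # Two product nodes: each is a component with independent variables.
    comp1 = leaf(0.9, x1) * leaf(0.2, x2)
    comp2 = leaf(0.1, x1) * leaf(0.7, x2)
    # Root sum node: a mixture of the two components, weights summing to 1.
    return 0.6 * comp1 + 0.4 * comp2

# With normalized weights, the values over all joint states sum to 1,
# so the network directly defines a distribution.
total = sum(spn(a, b) for a in (0, 1) for b in (0, 1))
```

Evaluating the network bottom-up takes time linear in the number of edges, which is the source of the efficient inference claimed above.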
2 Sum-Product Networks
Consider Boolean variables $X_{i}$, whose negations are written $\bar{X}_{i}$.
The indicator function $[\cdot]$ has value 1 when its argument is true and 0 otherwise. In this paper, the variable indicators $[X_{i}]$, $[\bar{X}_{i}]$