Submodular and Supermodular Functions (supermodular/submodular)

琉璃树下


Article link: https://blog.csdn.net/weixin_44372736/article/details/128366927

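This post introduces submodular and supermodular functions: their mathematical definitions, basic properties, and their use in optimizing submodular set functions. For a submodular set function the marginal gain of adding an element is non-increasing as the set grows (diminishing returns), while for a supermodular one it is non-decreasing. The Lovász Extension is used to minimize a submodular set function, and the multilinear relaxation is used for maximization problems. The post also covers the supermodular stochastic order for multivariate normal distributions. These concepts are widely used in practical problems such as resource allocation and ad placement optimization.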

Definitions

submodular function

Definition 1: In mathematics, a function $f:R^k \rightarrow R$ is submodular if

$f(x\uparrow y)+f(x\downarrow y)\leq f(x)+f(y)$

for all $x,y\in R^k$ , where $x\uparrow y$ denotes the componentwise maximum and $x\downarrow y$ the componentwise minimum of $x$ and $y$ .

Definition 2: If $f$ is twice continuously differentiable, then submodularity is equivalent to the condition

$\frac{\partial^2 f}{\partial z_i \partial z_j}\leq 0 \ \ \text{for all} \ \ i\not=j$
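The two definitions above can be checked against each other numerically. The following is a minimal sketch, assuming the hypothetical example $f(x_1,x_2)=-x_1x_2$ (not from the original post), whose cross partial derivative is $-1\leq 0$; the helper names `join`, `meet` and `violates_submodularity` are illustrative only.

```python
import random

def f(x):
    # Hypothetical example: f(x1, x2) = -x1 * x2, cross partial is -1 <= 0.
    return -x[0] * x[1]

def join(x, y):
    # Componentwise maximum, written x "up-arrow" y in the definition.
    return tuple(max(a, b) for a, b in zip(x, y))

def meet(x, y):
    # Componentwise minimum, written x "down-arrow" y in the definition.
    return tuple(min(a, b) for a, b in zip(x, y))

def violates_submodularity(func, x, y, tol=1e-12):
    # True if f(join) + f(meet) exceeds f(x) + f(y) beyond rounding error.
    return func(join(x, y)) + func(meet(x, y)) > func(x) + func(y) + tol

rng = random.Random(0)
points = [(rng.uniform(-5, 5), rng.uniform(-5, 5)) for _ in range(200)]
violations = sum(violates_submodularity(f, x, y) for x in points for y in points)
print(violations)  # number of sampled pairs that break the inequality
```

A random search like this can only find counterexamples; finding none is evidence, not a proof, of submodularity.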

submodular set function

Definition 1: A function $f:2^N\rightarrow R$ is submodular if for any $S,T\subseteq N$ ,

$f(S\cup T)+f(S\cap T)\leq f(S)+f(T)$

Definition 2: Submodularity can be alternatively defined by

$f(S\cup \{j\})-f(S)\geq f(T\cup \{j\})-f(T)$

for all $S\subseteq T, j\notin T$
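For a small ground set, both definitions can be verified by brute force. The sketch below assumes a hypothetical coverage function (the `COVER` data and all function names are illustrative, not from the original post) and checks it against Definition 1 and Definition 2; it also shows that $|S|^2$, a supermodular function, fails the test.

```python
from itertools import combinations

def subsets(ground):
    # Enumerate all subsets of the ground set as frozensets.
    items = sorted(ground)
    for r in range(len(items) + 1):
        for combo in combinations(items, r):
            yield frozenset(combo)

def is_submodular_lattice(f, ground):
    # Definition 1: f(S | T) + f(S & T) <= f(S) + f(T) for all S, T.
    all_sets = list(subsets(ground))
    return all(f(S | T) + f(S & T) <= f(S) + f(T)
               for S in all_sets for T in all_sets)

def is_submodular_marginal(f, ground):
    # Definition 2: marginal gains shrink as the set grows.
    all_sets = list(subsets(ground))
    for T in all_sets:
        for S in all_sets:
            if not S <= T:
                continue
            for j in ground - T:
                if f(S | {j}) - f(S) < f(T | {j}) - f(T):
                    return False
    return True

# Hypothetical data: index j covers the elements in COVER[j].
COVER = {0: {"a", "b"}, 1: {"b", "c"}, 2: {"c", "d", "e"}}

def coverage(S):
    covered = set()
    for j in S:
        covered |= COVER[j]
    return len(covered)

ground = frozenset(COVER)
print(is_submodular_lattice(coverage, ground),
      is_submodular_marginal(coverage, ground))
print(is_submodular_lattice(lambda S: len(S) ** 2, ground))
```

The enumeration is exponential in $|N|$, so this is only meant for sanity checks on toy instances.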

Properties

If $f (x)$ is submodular, then $- f (x)$ is supermodular.
A sum of submodular functions is a submodular function.
Composition rules, Topkis (1978):

[Figure: table of composition rules; the image is unavailable in this copy]

For example, the fourth row of the table states: if $f$ is a concave increasing function and $g$ is submodular, then $f\circ g =f(g(\cdot))$ is submodular. (The standard form of this rule also requires $g$ to be monotone.)

Lovász Extension and Minimization of Submodular Set Functions

The Lovász Extension can be used to find the minimum of a submodular set function.

Definition: Given a set function $f:2^N\rightarrow R$ , the Lovász Extension $f^L:[0,1]^N\rightarrow R$ is defined as $f^L(x)=\sum_{j=1}^m\lambda_jf(S_j)$ , where $\{S_j\}$ is the unique decreasing chain of sets $N=S_1\supset S_2 \supset \cdots \supset S_m= \emptyset$ such that $x=\sum_j \lambda_j \mathbf{1}_{S_j}$ with $\sum_j \lambda_j=1, \lambda_j\geq 0$ . Here $\mathbf{1}_{S_j}$ denotes the indicator vector of $S_j$ .

The Lovász Extension of a submodular set function is always convex. Conversely, if the Lovász Extension of a set function is convex, then that set function must be submodular.

[Minimization of a submodular set function]

If $f:2^N\rightarrow R$ is a submodular set function, then the minimum of its Lovász Extension over the domain $[0,1]^N$ is attained at a vertex: $\min_{x\in[0,1]^N} f^L(x)=\min_{S\subseteq N}f(S)$
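The Lovász Extension can be evaluated by sorting the coordinates of $x$. The sketch below builds the chain from $\emptyset$ up to $N$ (the same chain as in the definition, traversed in the opposite direction) and compares the minimum over a coarse grid of $[0,1]^N$ with the minimum over all subsets. The set function (coverage minus a per-set cost of 2) and the `COVER` data are hypothetical examples, not from the original post.

```python
from itertools import product

# Hypothetical data: index j covers the elements in COVER[j].
COVER = {0: {"a", "b", "c"}, 1: {"c", "d"}, 2: {"d"}}

def f(S):
    # Coverage minus a cost of 2 per chosen index: submodular, not monotone.
    covered = set().union(*(COVER[j] for j in S))
    return len(covered) - 2 * len(S)

def lovasz_extension(func, x):
    """Evaluate f^L at x in [0,1]^n for a set function on subsets of range(n)."""
    n = len(x)
    order = sorted(range(n), key=lambda i: -x[i])  # indices by decreasing x_i
    levels = [1.0] + [x[i] for i in order] + [0.0]
    value = 0.0
    chain = set()
    for k in range(n + 1):
        # chain holds the k largest coordinates; its weight lambda is the
        # gap between consecutive sorted levels, so the weights sum to 1.
        weight = levels[k] - levels[k + 1]
        value += weight * func(frozenset(chain))
        if k < n:
            chain.add(order[k])
    return value

n = len(COVER)
min_over_sets = min(f(frozenset(i for i, b in enumerate(bits) if b))
                    for bits in product([0, 1], repeat=n))
min_over_grid = min(lovasz_extension(f, x)
                    for x in product([0.0, 0.25, 0.5, 0.75, 1.0], repeat=n))
print(min_over_sets, min_over_grid)
```

The grid search is only for illustration; in practice convexity of $f^L$ is what makes the continuous problem tractable.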

Multilinear Relaxation and Maximization of Submodular Set Functions

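In many settings we also want to maximize a submodular set function. For example, a firm places ads under a limited budget, and consumers influence one another's behavior. The firm must decide which people to target in order to maximize utility. Treat each group of mutually influencing consumers as a set, and let $k$ be the number of ads the firm can place within its budget. Deciding which consumers to target then amounts to finding $k$ sets that cover the most consumers. This simplified model is called the Max-k-Cover problem: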

Given a collection of sets $\{S_j\subseteq N|j\in A\}$ , find $k$ sets that cover the largest number of elements.

Each consumer can also be assigned a value. The Maximum Coverage Problem:

Given sets $S_1,S_2,...,S_m\subseteq N$ . Each element $i\in N$ has a value $\nu_i\geq 0$ , and for each set $S\subseteq N$ the value function is defined as $V(S)=\sum_{i\in S}\nu_i$ . We need to select $k$ sets $\{S_j|j\in A\}$ , where $A\subseteq\{1,...,m\}$ and $|A|=k$ , to maximize the value $V(\cup_{j\in A}S_j)$ .

The objective function of this problem is submodular (the proof is not hard). A related problem derived from it is Assortment Optimization:

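Suppose there are $N$ mutually substitutable products, and some of them are to be chosen for advertising. The number of ads (or ad slots) is at most $K$ . Some papers assume that the advertising profit $V (S)$ is submodular, so the problem can be written as $\max\ \{V(S):|S|\leq K,S\subseteq N\}$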

The cardinality-constrained problems above, which maximize a submodular set function, are all NP-hard.
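Because exact solution is NP-hard, a common practical approach is the greedy heuristic, which for monotone submodular objectives under a cardinality constraint is known to achieve at least a $1-1/e$ fraction of the optimum. The post does not describe this algorithm; the sketch below is an illustration on the Maximum Coverage Problem, and the function name `greedy_max_coverage` and the data are hypothetical.

```python
def greedy_max_coverage(sets, values, k):
    """Pick up to k sets, each time adding the one with the largest marginal value."""
    chosen, covered = [], set()
    for _ in range(min(k, len(sets))):
        best_j, best_gain = None, 0
        for j, S in enumerate(sets):
            if j in chosen:
                continue
            # Marginal value: only elements not covered yet count.
            gain = sum(values[i] for i in S - covered)
            if gain > best_gain:
                best_j, best_gain = j, gain
        if best_j is None:  # no remaining set adds any value
            break
        chosen.append(best_j)
        covered |= sets[best_j]
    return chosen, sum(values[i] for i in covered)

# Hypothetical instance: four candidate sets over elements 1..6 with values nu_i.
sets = [{1, 2, 3}, {3, 4}, {4, 5, 6}, {1, 6}]
values = {1: 1, 2: 1, 3: 2, 4: 2, 5: 1, 6: 1}
chosen, total = greedy_max_coverage(sets, values, k=2)
print(chosen, total)
```

Ties are broken in favor of the lowest index, which is an arbitrary choice made for this sketch.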

For maximization problems, we can introduce the multilinear relaxation.

Definition: Given a set function $f:2^N \rightarrow R$ , we define its multilinear relaxation by randomly rounding a continuous point $x\in[0,1]^N$ to $\{0,1\}^N$ : $F(x)=E[f(\xi(x))]$ , where $\xi(x)\in \{0,1\}^N$ takes value $\xi(x)_i = 1$ with probability $x_i$ , and $\xi(x)_i=0$ with probability $1-x_i$ , independently across $i$ .
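The expectation in this definition can be computed exactly by enumerating all $2^{|N|}$ outcomes, or estimated by sampling the random rounding $\xi(x)$. The following is a minimal sketch of both, assuming a hypothetical coverage function as $f$; the function names and the `COVER` data are illustrative only.

```python
import random
from itertools import product

# Hypothetical data: index j covers the elements in COVER[j].
COVER = {0: {"a", "b"}, 1: {"b", "c"}, 2: {"c", "d", "e"}}

def coverage(S):
    covered = set()
    for j in S:
        covered |= COVER[j]
    return len(covered)

def multilinear_exact(f, x):
    # F(x) = sum over S of f(S) * prod_{i in S} x_i * prod_{i not in S} (1 - x_i).
    total = 0.0
    for bits in product([0, 1], repeat=len(x)):
        prob = 1.0
        for xi, b in zip(x, bits):
            prob *= xi if b else 1.0 - xi
        total += prob * f(frozenset(i for i, b in enumerate(bits) if b))
    return total

def multilinear_sampled(f, x, samples=20000, seed=0):
    # Monte Carlo estimate: include element i independently with probability x_i.
    rng = random.Random(seed)
    total = 0.0
    for _ in range(samples):
        S = frozenset(i for i, xi in enumerate(x) if rng.random() < xi)
        total += f(S)
    return total / samples

x = [0.5, 0.5, 0.5]
print(multilinear_exact(coverage, x), multilinear_sampled(coverage, x))
```

At a vertex of $[0,1]^N$ the rounding is deterministic, so $F$ agrees with $f$ there; exact enumeration is only feasible for small $|N|$, which is why sampling is the usual choice.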

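An approximate solution can be obtained from $\max F(x)$ . (There are some useful algorithms for this; they may be added later when time permits.)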

Supermodular stochastic order (Muller and Scarsini 2000)

Definition: A random vector $D_1$ is said to be smaller than the random vector $D_2$ in the supermodular order, written $D_1\leq_{sm}D_2$ , if $E[f(D_1)]\leq E[f(D_2)]$ for all supermodular functions $f$ such that the expectations exist.

Based on this definition, Muller and Scarsini obtain the following result:

Let $D_1$ and $D_2$ be multivariate normal random vectors with parameters $D_1\sim N(\mu,\Sigma_1)$ and $D_2 \sim N(\mu,\Sigma_2)$ , where $\Sigma_1,\Sigma_2$ are covariance matrices such that $\sigma_{ii}^1=\sigma_{ii}^2, \sigma_{ij}^1\leq\sigma_{ij}^2$ . Then $D_1\leq_{sm}D_2$ .
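As a quick sanity check of this result (a sketch for intuition, not part of Muller and Scarsini's argument), take the bivariate case with $\mu=(\mu_1,\mu_2)$ and the supermodular function $f(x)=x_1x_2$ , whose cross partial derivative is $1\geq 0$ . Writing $D_{k,1}, D_{k,2}$ for the two components of $D_k$ , we get

$E[f(D_k)]=E[D_{k,1}D_{k,2}]=\mu_1\mu_2+\sigma_{12}^k, \ \ k=1,2$

so for this particular $f$ the inequality $E[f(D_1)]\leq E[f(D_2)]$ holds exactly when $\sigma_{12}^1\leq\sigma_{12}^2$ , which is consistent with the condition on the covariances stated above.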