论文速递 | Operations Research 1月文章合集

最新推荐文章于 2024-08-01 04:02:43 发布

运筹OR帷幄

最新推荐文章于 2024-08-01 04:02:43 发布

阅读量845

点赞数 17

文章标签：算法

本文链接：https://blog.csdn.net/weixin_53463894/article/details/136196993

版权

在这里插入图片描述

编者按

在本系列文章中，我们梳理了运筹学顶刊Operations Research在2024年1月份发布的7篇文章的基本信息，旨在帮助读者快速洞察领域新动态。

推荐文章7

● 题目：A Pareto Dominance Principle for Data-Driven Optimization
数据驱动优化的帕累托最优原则

● 原文链接：https://doi.org/10.1287/opre.2021.0609

● 作者：Tobias Sutter, Bart P. G. Van Parys, Daniel Kuhn

● 发布时间：2024/01/19

● 摘要：

我们提出了一种统计上最优的方法来为随机优化问题构建数据驱动的决策。从根本上讲，数据驱动的决策只是一个将可用训练数据映射到一个可行行动的函数。它总是可以被表达为从数据构建的代理优化模型的最小化器。数据驱动决策的质量通过其样本外风险来衡量。另一个质量衡量是其样本外失望，我们将其定义为样本外风险超过代理优化模型的最优值的概率。数据驱动优化的关键是数据生成的概率测度是未知的。因此，理想的数据驱动决策应当同时针对每一个可想象的概率测度（因此特别是针对未知的真实测度）最小化样本外风险。不幸的是，这样理想的数据驱动决策通常是不可获得的。这促使我们寻求在做出数据驱动决策，实现样本内风险最小化的同时，针对每一个可想象的概率测度约束样本外失望的上界。我们证明在允许有趣应用的条件下，存在帕累托最优的数据驱动决策。该条件为：未知的数据生成概率测度必须属于一个参数模糊集，并且相应的参数必须生成一个满足大偏差原理的充分统计量。如果这些条件成立，我们进一步证明生成最优数据驱动决策的代理优化模型必须是从充分统计量和其大偏差原理的率函数构建的分布鲁棒优化问题。这表明，从严格统计意义上将数据映射到决策的最优方法是解决一个分布鲁棒优化模型。或许令人惊讶的是，这个结果无论原始随机优化问题是否凸，甚至当训练数据不是独立同分布时，都是成立的。作为一个副产品，我们的分析揭示了数据生成随机过程的结构属性如何影响最优分布鲁棒优化模型底层模糊集的形状。

We propose a statistically optimal approach to construct data-driven decisions for stochastic optimization problems. Fundamentally, a data-driven decision is simply a function that maps the available training data to a feasible action. It can always be expressed as the minimizer of a surrogate optimization model constructed from the data. The quality of a data-driven decision is measured by its out-of-sample risk. An additional quality measure is its out-of-sample disappointment, which we define as the probability that the out-of-sample risk exceeds the optimal value of the surrogate optimization model. The crux of data-driven optimization is that the data-generating probability measure is unknown. An ideal data-driven decision should therefore minimize the out-of-sample risk simultaneously with respect to every conceivable probability measure (and thus in particular with respect to the unknown true measure). Unfortunately, such ideal data-driven decisions are generally unavailable. This prompts us to seek data-driven decisions that minimize the in-sample risk subject to an upper bound on the out-of-sample disappointment—again simultaneously with respect to every conceivable probability measure. We prove that such Pareto dominant data-driven decisions exist under conditions that allow for interesting applications: The unknown data-generating probability measure must belong to a parametric ambiguity set, and the corresponding parameters must admit a sufficient statistic that satisfies a large deviation principle. If these conditions hold, we can further prove that the surrogate optimization model generating the optimal data-driven decision must be a distributionally robust optimization problem constructed from the sufficient statistic and the rate function of its large deviation principle. This shows that the optimal method for mapping data to decisions is, in a rigorous statistical sense, to solve a distributionally robust optimization model. Maybe surprisingly, this result holds irrespective of whether the original stochastic optimization problem is convex or not and holds even when the training data are not independent and identically distributed. As a byproduct, our analysis reveals how the structural properties of the data-generating stochastic process impact the shape of the ambiguity set underlying the optimal distributionally robust optimization model.

运筹OR帷幄

关注

17
点赞
踩
17

收藏

觉得还不错? 一键收藏
0
评论
论文速递 | Operations Research 1月文章合集

Dantzig-Wolfe (DW) 分解是混合整数规划（MIP）中一种著名的技术，用于分解和凸化约束以获得潜在的强对偶界。我们研究了使用 DW 分解算法可以导出的切割平面，并显示这些切割可以提供与 DW 分解相同的对偶界。更具体地说，我们为每个 DW 块生成一个切割，当与原始公式中的约束结合时，这些切割暗示了可以简单编写使用 DW 界限的目标函数切割。这种方法通常会导致具有较低对偶退化的公式，因此在使用标准 MIP 解算器在原始空间解决时具有更好的计算性能。我们还讨论如何加强这些切割以进一步提高计算性能。
复制链接

扫一扫