Stochastic dual dynamic integer programming (SDDiP)

Stochastic dual dynamic integer programming (SDDiP)

Introduction

Multistage stochastic programming

  • The resulting formulation, called the extensive form, is
    请添加图片描述
  • An alternative to the extensive form (1.1) is to formulate the MSP problem via the following dynamic programming (DP) recursions
    请添加图片描述

MSIP with binary state variables

  • In this section, we

    • describe the class of MSIP problem we consider in this work
    • propose the key reformulation
  • 一些假设:

  1. 线性(本文结果可以很容易地拓展到非线性情况,但本文聚焦于线性情况)请添加图片描述
  2. 一个重要的要求!!!—— x n x_n xn要是binary的, y n y_n yn无所谓
    请添加图片描述
    之所以有这个要求是因为:
    请添加图片描述
  3. complete continuous recourse (can always be achieved by adding nonnegative auxiliary con- tinuous variables and penalizing them in the objective function)
    请添加图片描述
  • 对本文至关重要的对(1.1)的reformulation——引入 z n z_n zn
    请添加图片描述
    写成DP:
    请添加图片描述
    请添加图片描述

Stochastic nested decomposition (SND) and SDDiP

  • In this section, we present SND and its special case SDDiP for solving (2.1):
    • SDDiP --> when the stochasticity satisfies stage-wise independence

The SND algorithm

  • a sampling step请添加图片描述
  • a forward step
    请添加图片描述
    请添加图片描述
  • a backward step
    The goal of the backward step is to update the approximate expected cost-to-go function ψ n i ( ⋅ ) \psi_n^i(\cdot) ψni() for each sampled node n ∈ ω k i n \in \omega_k^i nωki.请添加图片描述

The SDDiP algorithm

  • for the setting where the scenario tree satisfies stage-wise independence, i.e., for any two nodes n n n and n ′ n' n in S t S_t St the set of children nodes C ( n ) \mathcal{C}(n) C(n) and C ( n ′ ) \mathcal{C}(n') C(n) are defined by identical data and conditional probabilities.
  • Q n ( ⋅ ) ≡ Q t ( ⋅ ) \mathcal{Q}_{n}(\cdot) \equiv \mathcal{Q}_{t}(\cdot) Qn()Qt() for all n ∈ S t n\in S_t nSt
    请添加图片描述
    注:随机性出现在约束的左边或者目标函数中较难处理,会出现bilinear项
    请添加图片描述

Sufficient cut conditions

  • SND和SDDiP算法是valid的只要cut满足下面三个性质:
    请添加图片描述

Convergence

  • 本节证明了SND算法的收敛性

    • with probability one, the approximate cost-to-go functions constructed using valid, tight, and finite cuts define an optimal solution to MSIP with binary state variables in a finite number of iterations
  • 一个technical假设(大多数确定性MIP求解器都满足,如CPLEX和Gurobi)
    请添加图片描述

  • 核心定理:
    请添加图片描述

Cut families

Benders’ cut

不tight

Integer optimality cut

三个性质都满足,但只在指定的解上是紧的,在其他解上非常松 ⇒ \Rightarrow 收敛很慢

Lagrangian cut

  • obtained by solving Lagrangian dual of the reformulated nodal problems
    请添加图片描述
    请添加图片描述
  • 证明Lagrangian cuts满足三条性质:valid, tight and finite
    请添加图片描述
Matlab优化算法是一种基于数学模型的计算方法,在许多领域中都有着广泛的应用。根据实际应用需求,我们可以选择适合的优化算法。下面罗列100个常见的Matlab优化算法: 1. 遗传算法(Genetic Algorithm) 2. 粒子群算法(Particle Swarm Optimization) 3. 差分进化算法(Differential Evolution) 4. 蚁群算法(Ant Colony Optimization) 5. 模拟退火算法(Simulated Annealing) 6. 人工鱼群算法(Artificial Fish Swarm Algorithm) 7. 历史遗传算法(Historical Genetic Algorithm) 8. 协方差矩阵适应进化策略(Convex Matrix Evolution Strategy) 9. 盲化梯度下降(Blind Gradient Descent) 10. 坐标下降法(Coordinate Descent) 11. 简单x方法(Simplex Method) 12. 对偶内点法(Dual Interior Point Method) 13. 增广拉格朗日法(Augmented Lagrangian Method) 14. 卡尔曼滤波(Kalman Filter) 15. 扩展卡尔曼滤波(Extended Kalman Filter) 16. 动态规划(Dynamic Programming) 17. 灰关联分析(Grey Relational Analysis) 18. 纯虚拟炼金术模拟算法(Purely Virtual Alchemy Simulation Algorithm) 19. 模糊控制算法(Fuzzy Control Algorithm) 20. 归纳逻辑程序设计算法(Inductive Logic Programming Algorithm) 21. Linear Programming 22. Nonlinear Programming 23. Quadratic Programming 24. Integer Programming 25. Semi-definite Programming 26. Combinatorial Optimization 27. Stochastic Programming 28. Convex Optimization 29. Non-negative Matrix Factorization 30. Support Vector Machine 31. Logistic Regression 32. Linear Discriminant Analysis 33. Naive Bayes Classifier 34. Principal Component Analysis 35. Independent Component Analysis 36. Karhunen-Loeve Transform 37. Wavelet Transform 38. Discrete Wavelet Transform 39. Fast Fourier Transform 40. Nonlinear Least Squares 41. Maximum Likelihood Estimation 42. Conditional Maximum Likelihood Estimation 43. Maximum A Posteriori Estimation 44. Sequential Monte Carlo 45. Markov Chain Monte Carlo 46. Gibbs Sampling 47. Metropolis-Hastings Algorithm 48. Hamiltonian Monte Carlo 49. Variational Bayes 50. Expectation-Maximization Algorithm 51. Structured Variational Bayes 52. Belief Propagation 53. Compressed Sensing 54. Sparse Representation 55. Non-negative Sparse Representation 56. Robust Principal Component Analysis 57. Low Rank Matrix Completion 58. Nonlinear Regression 59. Kernel Regression 60. Gaussian Process Regression 61. Kriging 62. Smoothing Splines 63. Nonparametric Regression 64. Discriminant Analysis 65. Nonparametric Bayes 66. Boosting 67. Random Forest 68. Deep Learning 69. Convolutional Neural Network 70. Recurrent Neural Network 71. Long Short-Term Memory 72. Autoencoder 73. Variational Autoencoder 74. Generative Adversarial Network 75. Reinforcement Learning 76. Q-Learning 77. Policy Gradient 78. Actor-Critic 79. Monte Carlo Tree Search 80. Exploration-Exploitation Dilemma 81. Batch Reinforcement Learning 82. Decision Trees 83. Naive Decision Trees 84. Randomized Decision Trees 85. Regression Trees 86. Tree Boosting 87. Gradient Boosting 88. Adaboost 89. Rank Boosting 90. Decision Forest 91. Learning to Rank 92. Unsupervised Learning 93. Clustering 94. K-Means 95. Gaussian Mixture Model 96. Hierarchical Clustering 97. Self-Organizing Maps 98. Non-negative Matrix Factorization Clustering 99. Subspace Clustering 100. Latent Dirichlet Allocation.
评论 1
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值