[Operations Research] utility theory


竹聿far


Category: Operations Research · Decision Methods. Tags: algorithms, learning

Article link: https://blog.csdn.net/kirinfar/article/details/124808152


decision criteria

decision making under uncertainty

action, $a_i \in A$
state, $s_j \in S$
reward, $r_{ij}$

a discrete newsvendor example

$c=20,p=25,S=\{600,700,800,900,1000\}$
$r_{ij}=$ reward of purchasing i with demand j
$r_{ij}= \left\{ \begin{aligned} & (p-c)i, &i\le j \\ & pj-ci, &i > j \end{aligned} \right.$
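The reward formula can be tabulated directly. The sketch below is only an illustration: the source lists the demand states $S$ but not the action set, so it assumes that the possible order quantities are the same five values (`A = S`).

```python
# Newsvendor data from the example above.
c = 20                              # unit purchase cost
p = 25                              # unit selling price
S = [600, 700, 800, 900, 1000]      # demand states
A = S                               # assumption: order quantities mirror the demand levels

def reward(i, j):
    """Reward of purchasing i units when demand turns out to be j."""
    if i <= j:
        return (p - c) * i          # every purchased unit is sold
    return p * j - c * i            # only j units are sold, but i units were paid for

# reward_matrix[row][col] = r_ij, rows are actions, columns are states
reward_matrix = [[reward(i, j) for j in S] for i in A]

for i, row in zip(A, reward_matrix):
    print(i, row)
```

Each printed row is one action; reading along a row shows how the reward changes with demand.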
dominated actions
$a_i\ is\ dominated\ by\ a_{i'},\\ if\ r_{ij} \le r_{i'j},\ \forall s_j\in S,\ and\ r_{ij'}< r_{i'j'},\ for\ some\ s_{j'}\in S$
In every state, action $a_i$ yields a reward no greater than the reward of action $a_{i'}$;

in at least one state, its reward is strictly smaller than the reward of $a_{i'}$.
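A minimal sketch of how the dominance test could be coded, assuming rewards are stored as one list per action with one entry per state. The three-action matrix is a made-up illustration, not data from this article.

```python
def dominates(row_a, row_b):
    """True if the action with rewards row_a dominates the action with rewards row_b."""
    no_worse = all(x >= y for x, y in zip(row_a, row_b))        # at least as good in every state
    strictly_better = any(x > y for x, y in zip(row_a, row_b))  # strictly better in some state
    return no_worse and strictly_better

def dominated_actions(rewards):
    """Indices of the rows that are dominated by at least one other row."""
    return [
        i for i, row in enumerate(rewards)
        if any(dominates(other, row) for k, other in enumerate(rewards) if k != i)
    ]

# Hypothetical reward matrix: 3 actions (rows) and 2 states (columns).
example = [[3, 5], [2, 5], [4, 1]]
print(dominated_actions(example))
```

Dominated actions can be dropped before applying any decision criterion, since another action is never worse and sometimes better.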

decision criteria

maximin: maximize the minimum reward
$a_i=\arg\max_{a_i\in A}\{\min_{s_j\in S}r_{ij} \}$
maximax: maximize the maximum reward
$a_i=\arg\max_{a_i\in A}\{\max_{s_j\in S}r_{ij}\}$
expected value: maximize the expected reward

$a_i=\arg\max_{a_i\in A}\{\sum_{s_j\in S}p_jr_{ij}\}$

minimax regret: minimize the maximum regret (the gap between the reward of the best action for the realized state and the reward of the action actually taken)
$a_i=\arg\min_{a_i\in A}\{\max_{s_j\in S}R_{ij}\}$
$Regret:\ R_{ij} = r_{i^*(j),j}-r_{ij},\ where\ i^*(j)=\arg\max_{a_i\in A}r_{ij},\ \forall s_j\in S$
$r_{i^*(j),j}$ = the reward of the best action for state $s_j$
$r_{ij}$ = the reward of the action actually taken
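The four criteria can be compared on the newsvendor example with a short script. This is a sketch under two assumptions that the source does not state: the order quantities equal the demand levels (`A = S`), and the five states are equally likely (`probs`).

```python
c, p = 20, 25
S = [600, 700, 800, 900, 1000]      # demand states
A = S                               # assumption: order quantities mirror the demand levels
probs = [0.2] * len(S)              # assumption: equally likely states

def reward(i, j):
    return (p - c) * i if i <= j else p * j - c * i

R = {i: [reward(i, j) for j in S] for i in A}

# Best achievable reward in each state, needed for the regret table.
best_in_state = [max(R[i][k] for i in A) for k in range(len(S))]
regret = {i: [best_in_state[k] - R[i][k] for k in range(len(S))] for i in A}

maximin = max(A, key=lambda i: min(R[i]))
maximax = max(A, key=lambda i: max(R[i]))
expected = {i: sum(q * r for q, r in zip(probs, R[i])) for i in A}
max_regret = {i: max(regret[i]) for i in A}

print("maximin picks", maximin)
print("maximax picks", maximax)
print("expected rewards", expected)
print("maximum regrets", max_regret)
```

Under these assumptions maximin selects 600 and maximax selects 1000, while ordering 600 and ordering 700 tie under both the expected value and the minimax regret criteria, so the printed dictionaries are more informative than a single `arg max`.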

Utility Theory

Lottery (L)

$L=(p_1,r_1;p_2,r_2;\dots;p_n,r_n)$
r: reward
p: probability
tree representation
notation
- $L_1pL_2$ : the decision maker prefers $L_1$ to $L_2$
- $L_1iL_2$ : equivalent lotteries, the decision maker is indifferent between $L_1$ and $L_2$
- $L_2pL_1$ : the decision maker prefers $L_2$ to $L_1$

Von Neumann-Morgenstern Utility Theory

The utility of the reward $r_i$ , written $u(r_i)$ , is the number $q_i$ such that $LiL'$ , where
- $L=(1,r_i)$
  
  $L'=(q_i,most\ favorable\ outcome;1-q_i,least\ favorable\ outcome)$
- u(least favorable outcome)=0
  
  u(most favorable outcome)=1
Utility function, $u(r_i),\forall r_i$
expected utility of the lottery L
$E(U\ for\ L)=\sum_{i=1}^n p_iu(r_i)$

Axiom

Complete ordering axiom: define the most/least favorable outcome
$\forall r_1,r_2:\ r_1 \succ r_2,\ or\ r_1 \sim r_2,\ or\ r_1 \prec r_2 \\ If\ r_1 \succ r_2\ and\ r_2\succ r_3,\ then\ r_1\succ r_3\ (transitivity\ of\ preferences)$
Continuity axiom: evaluate the intermediate outcome values between the most/least favorable outcome
$L_1=(1,r_2),L_2=(c,r_1;1-c,r_3) \\ If\ r_1\succ r_2\ and\ r_2\succ r_3, \\ then\ \exists c\ (0<c<1)\ such\ that\ L_1iL_2$
Independence axiom: “plug in” the equivalent lottery of the intermediate outcomes
$L_1=(c,r_1;1-c,r_3),L_2=(c,r_2;1-c,r_3) \\ If\ r_1\sim r_2,\ then\ for\ any\ r_3\ and\ \forall c\ (0<c<1),\ L_1iL_2$
Unequal probability axiom: rank the lotteries
$L_1=(p_1,r_1;1-p_1,r_2), L_2=(p_2,r_1;1-p_2,r_2) \\ r_1\succ r_2,p_1>p_2 \rightarrow L_1pL_2$
Compound lottery axiom: convert a compound lottery into a simple lottery

Considering all possible rewards, if a compound lottery $L$ yields reward $r_i$ with probability $p_i$ , then $LiL'$ , where $L'=(p_1,r_1;p_2,r_2;\dots;p_n,r_n)$
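The reduction described by the compound lottery axiom can be sketched in a few lines. The data layout (a list of `(probability, sub_lottery)` pairs) and the example lottery are assumptions made for illustration only.

```python
from collections import defaultdict

def reduce_compound(compound):
    """Collapse a two-stage compound lottery into a simple lottery.

    `compound` is a list of (probability, sub_lottery) pairs, and each
    sub_lottery is a list of (probability, reward) pairs.
    Returns a dict mapping each reward to its overall probability.
    """
    simple = defaultdict(float)
    for outer_p, sub_lottery in compound:
        for inner_p, reward in sub_lottery:
            simple[reward] += outer_p * inner_p   # multiply along each branch of the tree
    return dict(simple)

# Hypothetical compound lottery: with probability 0.5 play (0.5, 100; 0.5, 0),
# otherwise receive 0 for sure.
compound = [
    (0.5, [(0.5, 100), (0.5, 0)]),
    (0.5, [(1.0, 0)]),
]
print(reduce_compound(compound))
```

The resulting probabilities sum to 1, and by the axiom the decision maker is indifferent between the compound lottery and this simple one.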
lemma 1: linearity

Given a utility function $u(x)$ , define the positive linear function $v(x)=au(x)+b$ , $\forall a>0,b$ .

Then, for any two lotteries $L_1$ and $L_2$ ,
$L_1pL_2\ using\ u(x) \leftrightarrow L_1pL_2\ using\ v(x) \\ L_1iL_2\ using\ u(x) \leftrightarrow L_1iL_2\ using\ v(x)$
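A short sketch of why the lemma holds, assuming lotteries are ranked by expected utility. Write $v(x)=au(x)+b$ with $a>0$ and take any lottery $L=(p_1,r_1;\dots;p_n,r_n)$ with $\sum_{i=1}^np_i=1$ :
$E(V\ for\ L)=\sum_{i=1}^np_i(au(r_i)+b)=a\sum_{i=1}^np_iu(r_i)+b\sum_{i=1}^np_i=a\,E(U\ for\ L)+b$
Since $a>0$ , $E(U\ for\ L_1)>E(U\ for\ L_2)$ holds exactly when $E(V\ for\ L_1)>E(V\ for\ L_2)$ , and equality carries over in the same way, so $u$ and $v$ rank every pair of lotteries identically.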

definitions

expected value of L’s outcomes, EV(L)
$EV(L)=\sum_{i=1}^np_ir_i$
expected utility of L, E(U for L)
$E(U\ for\ L)=\sum_{i=1}^np_iu(r_i)$
certainty equivalent of L, CE(L)

is the number such that a decision maker is indifferent between L and receiving a certain payoff of CE(L)
$E(U\ for\ L)=u(CE(L))$
risk premium of L, RP(L)
$RP(L)=EV(L)-CE(L)$
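The four definitions translate almost directly into code. The sketch below assumes a lottery is a list of `(probability, reward)` pairs and that $u$ is continuous and increasing, so that $CE(L)$ can be found by bisection when no closed-form inverse is at hand. The lottery used is hypothetical.

```python
import math

def expected_value(lottery):
    return sum(p * r for p, r in lottery)

def expected_utility(lottery, u):
    return sum(p * u(r) for p, r in lottery)

def certainty_equivalent(lottery, u, tol=1e-9):
    """Solve u(CE) = E(U for L) by bisection; assumes u is continuous and increasing."""
    target = expected_utility(lottery, u)
    lo = min(r for _, r in lottery)   # CE cannot be below the worst reward
    hi = max(r for _, r in lottery)   # or above the best reward
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if u(mid) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def risk_premium(lottery, u):
    return expected_value(lottery) - certainty_equivalent(lottery, u)

# Hypothetical lottery: 400 with probability 0.25, 100 with probability 0.75.
L = [(0.25, 400), (0.75, 100)]
ev = expected_value(L)
eu = expected_utility(L, math.sqrt)
ce = certainty_equivalent(L, math.sqrt)
rp = risk_premium(L, math.sqrt)
print(ev, eu, ce, rp)
```

With $u(x)=\sqrt{x}$ the inverse is simply $u^{-1}(v)=v^2$ , so the bisection result can be checked by hand: $E(U\ for\ L)=12.5$ and $CE(L)=12.5^2=156.25$ .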

risk attitudes

| risk attitude | risk premium | utility function |
| --- | --- | --- |
| risk-averse | $RP(L)>0$ | u(x) is strictly concave |
| risk-neutral | $RP(L)=0$ | u(x) is linear |
| risk-seeking | $RP(L)<0$ | u(x) is strictly convex |
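As a rough illustration of the three cases, take a hypothetical lottery $L=(0.5,0;0.5,100)$ (not from the source), so that $EV(L)=50$ :
- $u(x)=\sqrt{x}$ (strictly concave): $E(U\ for\ L)=5,\ CE(L)=5^2=25,\ RP(L)=25>0$
- $u(x)=x$ (linear): $E(U\ for\ L)=50,\ CE(L)=50,\ RP(L)=0$
- $u(x)=x^2$ (strictly convex): $E(U\ for\ L)=5{,}000,\ CE(L)=\sqrt{5{,}000}\approx70.71,\ RP(L)\approx-20.71<0$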

graphic illustration

$L=(p,x_1;1-p,x_2),x_1<x_2$


Exercise

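value of insurance (Winston 2004, pp. 85-86)
- cash: $10,000
- home: $90,000
- accident probability: 0.1%
Question: how much is the decision maker willing to pay for insurance against the house being destroyed?
Assumption: $u(x)=x^{1/2}$ , where x is total wealth (cash plus house)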
solution
$L_1=(1,\ \$100{,}000-y),\ L_2=\dots$
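The rest of the solution did not survive, so the following is only a sketch of how the exercise could be completed numerically. It assumes that $y$ is the insurance premium, that insurance fully reimburses the loss, and that without insurance the decision maker faces the lottery (0.001, $10,000; 0.999, $100,000). None of these assumptions is confirmed by the text above.

```python
import math

cash = 10_000
home = 90_000
p_loss = 0.001                      # accident probability of 0.1%

def u(x):
    return math.sqrt(x)             # utility of total wealth, as given in the exercise

def u_inv(v):
    return v ** 2                   # inverse of the square-root utility

wealth_intact = cash + home
wealth_destroyed = cash             # assumption: an uninsured accident wipes out the home value

# Expected utility and certainty equivalent of staying uninsured.
eu_uninsured = p_loss * u(wealth_destroyed) + (1 - p_loss) * u(wealth_intact)
ce_uninsured = u_inv(eu_uninsured)

# Indifference: u(wealth_intact - y) = eu_uninsured, so y = wealth_intact - CE.
max_premium = wealth_intact - ce_uninsured
expected_loss = p_loss * home
print(round(max_premium, 2), expected_loss)
```

Under these assumptions the largest acceptable premium comes out at roughly $137, which is above the expected loss of $90. That gap is what a risk-averse decision maker with a concave utility is willing to pay to remove the risk.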

example

1.


current assets: $5,000
$u(x)=\sqrt{x}/1{,}000$

| investment 1 | investment 2 |
| --- | --- |
| +$295,000 with probability 80%; +$95,000 with probability 20% | +$595,000 with probability 50%; +$5,000 with probability 50% |
| $EV(L_1)=80\%\times\$300{,}000+20\%\times\$100{,}000=\$260{,}000$ | $EV(L_2)=\$305{,}000$ |
| $E(U\ for\ L_1)=80\%\times0.55+20\%\times0.32=0.504$ | $E(U\ for\ L_2)=0.437$ |
| $CE(L_1)=u^{-1}(E(U\ for\ L_1))=(0.504\times1{,}000)^2=\$254{,}016$ | $CE(L_2)=\$190{,}969$ |
| $RP(L_1)=EV(L_1)-CE(L_1)=\$5{,}984>0$ | $RP(L_2)=\$114{,}031>0$ |
| risk-averse | risk-averse |

We are risk-averse.

$CE(L_1)>CE(L_2)$

We prefer investment 1.
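The comparison can be re-run without rounding the utilities. The sketch below assumes that the $5,000 is the current asset position, so each gain is added to it before applying $u$ ; this matches the $300,000 and $100,000 used in $EV(L_1)$ .

```python
import math

initial = 5_000                     # assumption: current assets before investing

def u(x):
    return math.sqrt(x) / 1_000

def u_inv(v):
    return (v * 1_000) ** 2         # inverse of u

# Each investment is a list of (probability, gain) pairs.
investments = {
    "investment 1": [(0.8, 295_000), (0.2, 95_000)],
    "investment 2": [(0.5, 595_000), (0.5, 5_000)],
}

results = {}
for name, gains in investments.items():
    lottery = [(p, initial + g) for p, g in gains]   # final asset positions
    ev = sum(p * x for p, x in lottery)
    eu = sum(p * u(x) for p, x in lottery)
    ce = u_inv(eu)
    results[name] = {"EV": ev, "EU": eu, "CE": ce, "RP": ev - ce}
    print(name, results[name])
```

Because the utilities are not rounded to two decimals here, the certainty equivalents come out slightly different from the hand calculation (roughly $251,400 and $191,200), but both risk premiums stay positive and investment 1 still has the larger certainty equivalent.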

2.


risk-neutral, so $RP(L)=0,\ EV(L)=CE(L)$ , and $u(x)$ is linear.

$P(the\ first\ head\ is\ obtained\ on\ the\ nth\ toss\ of\ the\ coin)=\frac{1}{2^n}$

$\begin{aligned} p_i & =\frac{1}{2^i},\ r_i=2^i \\ EV(L) & =\sum_{i=1}^np_ir_i=\frac{1}{2}\times2+\frac{1}{2^2}\times2^2+\dots+\frac{1}{2^n}\times2^n=n \\ CE(L) & =EV(L)=n \\ n & \rightarrow\infty,\ CE(L)\rightarrow\infty \\ \because\ & u(x)\ is\ linear \\ \therefore\ & u(x)\rightarrow\infty,\ E(U\ for\ L)\rightarrow\infty \end{aligned}$

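This is unreasonable: nobody would pay an unbounded amount to play this game.

With a concave utility function instead, $u(x)=\log_2 (x)$
$\begin{aligned} u(r_i) & =\log_2(r_i)=\log_2(2^i)=i \\ E(U\ for\ L) & =\sum_{i=1}^np_iu(r_i)=\sum_{i=1}^n\frac{1}{2^i}i=2-\frac{1}{2^{n-1}}-\frac{n}{2^n} \\ CE(L) & =u^{-1}(E(U\ for\ L))=2^{2-\frac{1}{2^{n-1}}-\frac{n}{2^n}} \end{aligned}$
As $n\rightarrow\infty$ , $E(U\ for\ L)\rightarrow2$ and $CE(L)\rightarrow2^2=4$ , which is finite.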
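The closed form for the logarithmic case can be checked numerically. This is a small sketch that truncates the game at $n$ tosses, as in the sums above; the function names are illustrative.

```python
import math

def ev_linear(n):
    """Expected payoff when the game is truncated at n tosses (p_i = 1/2^i, r_i = 2^i)."""
    return sum((1 / 2**i) * 2**i for i in range(1, n + 1))

def eu_log2(n):
    """Expected utility of the truncated game under u(x) = log2(x)."""
    return sum((1 / 2**i) * math.log2(2**i) for i in range(1, n + 1))

def closed_form(n):
    """Closed form of the same sum: 2 - 1/2^(n-1) - n/2^n."""
    return 2 - 1 / 2**(n - 1) - n / 2**n

for n in (1, 5, 10, 20, 40):
    eu = eu_log2(n)
    # Columns: n, expected payoff, expected utility, certainty equivalent 2^E(U)
    print(n, ev_linear(n), eu, 2**eu)
```

The expected payoff grows linearly in $n$ , while the certainty equivalent under $\log_2$ utility stays below 4 however many tosses are allowed.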