Stochastic Process: the News Vendor Problem

ZJ_Frank

Article link: https://blog.csdn.net/ZJ_11701/article/details/107765294

Introduction

Many scenarios can be described as stochastic processes, such as Go, DiDi, inventory, patient wards, and stock portfolios. This semester, we are going to talk about Markov chains.

To begin with, let's look at a simple yet interesting problem: the News Vendor Problem. Its setting is very natural. Suppose you are a vendor selling a New York paper. You can easily observe that the demand is not the same every day. For example, if an election is coming up, more people may buy papers. As a vendor, you want to figure out the best quantity to order, so that you earn the largest amount of money (in expectation). You may cleverly record each day's demand and estimate a demand distribution. Congrats! If you make use of the data you have, you can make more money!

Setting

To be more rigorous, the setting is usually as follows:

The product is perishable (say, after today it's worthless).

You sell it at price $c_p$ and buy it at cost $c_v$ per item. If you are left with unsold items, you get a salvage value of $c_s$ per item.

Each day's demand $D$ is random and follows a known distribution, with CDF $F$ (and density $f$ in the continuous case).

Clearly, for the problem to make sense we need $c_p > c_v > c_s$; otherwise, the vendor has no motive to do the business. Also, in some cases $c_s$ may be negative, for example if we take environmental issues (disposal costs) into consideration.

[There could also be fixed cost $c_f$ to order. Or some “holding cost” $h$ . But they shall not affect the optimal quantity. ]

Strategies

As a clever vendor, there are clearly many strategies you could follow to make the most money. Here, we suppose you are lazy and want to find the best fixed quantity to order every day.

Say you order $q$ items. Then, if today's demand is $D$, you will make $c_p\min (D, q) - c_v q + c_s \max(0, q-D)$ . That is, you sell at most the minimum of the demand and the number you ordered (which cost you $c_v q$ ). If there are leftover items, you can salvage them to get $c_s \max(0, q-D)$ .

That is,
$c_p \min(q,D) - c_v q + c_s \max(0, q-D) \\ = c_p (q\wedge D) - c_v q + c_s (q-D)^+$
Since the demand is random, we care about the expected profit, that is,
$\max _q \, h(q) = \mathbb{E} [profit(q, D)] \\ = c_p \mathbb{E}[q\wedge D] - c_v q + c_s \mathbb{E}[(q-D)^+]$
Here is a small trick that will be useful: $(q-D)^+ + (q\wedge D) \equiv q$ , since every item ordered is either sold or left over. To solve this optimization problem, it's natural to think about "taking the derivative", right? OK, let's start by assuming the distribution of $D$ is continuous. We shall get some insights from there, then we do the discrete case.
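To make the profit formula concrete, here is a minimal Python sketch. The function names, the prices, and the three-point demand distribution are all made up for illustration; they are not from any real data.

```python
def profit(q, d, c_p, c_v, c_s):
    """One-day profit when ordering q items and facing demand d."""
    sold = min(q, d)           # q ∧ D: we cannot sell more than we ordered
    leftover = max(0, q - d)   # (q - D)^+: unsold items, salvaged at c_s
    assert sold + leftover == q  # the "small trick" identity
    return c_p * sold - c_v * q + c_s * leftover


def expected_profit(q, demand_pmf, c_p, c_v, c_s):
    """Exact h(q) = E[profit(q, D)] for a finite discrete demand.

    demand_pmf maps each possible demand value to its probability.
    """
    return sum(p * profit(q, d, c_p, c_v, c_s) for d, p in demand_pmf.items())


# Hypothetical example: sell at 5, buy at 3, salvage at 1.
demand_pmf = {8: 0.25, 10: 0.5, 12: 0.25}
c_p, c_v, c_s = 5, 3, 1

for q in (8, 10, 12):
    print(q, expected_profit(q, demand_pmf, c_p, c_v, c_s))
```

With these numbers, ordering too few (8) or too many (12) both give a lower expected profit than ordering 10, which is the kind of tradeoff the rest of the post makes precise.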

Continuous Case

Write $y$ for the order quantity (the $q$ above), and let $f$ and $F$ be the density and CDF of $D$. Then

$\mathbb{E} [profit(y, D)] = c_p \mathbb{E}[y \wedge D] - c_v y + c_s \mathbb{E}[(y-D)^+] \\ = c_p \mathbb{E}[y \wedge D] - c_v y + c_s \mathbb{E}[y - (y\wedge D)] \\ = (c_p - c_s) \mathbb{E}[y \wedge D] + (c_s - c_v) y\\ = (c_p-c_s) (\int_0^y x f(x)dx + \int_y^\infty y f(x)dx) - c_v y + c_s y \\ = (c_p-c_s) (\int_0^y x f(x)dx + y (1-F(y) )) - c_v y + c_s y \\ = (c_p - c_s)[\int_0^y xf(x)dx - yF(y)] + (c_p - c_v) y$

To find the maximum, we take the derivative with respect to $y$ : $h'(y) = (c_p - c_s)[ yf(y) - F(y)-y f(y) ] + (c_p - c_v) \\ = (c_p - c_v) - (c_p - c_s) F(y)$ . Since $h''(y) = -(c_p - c_s) f(y) \le 0$ , $h$ is concave, so setting $h'(y) = 0$ gives the maximizer: $F(y) = \frac{c_p - c_v} {c_p - c_s}$ , that is, the maximum of $h$ is attained at $y^* = F^{-1} [(c_p-c_v) / (c_p - c_s)]$ (assuming $F$ is invertible). This is the optimal quantity to order. The number has an economic interpretation: $c_p - c_v$ is the amount you would gain by ordering one more item when the demand exceeds your order, and $c_v - c_s$ is the amount you would save by ordering one less item when the demand falls short of your order. The ratio $(c_p - c_v) / (c_p - c_s) = (c_p - c_v) / (c_p - c_v + c_v- c_s)$ measures the tradeoff between the two.

But wait! How can we order a fractional number of papers? Like 10.3 papers? That's certainly absurd, and the demand is also discrete. We will see in a moment that the conclusion for the discrete case is quite similar: the optimal ordering quantity $y^*$ is the least number such that $F(y^*) \ge \frac{c_p - c_v} {c_p - c_s}$
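As a quick illustration of the formula $y = F^{-1}[(c_p - c_v)/(c_p - c_s)]$, here is a small sketch using only Python's standard library. The normal demand model (mean 100, standard deviation 20) and the prices are assumptions chosen for the example; a normal distribution also allows negative demand, so treat it as a rough approximation.

```python
from statistics import NormalDist


def critical_ratio(c_p, c_v, c_s):
    """The ratio (c_p - c_v) / (c_p - c_s); requires c_p > c_v > c_s."""
    if not c_p > c_v > c_s:
        raise ValueError("need c_p > c_v > c_s")
    return (c_p - c_v) / (c_p - c_s)


def optimal_quantity_continuous(demand, c_p, c_v, c_s):
    """y* = F^{-1}(critical ratio) for a demand object with inv_cdf()."""
    return demand.inv_cdf(critical_ratio(c_p, c_v, c_s))


# Hypothetical example: demand ~ Normal(100, 20^2), sell 5, buy 2, salvage 1.
demand = NormalDist(mu=100, sigma=20)
y_star = optimal_quantity_continuous(demand, c_p=5, c_v=2, c_s=1)
print(y_star, demand.cdf(y_star))  # the CDF at y* equals the ratio 0.75
```

Because the ratio here is 0.75, which is above 0.5, the vendor orders more than the mean demand: a lost sale costs more than a leftover paper.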

Discrete Case

All right, let's turn our attention to the case when demand follows a discrete distribution, which is more realistic. We want to pick $y$ to maximize $h (y)$ , so at the optimum we must have $h(y+1) \le h(y)$ and $h(y)\ge h(y-1)$ . To use these conditions, consider the increment from ordering one more item: $h(y+1) - h(y) = \mathbb{E}[ profit(y+1, D) - profit(y, D) ] = \mathbb{E} [ revenue(y+1, D)- revenue(y, D) ] - c_v$ , where revenue counts both sales and salvage.

If $D \ge y+1$ , $revenue(y+1, D)- revenue(y, D) = c_p$ (the extra item is sold).

If $D \le y$ , $revenue(y+1, D)- revenue(y, D) = c_s$ (the extra item is left over).

Thus, $h(y+1) - h(y) = c_p P(D\ge y+1) + c_s P(D\le y) - c_v = c_p [1 - P(D\le y)] + c_s P(D\le y) - c_v = (c_p - c_v) - (c_p - c_s) P(D\le y)$
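If you want to convince yourself of the increment formula $h(y+1) - h(y) = (c_p - c_v) - (c_p - c_s) P(D \le y)$, a direct check on a small example helps. The sketch below uses exact fractions to avoid rounding issues; the demand distribution and prices are invented for the check.

```python
from fractions import Fraction


def profit(q, d, c_p, c_v, c_s):
    """One-day profit when ordering q items and facing demand d."""
    return c_p * min(q, d) - c_v * q + c_s * max(0, q - d)


def h(y, pmf, c_p, c_v, c_s):
    """Expected profit h(y) = E[profit(y, D)] for a finite discrete demand."""
    return sum(p * profit(y, d, c_p, c_v, c_s) for d, p in pmf.items())


def increment_formula(y, pmf, c_p, c_v, c_s):
    """Closed form (c_p - c_v) - (c_p - c_s) * P(D <= y)."""
    cdf_at_y = sum(p for d, p in pmf.items() if d <= y)
    return (c_p - c_v) - (c_p - c_s) * cdf_at_y


# Hypothetical demand on {0, ..., 4} and hypothetical prices.
pmf = {0: Fraction(1, 10), 1: Fraction(2, 10), 2: Fraction(3, 10),
       3: Fraction(2, 10), 4: Fraction(2, 10)}
c_p, c_v, c_s = 5, 2, 1

for y in range(0, 6):
    direct = h(y + 1, pmf, c_p, c_v, c_s) - h(y, pmf, c_p, c_v, c_s)
    assert direct == increment_formula(y, pmf, c_p, c_v, c_s)
    print(y, direct)
```

The printed increments are positive for small $y$ and turn negative once $P(D \le y)$ passes the critical ratio, which is exactly what the optimality conditions exploit.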

Suppose $y^*$ is the optimal quantity. Then we must have $h(y^*+1) - h(y^*) \le 0$ and $h(y^*) - h(y^*-1) \ge 0$ .

Since $h(y+1) - h(y) = (c_p - c_v) - (c_p - c_s) F(y)$ with $F(y) = P(D \le y)$ , these conditions give $F(y^*) \ge (c_p - c_v) / (c_p - c_s)$ and $F(y^*-1) \le (c_p - c_v) / (c_p - c_s)$ , that is, $F^{-1}[ (c_p - c_v) / (c_p - c_s) ] \le y^* \le 1 + F^{-1}[ (c_p - c_v) / (c_p - c_s) ]$

Therefore, we should pick $y^*$ equal to the ceiling of $F^{-1}[ (c_p - c_v) / (c_p - c_s) ]$ , i.e. $y^*$ is the least number such that $F(y^*) \ge (c_p - c_v) / (c_p - c_s)$ . In the special case $F(y^*) = (c_p - c_v) / (c_p - c_s)$ , both $y^*$ and $y^* + 1$ are optimal solutions.
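The rule "pick the least $y$ whose CDF reaches the critical ratio" is easy to code and to compare against brute force. The sketch below assumes integer demand on a finite support and uses exact fractions so that the tie case is detected reliably; the distribution and prices are hypothetical.

```python
from fractions import Fraction


def profit(q, d, c_p, c_v, c_s):
    """One-day profit when ordering q items and facing demand d."""
    return c_p * min(q, d) - c_v * q + c_s * max(0, q - d)


def h(y, pmf, c_p, c_v, c_s):
    """Expected profit for order quantity y."""
    return sum(p * profit(y, d, c_p, c_v, c_s) for d, p in pmf.items())


def optimal_quantity_discrete(pmf, c_p, c_v, c_s):
    """Least y with F(y) >= (c_p - c_v) / (c_p - c_s)."""
    ratio = Fraction(c_p - c_v) / Fraction(c_p - c_s)
    cdf = Fraction(0)
    for d in sorted(pmf):
        cdf += pmf[d]
        if cdf >= ratio:
            return d
    return max(pmf)  # only reached if the probabilities sum to less than 1


def optimal_quantity_brute_force(pmf, c_p, c_v, c_s):
    """Smallest maximizer of h over 0..max demand, by direct enumeration."""
    candidates = range(0, max(pmf) + 1)
    return max(candidates, key=lambda y: h(y, pmf, c_p, c_v, c_s))


# Hypothetical demand on {0, ..., 4}.
pmf = {0: Fraction(1, 10), 1: Fraction(2, 10), 2: Fraction(3, 10),
       3: Fraction(2, 10), 4: Fraction(2, 10)}

print(optimal_quantity_discrete(pmf, 5, 2, 1))     # ratio 3/4
print(optimal_quantity_brute_force(pmf, 5, 2, 1))  # should agree

# Tie case: with prices 6, 3, 1 the ratio is 3/5 = F(2), so 2 and 3 tie.
print(h(2, pmf, 6, 3, 1) == h(3, pmf, 6, 3, 1))
```

With floating-point probabilities the comparison `cdf >= ratio` can go wrong right at a tie, which is why exact fractions are used here; for real data a small tolerance would be a reasonable alternative.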

A Small Extension

Number to Report

In this section, let's extend the news vendor problem a bit. Say you are an amazing news vendor, and you have to report your expected profit over the next 3 months (90 days). What number should you report?

It's tempting to just say $90 h (q^*)$ , where $q^*$ is the optimal quantity. However, you may want to report a number that the future profit will exceed with 95% probability. This is where a confidence bound steps in. Let $Y_1, \dots, Y_{90}$ be the daily profits, assumed independent with mean $h(q^*)$ and variance $\sigma ^2$ , so that their sum is approximately normal. Requiring $P[\sum_{i=1}^{90} Y_i\ge x ] \ge 1-0.05$ gives $x\le 90 h(q^*) - z_{0.05}\sigma \sqrt{90}$ , where $z_{0.05} \approx 1.645$ is the upper 5% point of the standard normal distribution.

In reality, to report this number is complex. It may depend on the risk strategy of your company.
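For a rough idea of what such a report could look like, here is a sketch that computes the mean and standard deviation of the daily profit from a demand distribution and then applies a normal approximation to the 90-day total. It assumes daily profits are independent and identically distributed; the function name, distribution, and prices are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def profit(q, d, c_p, c_v, c_s):
    """One-day profit when ordering q items and facing demand d."""
    return c_p * min(q, d) - c_v * q + c_s * max(0, q - d)


def profit_lower_bound(q, pmf, c_p, c_v, c_s, days=90, alpha=0.05):
    """Approximate value x with P(total profit >= x) = 1 - alpha.

    Uses a normal approximation for the sum of `days` i.i.d. daily profits.
    """
    outcomes = [(profit(q, d, c_p, c_v, c_s), p) for d, p in pmf.items()]
    mean = sum(p * y for y, p in outcomes)               # h(q)
    var = sum(p * (y - mean) ** 2 for y, p in outcomes)  # sigma^2
    z = NormalDist().inv_cdf(1 - alpha)                  # z_alpha
    return days * mean - z * sqrt(var) * sqrt(days)


# Hypothetical demand on {0, ..., 4}; ordering q = 3 with prices 5, 2, 1.
pmf = {0: 0.1, 1: 0.2, 2: 0.3, 3: 0.2, 4: 0.2}
x = profit_lower_bound(3, pmf, c_p=5, c_v=2, c_s=1)
print(x)
```

In this example the daily profit has mean 5 and standard deviation 4, so the bound sits noticeably below the naive $90 \times 5 = 450$.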

Perishable Items with a Two-Period Lifetime

In the next post, we shall talk about how to order optimally when selling a product that does not perish so quickly, for example bananas. That is where we shall introduce the very basics of stochastic processes: the DTMC (discrete-time Markov chain).