Chapter 1 (Sample Space and Probability): Probabilistic Models

Article link: https://blog.csdn.net/weixin_42437114/article/details/109079766

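This article covers the basics of probabilistic models: sample spaces, events, sequential models, and probability laws. It introduces key concepts such as the sample space and the union, intersection, and difference of events, together with De Morgan's laws. It also covers sequential models, for example using a tree diagram to describe repeated coin tosses. It then explains properties of probability laws, such as the subtraction and addition formulas, and works through several problems. Finally, it covers discrete and continuous models, in particular geometric probability and the use of length as a probability assignment in continuous models.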


These are reading notes on the book Introduction to Probability.

Probabilistic Models

A probabilistic model is a mathematical description of an uncertain situation. Its two main ingredients are listed below and are visualized in Fig. 1.2.

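- The sample space $\Omega$, which is the set of all possible outcomes of an experiment.
- The probability law, which assigns to each event $A$ (a set of possible outcomes) a nonnegative number $P(A)$ that encodes our knowledge or belief about the collective likelihood of the elements of $A$.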

Sample Spaces and Events

Every probabilistic model involves an underlying process, called the experiment, that will produce exactly one out of several possible outcomes.
- It is important to note that in our formulation of a probabilistic model, there is only one experiment. So, three tosses of a coin constitute a single experiment rather than three experiments.
The set of all possible outcomes is called the sample space of the experiment, and is denoted by $\Omega$ .
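A subset of the sample space, that is, a collection of possible outcomes, is called an event (also called a random event).
- Let $A$ and $B$ be two events. The event $A\cup B$ is called the union (sum) of $A$ and $B$, also written $A + B$.
- The event $A\cap B$ is called the intersection (product) of $A$ and $B$, also written $A B$.
- The event $A - B$ is called the difference of $A$ and $B$; note that $A-B=A\bar B$.
- The event $\Omega-A$ is called the complement of $A$, also written $\bar A$ or $A^C$.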

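Supplement: De Morgan’s laws
$\left(\bigcup_{n}S_n\right)^c=\bigcap_{n}S_n^c,\quad \left(\bigcap_{n}S_n\right)^c=\bigcup_{n}S_n^c$
PROOF (first identity): $x\in\left(\bigcup_{n}S_n\right)^c$ if and only if $x\notin S_n$ for every $n$, that is, $x\in S_n^c$ for every $n$, which holds if and only if $x\in\bigcap_{n}S_n^c$. The second identity follows by applying the first to the sets $S_n^c$ and taking complements of both sides.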
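De Morgan's laws can be sanity-checked on a small finite example. The sketch below is only an illustration, not a proof: the universe `OMEGA`, the family `FAMILY`, and all function names are hypothetical choices made for this example.

```python
from functools import reduce

# Hypothetical finite universe and family of sets, chosen only for illustration.
OMEGA = frozenset(range(10))
FAMILY = [frozenset({0, 1, 2}), frozenset({2, 3, 4}), frozenset({4, 5, 6})]


def complement(s, omega=OMEGA):
    """Complement of s relative to the universe omega."""
    return omega - s


def union_all(sets):
    """Union of a (possibly empty) family of sets."""
    return reduce(frozenset.union, sets, frozenset())


def intersection_all(sets, omega=OMEGA):
    """Intersection of a family; an empty family gives the whole universe."""
    return reduce(frozenset.intersection, sets, omega)


def de_morgan_holds(sets, omega=OMEGA):
    """Check both De Morgan identities for the given family."""
    complements = [complement(s, omega) for s in sets]
    first = complement(union_all(sets), omega) == intersection_all(complements, omega)
    second = complement(intersection_all(sets, omega), omega) == union_all(complements)
    return first and second


print(de_morgan_holds(FAMILY))
```

A finite check like this only confirms the identities for one family; the element-wise argument is what establishes them in general.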

Sequential Models

Many experiments have an inherently sequential character; for example, tossing a coin three times. It is then often useful to describe the experiment and the associated sample space by means of a tree-based sequential description, as in Fig. 1.3.

Fig. 1.3 illustrates this for an experiment consisting of two rolls of a 4-sided die.

Note that every node of the tree can be identified with an event. For example, the node labeled by a 1 can be identified with the event {(1, 1), (1, 2), (1, 3), (1, 4)} that the result of the first roll is 1.
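The correspondence between tree nodes and events can be sketched by enumerating the outcomes directly. This is a minimal illustration that assumes two rolls of a 4-sided die, which matches the outcomes listed above; the names `FACES`, `sample_space`, and `node_event` are made up for this example.

```python
from itertools import product

FACES = (1, 2, 3, 4)  # assumed 4-sided die, matching the outcomes listed in the text

# Each leaf of the tree is an ordered pair (first roll, second roll).
sample_space = list(product(FACES, repeat=2))


def node_event(first_roll):
    """Event attached to the first-level tree node for a given first roll."""
    return {outcome for outcome in sample_space if outcome[0] == first_roll}


print(len(sample_space))
print(sorted(node_event(1)))
```

Each first-level node collects the leaves below it, so the four node events are disjoint and together cover the whole sample space.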

Probability Laws

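The probability law assigns to every event $A$ a number $P(A)$, called the probability of $A$, satisfying the following axioms.
- (Nonnegativity) $P(A)\geq 0$ for every event $A$.
- (Additivity) If $A$ and $B$ are two disjoint events, then $P(A\cup B)=P(A)+P(B)$. More generally, if the sample space has an infinite number of elements and $A_1, A_2, \ldots$ is a sequence of disjoint events, then $P(A_1\cup A_2\cup\cdots)=P(A_1)+P(A_2)+\cdots$.
- (Normalization) $P(\Omega)=1$.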
Corollary: since $\Omega$ and $\varnothing$ are disjoint, additivity and normalization give
$1=P(\Omega)=P(\Omega\cup\varnothing)=P(\Omega)+P(\varnothing)=1+P(\varnothing)$, and therefore $P(\varnothing)=0$.

Properties of Probability Laws

Subtraction formula: $P(A-B)=P(A\bar B)=P(A)-P(AB)$
Addition formula: $P(A\cup B)=P(A)+P(B)-P(AB)$
- Generalization: $P(A\cup B\cup C)=P(A)+P(B)+P(C)-P(AB)-P(AC)-P(BC)+P(ABC)$
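The formulas above can be checked numerically on a small model. The sketch below assumes one roll of a fair six-sided die with a uniform law; the events `A`, `B`, `C` and the helper `prob` are illustrative choices, not part of the original notes.

```python
from fractions import Fraction

# Assumed model: one roll of a fair six-sided die (uniform law).
OMEGA = frozenset(range(1, 7))


def prob(event):
    """Uniform probability of an event (a subset of OMEGA)."""
    return Fraction(len(event & OMEGA), len(OMEGA))


A = frozenset({1, 2, 3})      # roll is at most 3
B = frozenset({2, 4, 6})      # roll is even
C = frozenset({3, 4, 5, 6})   # roll is at least 3

# Subtraction formula: P(A - B) = P(A) - P(AB)
subtraction_lhs = prob(A - B)
subtraction_rhs = prob(A) - prob(A & B)

# Addition formula: P(A u B) = P(A) + P(B) - P(AB)
addition_lhs = prob(A | B)
addition_rhs = prob(A) + prob(B) - prob(A & B)

# Three-event generalization (inclusion-exclusion).
three_lhs = prob(A | B | C)
three_rhs = (prob(A) + prob(B) + prob(C)
             - prob(A & B) - prob(A & C) - prob(B & C)
             + prob(A & B & C))

print(subtraction_lhs == subtraction_rhs,
      addition_lhs == addition_rhs,
      three_lhs == three_rhs)
```

Using `Fraction` keeps the arithmetic exact, so the two sides can be compared with `==` rather than a floating-point tolerance.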

Problem 9

A partition of the sample space $\Omega$ is a collection of disjoint events $S_1, ... , S_n$ such that $\Omega = \cup_{i=1}^n S_i$ .

(a) Show that for any event $A$ , we have
$P(A)=\sum_{i=1}^nP(A\cap S_i)$
(b) Use part (a) to show that for any events $A, B$ , and $C$ , we have
$P(A)=P(A\cap B)+P(A\cap C)+P(A\cap B^C\cap C^C)-P(A\cap B\cap C)$
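Before attempting the proof, both identities can be tried out on a concrete case. This is a sketch under assumptions: a uniform law on twelve outcomes, with the events and the partition picked arbitrarily for illustration. It does not replace the proof.

```python
from fractions import Fraction

OMEGA = frozenset(range(12))  # assumed uniform sample space of 12 outcomes


def prob(event):
    """Uniform probability of an event."""
    return Fraction(len(event), len(OMEGA))


def is_partition(parts, omega=OMEGA):
    """True if the parts are pairwise disjoint and their union is omega."""
    covered = frozenset().union(*parts)
    return covered == omega and sum(len(p) for p in parts) == len(omega)


def total_probability(event, partition):
    """Right-hand side of part (a): the sum of P(event n S_i) over the partition."""
    return sum((prob(event & part) for part in partition), Fraction(0))


A = frozenset({0, 1, 2, 3, 4, 5})
B = frozenset({0, 1, 6, 7})
C = frozenset({1, 2, 8, 9})

# Part (a) with the partition {0..3}, {4..7}, {8..11}.
PARTITION = [frozenset(range(0, 4)), frozenset(range(4, 8)), frozenset(range(8, 12))]

# Part (b): right-hand side of the identity.
not_B, not_C = OMEGA - B, OMEGA - C
part_b_rhs = prob(A & B) + prob(A & C) + prob(A & not_B & not_C) - prob(A & B & C)

print(total_probability(A, PARTITION) == prob(A), part_b_rhs == prob(A))
```

In part (b) the subtracted term compensates for outcomes in $A\cap B\cap C$, which are counted in both $P(A\cap B)$ and $P(A\cap C)$.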

Problem 10.

Show the formula
$P((A\cap B^C)\cup(A^C\cap B))=P(A)+P(B)-2P(A\cap B),$ which gives the probability that exactly one of the events $A$ and $B$ will occur.

SOLUTION
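The events $A\cap B^C$ and $A^C\cap B$ are disjoint, so additivity applies; each term is then rewritten using $P(A\cap B^C)=P(A)-P(A\cap B)$:
$\begin{aligned}&P((A\cap B^C)\cup(A^C\cap B))\\ =&P(A\cap B^C)+P(A^C\cap B)\\ =&P(A)-P(A\cap B)+P(B)-P(A\cap B)\\ =&P(A)+P(B)-2P(A\cap B)\end{aligned}$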

Problem 11. Bonferroni’s inequality

(a) Prove that for any two events $A$ and $B$ , we have
$P(A\cap B)\geq P(A)+P(B)-1$

SOLUTION

(a) We have $P(A\cup B) = P(A) + P(B) - P(A\cap B)$ and $P(A\cup B)\leq1$, which together give $P(A\cap B)\geq P(A)+P(B)-1$.

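Alternatively, since the four events $A\cap B$, $A\cap B^C$, $A^C\cap B$, and $A^C\cap B^C$ partition $\Omega$:
$\begin{aligned}1+P(A\cap B)&=2P(A\cap B)+P(A\cap B^C)+P(A^C\cap B)+P(A^C\cap B^C)\\ &=(P(A\cap B)+P(A\cap B^C))+(P(A\cap B)+P(A^C\cap B))+P(A^C\cap B^C)\\ &=P(A)+P(B)+P(A^C\cap B^C)\\ &\geq P(A)+P(B)\end{aligned}$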
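Bonferroni's inequality can also be checked exhaustively on a small discrete model. The sketch below assumes a non-uniform law on four outcomes, with illustrative values stored in `LAW`, and tests every pair of events; all names are hypothetical.

```python
from fractions import Fraction
from itertools import chain, combinations

# Assumed non-uniform discrete law on four outcomes (illustrative values only).
LAW = {
    "a": Fraction(1, 2),
    "b": Fraction(1, 4),
    "c": Fraction(1, 8),
    "d": Fraction(1, 8),
}


def prob(event):
    """Probability of an event as the sum of its outcome probabilities."""
    return sum((LAW[outcome] for outcome in event), Fraction(0))


def all_events(outcomes):
    """Every subset of the sample space, i.e. every event."""
    items = list(outcomes)
    subsets = chain.from_iterable(combinations(items, r) for r in range(len(items) + 1))
    return [frozenset(s) for s in subsets]


def bonferroni_holds():
    """Check P(A n B) >= P(A) + P(B) - 1 for every pair of events."""
    events = all_events(LAW)
    return all(prob(a & b) >= prob(a) + prob(b) - 1 for a in events for b in events)


print(bonferroni_holds())
```

The bound is only informative when $P(A)+P(B)>1$; otherwise the right-hand side is at most zero and the inequality holds trivially.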

Problem 13. Continuity property of probabilities

$(a)$ Let $A_1, A_2, \ldots$ be an infinite sequence of events that is “monotonically increasing,” meaning that $A_n\subset A_{n +1}$ for every $n$. Let $A=\cup_{n=1}^\infty A_n$. Show that $P(A)=\lim_{n\rightarrow \infty} P(A_n)$.
[Hint: Express the event $A$ as a union of countably many disjoint sets.]
$(b)$ Suppose now that the events are “monotonically decreasing,” i.e., $A_{n + 1}\subset A_n$ for every $n$. Let $A=\cap_{n=1}^\infty A_n$. Show that $P(A)=\lim_{n\rightarrow \infty} P(A_n)$.
$(c)$ Consider a probabilistic model whose sample space is the real line. Show that
$P([0,\infty))=\lim_{n\rightarrow\infty}P([0,n])$ and $\lim_{n\rightarrow\infty}P([n,\infty))=0$

SOLUTION

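(a) Let $B_1 = A_1$ and, for $n\geq2$, $B_n = A_n\cap A_{n-1}^C$. The events $B_n$ are disjoint, and we have $\cup_{k=1}^n B_k= A_n$ and $\cup_{k=1}^\infty B_k= A$. By the additivity axiom,
$P(A)=P(\cup_{k=1}^\infty B_k)=\sum_{k=1}^\infty P(B_k)=\lim_{n\rightarrow \infty}\sum_{k=1}^n P(B_k)=\lim_{n\rightarrow \infty}P(\cup_{k=1}^n B_k)=\lim_{n\rightarrow \infty} P(A_n)$
(b) Apply the result of part (a) to the complements $A_n^C$. These are monotonically increasing, and by De Morgan's law their union is $A^C$, so $1-P(A)=P(A^C)=\lim_{n\rightarrow\infty}P(A_n^C)=\lim_{n\rightarrow\infty}(1-P(A_n))$, which gives $P(A)=\lim_{n\rightarrow\infty}P(A_n)$.
(c) For the first equality, use the result from part (a) with $A_n= [0, n]$ and $A=[0,\infty)$. For the second, use the result from part (b) with $A_n= [n,\infty)$ and $A=\cap_{n=1}^\infty A_n=\varnothing$.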
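Part (c) can be illustrated numerically once a concrete law on the real line is fixed. The problem itself does not specify one, so the sketch below assumes an exponential law with rate 1, for which $P([0,n])=1-e^{-n}$ and $P([n,\infty))=e^{-n}$ for $n\geq 0$. Both function names are hypothetical.

```python
import math


def prob_interval_0_to_n(n, rate=1.0):
    """P([0, n]) under an assumed exponential law with the given rate."""
    return 1.0 - math.exp(-rate * n)


def prob_tail_from_n(n, rate=1.0):
    """P([n, infinity)) under the same assumed law."""
    return math.exp(-rate * n)


# A_n = [0, n] is increasing, so P(A_n) should climb toward P([0, inf)) = 1.
increasing = [prob_interval_0_to_n(n) for n in range(1, 41)]

# A_n = [n, inf) is decreasing with empty intersection, so P(A_n) should fall to 0.
decreasing = [prob_tail_from_n(n) for n in range(1, 41)]

print(increasing[0], increasing[-1])
print(decreasing[0], decreasing[-1])
```

Any other law on the real line would show the same limiting behavior; the exponential law is used here only because its interval probabilities have a simple closed form.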

Discrete Models

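Discrete probability law
If the sample space consists of a finite number of possible outcomes, then the probability law is specified by the probabilities of the events that consist of a single element. In particular, the probability of any event $\{s_1, s_2, \ldots, s_n\}$ is the sum of the probabilities of its elements:
$P(\{s_1, s_2, \ldots, s_n\})=P(s_1)+P(s_2)+\cdots+P(s_n)$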

Note that we are using here the simpler notation $P(s_i)$ to denote the probability of the event $\{s_i\}$, instead of the more precise $P(\{s_i\})$.

Discrete uniform probability law (classical probability model)

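In the special case where the probabilities $P(s_1)$, … , $P(s_n)$ are all the same (by necessity equal to $1 / n$), we obtain the following: if the sample space consists of $n$ equally likely outcomes, then the probability of any event $A$ is
$P(A)=\frac{\text{number of elements of } A}{n}$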
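Under the uniform law, computing a probability reduces to counting. The sketch below assumes two rolls of a fair 4-sided die with all 16 outcomes equally likely; `uniform_prob` and the two example events are illustrative names, not taken from the book.

```python
from fractions import Fraction
from itertools import product

# Assumed experiment: two rolls of a fair 4-sided die, all 16 outcomes equally likely.
OMEGA = frozenset(product(range(1, 5), repeat=2))


def uniform_prob(event, omega=OMEGA):
    """P(A) = (number of elements of A) / n for equally likely outcomes."""
    event = frozenset(event)
    if not event <= omega:
        raise ValueError("event must be a subset of the sample space")
    return Fraction(len(event), len(omega))


doubles = {(i, j) for (i, j) in OMEGA if i == j}              # both rolls agree
even_sum = {(i, j) for (i, j) in OMEGA if (i + j) % 2 == 0}   # sum of rolls is even

print(uniform_prob(doubles), uniform_prob(even_sum))
```

Each single outcome gets probability $1/16$ here, and the probability of any event is just its size divided by 16.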

Continuous Models

Probabilistic models with continuous sample spaces differ from their discrete counterparts in that the probabilities of the single-element events may not be sufficient to characterize the probability law.
Geometric probability: the continuous uniform probability law

Example 1.4

A wheel of fortune is continuously calibrated from 0 to 1, so the possible outcomes of an experiment consisting of a single spin are the numbers in the interval $\Omega = [0, 1]$. Assuming a fair wheel, it is appropriate to consider all outcomes equally likely, but what is the probability of the event consisting of a single element? It cannot be positive, because then, using the additivity axiom, it would follow that events with a sufficiently large number of elements would have probability larger than 1. Therefore, the probability of any event that consists of a single element must be 0.
In this example, it makes sense to assign probability $b - a$ to any subinterval $[a, b]$ of $[0, 1]$ , and to calculate the probability of a more complicated set by evaluating its “length”.
The legitimacy of using length as a probability law hinges on the fact that the unit interval has an uncountably infinite number of elements.
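The idea of evaluating the "length" of a more complicated set can be sketched for the simplest such case, a finite union of closed subintervals of $[0, 1]$. The function name `length_probability` and its interface are assumptions made for this illustration.

```python
from fractions import Fraction


def length_probability(intervals):
    """Total length of a union of closed subintervals [a, b] of [0, 1].

    Overlapping intervals are merged first so no length is counted twice.
    """
    cleaned = []
    for a, b in intervals:
        a, b = Fraction(a), Fraction(b)
        if not (0 <= a <= b <= 1):
            raise ValueError("each interval must satisfy 0 <= a <= b <= 1")
        cleaned.append((a, b))

    total = Fraction(0)
    current_start = current_end = None
    for a, b in sorted(cleaned):
        if current_end is None or a > current_end:
            # Disjoint from the running interval: bank its length, start a new one.
            if current_end is not None:
                total += current_end - current_start
            current_start, current_end = a, b
        else:
            # Overlaps or touches the running interval: extend it.
            current_end = max(current_end, b)
    if current_end is not None:
        total += current_end - current_start
    return total


quarter, half, three_quarters = Fraction(1, 4), Fraction(1, 2), Fraction(3, 4)
print(length_probability([(quarter, half)]))
print(length_probability([(0, half), (quarter, three_quarters)]))
print(length_probability([(half, half)]))
```

A degenerate interval $[a, a]$ has length 0, which agrees with the observation above that any single-element event must have probability 0.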