CS224d: Deep Learning for NLP Lecture1 概率复习(1)

因为平时考试,我的报告分数特别低,主要是因为我的英文写作能力特别差,为了练习英文写作,部分博客(比较简单的内容)将用英文写作,望大家闲来无聊时,看看我的博客,指正错误。


Abstract

The main content is the review of cumulative distribution functions(CDFs), probability mass functions(PMFs) and probability density functions(PDFs).

Axioms of Probability

  • P(A)0, for all A, is a set of events
  • P(Ω)=1, Ω represents a sample space
  • If A1,A2,... are disjoint events, then P(iAi)=iP(Ai)

Conditional Probability and Independence

The conditional probability of any event A given B is defined as,

P(A|B)=P(AB)P(B)
P(A|B) means the probability measure of A when the event B occurs. We can see the event B as a universal set. Two events are called independent if and only if P(AB)=P(A)P(B) or P(A|B)=P(A) . Therefore, independence is equivalent to saying that observing B does not have any effect on the probability of A.

Random Variables

We denote random variables using upper case letters X(ω) or simply X . We denote the value that a random variable may take on using lower case letter x. Random variables are divided into discrete random variable and continuous random variable. When X can take on only a finite number of values, it is known as a discrete random variable. The probability of the set associated with a random variable X taking on some specific value k is:

P(X=k)=P({ω:X(ω)=k})
When X can take on a infinite number of possible values, it is called a continuous random variable. The probability that X takes on a value between two real constants a and b is:

P(aXb)=P({ω:aX(ω)b})

Cumulative Distribution Functions

A cumulative distribution function is a function FX:[0,1] which specifies a probability measure as,

FX(x)=P(Xx)
Properties:
- 0FX(x)1
- limxFX(x)=0
- limxFX(x)=1
- xyFX(x)FX(y)

Probability Mass Functions

When X is a discrete random variable, a simpler way to represent the probability of a random variable is to directly specify the probability of each value that the random variable can assume. A probability mass function is a function pX:Ω such that:

pX(x)=P(X=x)
The notation Val(X) is the set of possible values that the random variable X may assume.
Properties:
- 0pX(x)1
- xVal(X)pX(x)=1
- xApX(x)=P(XA)

Probability Density Functions

The CDF FX(x) is differentiable everywhere for some continuous random variables. The probability density functions is the derivative of the CDF:

fX(x)=dFX(x)dx
According to the properties of differentiation, for very small x , P(xXx+x)fX(x)x
Properties:
- fX(x)0
- fX(x)=1
- xAfX(x)dx=P(XA)

Reference:http://cs229.stanford.edu/section/cs229-prob.pdf

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值