Probability Distributions play an important role in our daily lives. We commonly use them when trying to summarise and gain insights from different forms of data.
概率分布在我们的日常生活中起着重要作用。 在尝试总结不同形式的数据并从中获取见解时,我们通常使用它们。
Because of this, they're quite an important topic in fields such as Mathematics, Computer Science, Statistics, and Data Science.
因此,它们是数学,计算机科学,统计和数据科学等领域的重要主题。
There are two main types of data: Numerical (for example integers and floats), and Categorical (for example strings of text).
数据有两种主要类型: 数值 (例如整数和浮点数)和分类 (例如文本字符串)。
Numerical data can also be in either of two forms:
数值数据也可以采用以下两种形式之一:
Discrete: this form of data can just take a limited number of values (like the number of clothes we have). We can infer probability mass functions from discrete data.
离散的:这种形式的数据只能接受有限数量的值(例如我们拥有的衣服数量)。 我们可以从离散数据推断概率质量函数。
Continuous: on the other hand, continuous data is used to describe more abstract concepts such as weight/distance which can take any fractional or real value. From continuous data we can instead infer probability density functions.
连续的:另一方面,连续的数据用于描述更抽象的概念,例如权重/距离,它可以取任何分数或实数值。 我们可以从连续数据中推断出概率密度函数。
Probability mass functions can give us the probability that a variable is equal to a certain value. On the other hand, the values of probability density functions do not represent probabilities on their own, but instead first need to be integrated (within the considered range).
概率质量函数可以为我们提供变量等于某个值的概率。 另一方面,概率密度函数的值本身并不表示概率,而是首先需要积分(在所考虑的范围内)。
什么是泊松分布? (What is a Poisson Distribution?)
Poisson Distributions are commonly used for two main purposes:
泊松分布通常用于两个主要目的:
- Predicting how many times an