Introduction to Statistics in R: 02-Random Numbers and Probability

521R

Last modified 2024-03-15 01:59:59

First published 2024-03-14 15:23:16

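Copyright notice: This is an original article by the author, licensed under CC 4.0 BY-SA. When reposting, please include a link to the original source and this notice.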

Article link: https://blog.csdn.net/m0_67662731/article/details/136711630

Random Numbers and Probability

What are the chances?

Measuring chance

What's the probability of an event?

Example: a coin flip
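
A minimal sketch of the coin-flip example, assuming the usual definition of probability (the number of ways an event can happen divided by the total number of possible outcomes). The names `outcomes` and `p_heads` are illustrative and do not come from the original notes.

```r
# Possible outcomes of one fair coin flip (illustrative names)
outcomes <- c("heads", "tails")

# P(heads) = ways the event can happen / total possible outcomes
p_heads <- sum(outcomes == "heads") / length(outcomes)
p_heads
# 0.5
```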

Assigning salespeople

Sampling from a data frame

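library(dplyr)  # provides %>% and sample_n()

sales_counts

# Pick one row at random; the result changes from run to run
sales_counts %>%
    sample_n(1)

sales_counts %>%
    sample_n(1)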

Setting a random seed

set.seed(5)
sales_counts %>%
    sample_n(1)

set.seed(5)
sales_counts %>%
    sample_n(1)
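
The effect of the seed can be checked directly. The sketch below uses base R's `sample()` on a small vector so that it runs without the course data; the contents of `sales_team` are hypothetical names chosen for illustration.

```r
# Hypothetical sales team; these names are assumptions for illustration
sales_team <- c("Amir", "Brian", "Claire", "Damian")

set.seed(5)
first_pick <- sample(sales_team, 1)

set.seed(5)  # resetting the same seed replays the same random draw
second_pick <- sample(sales_team, 1)

identical(first_pick, second_pick)
# TRUE
```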

Sampling with replacement in R

sales_counts %>%
    sample_n(2, replace=TRUE)

sample(sales_team, 5, replace=TRUE)

Independent events

Two events are independent if the probability of the second event isn't affected by the outcome of the first event.

Sampling with replacement = each pick is independent

Dependent events

Two events are dependent if the probability of the second event is affected by the outcome of the first event.

Sampling without replacement = each pick is dependent
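
A small worked example may make the contrast concrete. It assumes a hypothetical four-person team and asks for the chance that Claire is picked second, given that Brian was picked first.

```r
# Hypothetical sales team; these names are assumptions for illustration
sales_team <- c("Amir", "Brian", "Claire", "Damian")

# With replacement: the first pick goes back, so the pool is unchanged
p_claire_second_with <- 1 / length(sales_team)

# Without replacement: if Brian was picked first, only 3 people remain
remaining <- setdiff(sales_team, "Brian")
p_claire_second_without <- sum(remaining == "Claire") / length(remaining)

p_claire_second_with     # 0.25
p_claire_second_without  # 0.3333333
```

With replacement the probability stays at 1/4 whatever happened first; without replacement it rises to 1/3 because the first pick changed the pool.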

Discrete distributions

Rolling the dice

Choosing salespeople

Probability distribution

Describes the probability of each possible outcome in a scenario

Visualizing a probability distribution

Probability = area

Uneven die

Visualizing uneven probabilities

Adding areas

Discrete probability distributions

Describe probabilities for discrete outcomes
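
As a sketch of how the expected value and "adding areas" work for a fair die: the data frame layout below, including the `prob` column, is an assumption made for illustration and may differ from the `die` object used in the course.

```r
# Assumed layout: one row per face of a fair six-sided die
die <- data.frame(n = 1:6, prob = rep(1 / 6, 6))

# Expected value = sum of each outcome times its probability
expected_value <- sum(die$n * die$prob)
expected_value
# 3.5

# "Adding areas": P(roll <= 2) is the sum of the bars for 1 and 2
p_at_most_2 <- sum(die$prob[die$n <= 2])
p_at_most_2
# 0.3333333
```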

Sampling from discrete distributions

die

mean(die$n)

rolls_10 <- die %>%
	sample_n(10, replace = TRUE)
rolls_10

Visualizing a sample

library(ggplot2)  # provides ggplot() and geom_histogram()

ggplot(rolls_10, aes(n)) +
	geom_histogram(bins = 6)

Sample distribution vs. theoretical distribution

A bigger sample

Law of large numbers

As the size of your sample increases, the sample mean will approach the expected value
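
The law of large numbers can be illustrated with a quick simulation. This is a sketch using base R's `sample()` on a fair die; the seed and the sample sizes are arbitrary choices.

```r
set.seed(42)  # arbitrary seed so the run is repeatable

sample_sizes <- c(10, 100, 1000, 100000)

# Mean of n simulated fair-die rolls, for each sample size
sample_means <- sapply(sample_sizes, function(size) {
  mean(sample(1:6, size, replace = TRUE))
})

data.frame(size = sample_sizes, sample_mean = sample_means)
```

The sample means for the larger sizes should sit closer to the expected value of 3.5 than those for the smaller sizes.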

Continuous distribution

Waiting for the bus

Continuous uniform distribution

Probability still = area

Uniform distribution in R

punif(7, min = 0, max = 12)
# 0.5833333

lower.tail

punif(7, min = 0, max = 12, lower.tail = FALSE)
# 0.4166667

# P(4 <= wait <= 7): subtract the area below 4 from the area below 7
punif(7, min = 0, max = 12) - punif(4, min = 0, max = 12)
# 0.25

Total area = 1
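
To connect `punif()` with the "probability = area" picture, the sketch below recomputes P(wait <= 7) as the area of a rectangle and checks that the two tails add up to 1. The variable names are illustrative.

```r
bus_min <- 0
bus_max <- 12

# P(wait <= 7) as rectangle area: width * height
width  <- 7 - bus_min
height <- 1 / (bus_max - bus_min)
area_le_7 <- width * height

all.equal(area_le_7, punif(7, min = bus_min, max = bus_max))
# TRUE

# The two tails always add up to the total area of 1
total_area <- punif(7, min = bus_min, max = bus_max) +
  punif(7, min = bus_min, max = bus_max, lower.tail = FALSE)
total_area
# 1
```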

Other continuous distributions

Other special types of distributions

The binomial distribution

Coin flipping

Binary outcomes

A single flip

# rbinom(number of trials, number of coins, probability of heads/success)

1 = heads, 0 = tails

rbinom(1, 1, 0.5)
# 1

rbinom(1, 1, 0.5)
# 0

One flip many times

rbinom(8, 1, 0.5)
# 1 0 0 1 0 0 1 0

Many flips one time

rbinom(1, 8, 0.5)
# 3

Many flips many times

rbinom(10, 3, 0.5)
# 2 0 1 0 1 1 3 3 3 1

Other probabilities

rbinom(10, 3, 0.25)
# 1 1 0 0 1 1 1 1 2 1

Binomial distribution

Probability distribution of the number of successes in a sequence of independent trials

E.g. Number of heads in a sequence of coin flips

Described by n and p

n: total number of trials
p: probability of success

What's the probability of 7 heads?

P(heads = 7)

# dbinom(num heads, num trials, prob of heads)
dbinom(7, 10, 0.5)
# 0.1171875

What's the probability of 7 or fewer heads?

P(heads <= 7)

pbinom(7, 10, 0.5)
# 0.9453125

What's the probability of more than 7 heads?

P(heads > 7)

pbinom(7, 10, 0.5, lower.tail = FALSE)
# 0.0546875

1 - pbinom(7, 10, 0.5)
# 0.0546875
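
As a cross-check on what `dbinom()` and `pbinom()` return, the sketch below recomputes both values from the binomial formula. The variable names are illustrative.

```r
n_flips <- 10
p_heads <- 0.5

# P(heads = 7) from the binomial formula: choose(n, k) * p^k * (1 - p)^(n - k)
p_exactly_7 <- choose(n_flips, 7) * p_heads^7 * (1 - p_heads)^(n_flips - 7)
all.equal(p_exactly_7, dbinom(7, n_flips, p_heads))
# TRUE

# P(heads <= 7) is the sum of P(heads = k) for k = 0, ..., 7
p_at_most_7 <- sum(dbinom(0:7, n_flips, p_heads))
all.equal(p_at_most_7, pbinom(7, n_flips, p_heads))
# TRUE
```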

Expected value

Expected value = n x p

Expected number of heads out of 10 flips = 10 x 0.5 = 5
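
The formula can be compared against a simulation. This is a sketch; the seed and the number of simulated rounds are arbitrary choices.

```r
set.seed(1)  # arbitrary seed so the run is repeatable

n_flips <- 10
p_heads <- 0.5
expected_heads <- n_flips * p_heads  # 10 x 0.5 = 5

# Simulate 100,000 rounds of 10 flips and average the head counts
simulated_heads <- rbinom(100000, n_flips, p_heads)
mean(simulated_heads)
```

The simulated average should land very close to 5.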

Independence

The binomial distribution is a probability distribution of the number of successes in a sequence of independent trials

If the probabilities of the second trial are altered by the outcome of the first, the trials are not independent

If trials are not independent, the binomial distribution does not apply!
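
To see how far off the binomial can be when trials are dependent, consider drawing 2 cards without replacement from a standard 52-card deck and asking for 2 aces. The card scenario is an illustration that is not in the original notes, and the hypergeometric function `dhyper()` is brought in only as a point of comparison.

```r
# Drawing 2 cards from a 52-card deck with 4 aces (illustrative scenario)

# Without replacement, the second draw depends on the first
p_two_aces_dependent <- (4 / 52) * (3 / 51)

# The binomial formula assumes the chance stays 4/52 on every draw
p_two_aces_binomial <- dbinom(2, 2, 4 / 52)

# The hypergeometric distribution models draws without replacement:
# dhyper(aces drawn, aces in deck, non-aces in deck, cards drawn)
p_two_aces_hyper <- dhyper(2, 4, 48, 2)

c(dependent = p_two_aces_dependent,
  binomial  = p_two_aces_binomial,
  hyper     = p_two_aces_hyper)
```

The hand calculation and `dhyper()` agree, while the binomial value is larger because it ignores that one ace has already left the deck.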
