Python网络爬虫与文本数据分析
pingouin是基于Pandas和numpy开发的Python3统计包。主要统计功能有
- 方差分析
- 多元线性回归
- 中介效应分析
- 卡方检验
- Q-Q图
- 贝叶斯因子
- 信效度检验
- 等等
我是统计小白,看不懂啊;还有很多功能没有列上,感兴趣的统计大神可以看看https://pingouin-stats.org/api.html
安装
pip3 install pingouin
快速上手
构造实验数据x,y
import numpy as np #控制代码每次随机状态保持一致 np.random.seed(666) n=30 mean= [4,5] cov = [(1, 0.6), (0.6, 1)] x, y = np.random.multivariate_normal(mean, cov, n).T x
array([3.04817645, 2.54387965, 4.56033188, 4.40504338, 3.77876203, 3.87177128, 3.4546112 , 4.47317551, 5.23133856, 5.40273745, 5.19344217, 3.37061786, 3.23980982, 2.85574177, 4.67728276, 4.31935242, 4.39440207, 3.87458876, 4.91426293, 3.13673286, 3.73459839, 4.18708647, 5.48558345, 3.7066784 , 3.73400287, 3.49664637, 3.95954844, 2.61545452, 5.11352964, 5.62666503])
y
array([4.47747109, 4.35695696, 5.46239455, 4.56091782, 4.07534588, 4.03904897, 3.79549165, 5.06121364, 5.71635355, 6.60772697, 6.94890455, 5.13347618, 5.41207983, 3.38254684, 5.49705058, 5.93394729, 4.65224366, 4.59491971, 5.17926604, 4.25844527, 5.72809738, 5.