纲要
boss说增加项目平台分析方法:
T检验(独立样本T检验)、线性回归、二元Logistics回归、因子分析、可靠性分析
根本不懂,一脸懵逼状态,分析部确实有人才,反正我是一脸懵
首先解释什么是二元Logistic回归分析吧
二元Logistics回归 可以用来做分类,回归更多的是用于预测
官方简介:
链接:https://pythonfordatascience.org/logistic-regression-python/
Logistic regression models are used to analyze the relationship between a dependent variable (DV) and independent variable(s) (IV) when the DV is dichotomous. The DV is the outcome variable, a.k.a. the predicted variable, and the IV(s) are the variables that are believed to have an influence on the outcome, a.k.a. predictor variables. If the model contains 1 IV, then it is a simple logistic regression model, and if the model contains 2+ IVs, then it isa multiple logistic regression model.
Assumptionsforlogistic regression models:
The DViscategorical (binary)
If there are more than2 categories interms of types of outcome, a multinomial logistic regression should be used
Independence of observations
Cannot be a repeated measures design, i.e. collecting outcomes at two different time points.
Independent variables are linearly related to the log odds
Absence of multicollinearity
Lack of outliers
原文
理解了什么是二元以后,开始找库
需要用的包
这里需要特别说一下,第一天晚上我就用的logit,