How to read a Regression Table

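Regression analysis is an important data analysis method for studying the relationship between a dependent variable and one or more independent variables. Using an example with GRE scores and chances of admittance, the article explains how to draw a regression line and compute its equation, and walks through the parts of a regression table, including statistics in the ANOVA table such as degrees of freedom and sums of squares. The regression table shows how well the model explains the variation in the data; for example, R² is the percentage of the variation in the dependent variable that the model explains. The residual output also helps check the model's accuracy, for example by using residual plots to detect non-linearity, heteroscedasticity, and outliers.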

by Sharad Vijalapuram

What is regression?

Regression is one of the most important and commonly used data analysis processes. Simply put, it is a statistical method that explains the strength of the relationship between a dependent variable and one or more independent variable(s).

The dependent variable is the variable or field you are trying to predict or understand. The independent variables are the fields or data points that you think might have an impact on the dependent variable.

In doing so, it answers a few important questions:

  • What variables matter?
  • To what extent do these variables matter?
  • How confident are we about these variables?

Let’s take an example…

To better explain the numbers in the regression table, I thought it would be useful to use a sample dataset and walk through the numbers and their importance.

I’m using a small dataset that contains the GRE scores of 500 students and their chances of admittance into a university. (The GRE is a test that students take to be considered for admittance to graduate schools in the US.)

Because chance of admittance depends on GRE score, chance of admittance is the dependent variable and GRE score is the independent variable.

Regression line

Drawing the straight line that best describes the relationship between students’ GRE scores and their chances of admittance gives us the linear regression line. Various BI tools call this the trend line. The basic idea is to place the line so that the vertical distances between the data points and the line, each measured at the data point’s x-coordinate, are as small as possible overall.
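The idea of a best-fitting line can be sketched in a few lines of Python. This is a minimal sketch that assumes the line is fitted by ordinary least squares (the text does not name the method, although it is the usual one behind trend lines). The data points and the helper names `fit_line` and `sum_squared_errors` are made up for illustration and are not taken from the 500-student dataset.

```python
# Hypothetical (GRE score, chance of admittance) pairs, invented for illustration.
gre = [300, 305, 310, 315, 320, 325, 330, 335]
chance = [0.45, 0.52, 0.58, 0.63, 0.70, 0.78, 0.84, 0.90]


def fit_line(xs, ys):
    """Return (slope, intercept) of the ordinary least-squares line."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Co-variation of x and y, and variation of x, around their means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def sum_squared_errors(xs, ys, slope, intercept):
    """Sum of squared vertical distances between the points and a line."""
    return sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))


slope, intercept = fit_line(gre, chance)
best = sum_squared_errors(gre, chance, slope, intercept)
# A line with a slightly different slope fits these points worse.
worse = sum_squared_errors(gre, chance, slope * 1.05, intercept)

print(f"slope={slope:.4f}, intercept={intercept:.4f}")
print(f"fitted line error={best:.5f}, perturbed line error={worse:.5f}")
```

Comparing the two error totals shows what "best" means here: among all straight lines, the fitted one has the smallest sum of squared vertical distances to the points.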

The regression line makes the relationship easier to represent. It is based on a mathematical equation that combines the x-coefficient (the slope of the line) with the y-intercept.

The y-intercept is the point at which the line crosses the y-axis, that is, where x = 0. It is also the value the model predicts when x is 0.

A coefficient gives the impact, or weight, of a variable within the model. In other words, it is the amount of change in the dependent variable for a unit change in the independent variable.
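To make the roles of the intercept and the coefficient concrete, here is a small sketch of a one-variable regression equation. The numbers `INTERCEPT` and `COEFFICIENT` and the function `predict` are hypothetical placeholders chosen for illustration; they are not estimates from the article's dataset.

```python
# Hypothetical values for illustration only; not estimated from real data.
INTERCEPT = -2.5     # value the model predicts when the GRE score is 0
COEFFICIENT = 0.01   # change in the prediction per one-point GRE increase


def predict(gre_score):
    """Predicted chance of admittance from the regression line equation."""
    return INTERCEPT + COEFFICIENT * gre_score


# At x = 0 the prediction equals the y-intercept.
at_zero = predict(0)

# A one-unit change in x shifts the prediction by the coefficient.
unit_change = predict(321) - predict(320)

print(at_zero)
print(round(unit_change, 6))
```

The first printed value is the intercept itself, and the second matches the coefficient, which is the sense in which the coefficient is "the change in the dependent variable for a unit change in the independent variable".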

Calculating the regression line equation

In order to find out the model’s y-intercept, we extend the regression line far enough until it intersects the y-axis at x = 0. This is our y
