关于回归的一些学习思考

回归(regression)

回归(regression):Regression is a statistical method used in finance, investing, and other disciplines that attempts to determine the strength and character of the relationship between a dependent variable and one or more independent variables.

回归是一个统计方法。它可以确定因变量与一个或多个自变量之间关系的强度和特征。

a trend or shift toward a lower or less perfect state:such as  a functional relationship between two or more correlated variables that is often empirically determined from data and is used especially to predict values of one variable when given values of the others

在词典中的解释:回归是一个趋势或者转变。向较低或不太完美的状态的趋势或转变:例如两个或多个相关变量之间的函数关系,该关系通常根据经验从数据中确定,并且特别用于在给定单(多)变量的值时预测一个变量的值

the regression of y on x is linear

specifically a function that yields the mean value of a random variable under the condition that one or more independent variables have specified values

回归的应用

回归模型(Regression models) describe the relationship between variables by fitting a line to the observed data. Linear regression models use a straight line, while logistic and nonlinear regression models use a curved line. Regression allows you to estimate how a dependent variable changes as the independent variable(s) change.

根据上面的定义,回归模型被用于描述变量之间的关系。

回归模型通过将一条线,拟合观测数据来描述变量之间的关系。线性回归模型使用直线,而逻辑和非线性回归模型使用曲线。回归使得工作者估计因变量如何随着自变量的变化而变化。

重点在于利用自变量的变化来估计因变量变化​

以下是例子。

Simple linear regression example

You are a social researcher interested in the relationship between income and happiness. You survey 500 people whose incomes range from 15k to 75k and ask them to rank their happiness on a scale from 1 to 10.

Your independent variable (income) and dependent variable (happiness) are both quantitative, so you can do a regression analysis to see if there is a linear relationship between them.

If you have more than one independent variable, use multiple linear regressioncitecilaiz instead.

引用:Regression: Definition, Analysis, Calculation, and Example (investopedia.com)icon-default.png?t=N7T8https://www.investopedia.com/terms/r/regression.asp#What%20Is%20Regression?

              Regression Definition & Meaning - Merriam-Webstericon-default.png?t=N7T8https://www.merriam-webster.com/dictionary/regression

       Simple Linear Regression | An Easy Introduction & Examples (scribbr.com)icon-default.png?t=N7T8https://www.scribbr.com/statistics/simple-linear-regression/

在数据分析的学习以及未来的应用上,需要认真了解专业术语的定义。

线性回归模型是基本模型。而模型的目的是预测或者理解产生数据的机制。模型的筛选贯穿整个数据分析过程。

arxiv.org/pdf/1901.08152

此链接介绍数据模型应具有三个原则:可预测性、可计算性以及稳定性 。

变量(variable)

在上面所说的变量(variable)中,提及两个变量分别是:dependent variable(因变量) 以及independent variable(自变量)。

The words “explanatory variable” and “response variable” are often interchangeable with other terms used in research.

Cause (what changes)Effect (what’s measured)
Independent variableDependent variable
Predictor variableOutcome/criterion variable
Explanatory variableResponse variable

该表是自变量与因变量的多种同义替换 。

那么什么是自变量呢?

What is an independent variable?  

An independent variable is the variable you manipulate or vary in an experimental study to explore its effects. It’s called “independent” because it’s not influenced by any other variables in the study.

Independent variables are also called:

  • Explanatory variables (they explain an event or outcome)
  • Predictor variables (they can be used to predict the value of a dependent variable)
  • Right-hand-side variables (they appear on the right-hand side of a regression equation).

These terms are especially used in statistics, where you estimate the extent to which an independent variable change can explain or predict changes in the dependent variable.

也就是说用于预测的变量在统计中,自变量可以称为输入(input)。在本人所研究的领域中,基因+环境=表型 是一个通用公式。假定,作者我在得到一个新的基因型数据后,本人想预测植物的表型。在基因型数据输入进数据模型后,我可以得到模型预测后的表型结果。在这个过程中基因型就是自变量(independent variable),而表型就是因变量(output,dependent variable)。

在定义中,两个变量的定义本身就具有因果关系。

  • 25
    点赞
  • 13
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值