reduced rank regression model
Readers interested only in the reduced rank regression model itself can skip directly to the Multivariate Reduced-Rank Regression section.
Multivariate linear regression is a natural extension of multiple linear regression in that both techniques try to interpret possible linear relationships between certain input and output variables. Multiple regression is concerned with studying to what extent the behavior of a single output variable Y is influenced by a set of r input variables X = (X_1, ···, X_r)^T.
Multivariate regression has s output variables Y = (Y_1, ···, Y_s)^T, each of whose behavior may be influenced by exactly the same set of inputs X = (X_1, ···, X_r)^T.
So, not only are the components of X correlated with each other, but in multivariate regression, the components of Y are also correlated with each other (and with the components of X). In this chapter, we are interested in estimating the regression relationship between Y and X, taking into account the various dependencies between the r-vector X and the s-vector Y and the dependencies within X and within Y.
We describe the multivariate reduced-rank regression model (RRR) (Izenman, 1975), which is an enhancement of the classical multivariate regression model and has recently received research attention in the statistics and econometrics literature. The following reasons explain the popularity of this model: RRR provides a unified approach to many of the diverse classical multivariate statistical techniques; it lends itself quite naturally to analyzing a wide variety of statistical problems involving reduction of dimensionality and the search for structure in multivariate data; and it is relatively simple to program, because the regression estimates depend only upon the sample covariance matrices of X and Y and the eigendecomposition of a certain symmetric matrix that generalizes the multiple squared correlation coefficient R^2 from multiple regression.
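The simplicity noted above can be sketched in a few lines of NumPy. The sketch below assumes the identity weighting matrix (other weightings are possible in the general RRR formulation): the full-rank least-squares coefficient is computed from the sample covariance matrices, and its rank-t version is obtained by projecting onto the top t eigenvectors of the symmetric matrix mentioned in the text. The function name and the data used in the example are hypothetical.

```python
import numpy as np

def reduced_rank_regression(X, Y, rank):
    """Rank-constrained least-squares coefficient matrix (identity weighting).

    X : (n, r) array of inputs, Y : (n, s) array of outputs.
    Returns an (s, r) coefficient matrix of rank at most `rank`.
    """
    n = X.shape[0]
    Xc = X - X.mean(axis=0)              # center the inputs
    Yc = Y - Y.mean(axis=0)              # center the outputs
    Sxx = Xc.T @ Xc / n                  # sample covariance of X
    Syx = Yc.T @ Xc / n                  # sample cross-covariance of Y with X
    C_ols = Syx @ np.linalg.inv(Sxx)     # full-rank OLS coefficient (s x r)
    M = C_ols @ Syx.T                    # symmetric (s x s) matrix Syx Sxx^{-1} Sxy
    eigvals, eigvecs = np.linalg.eigh(M) # eigenvalues in ascending order
    V = eigvecs[:, ::-1][:, :rank]       # eigenvectors of the `rank` largest eigenvalues
    return V @ (V.T @ C_ols)             # rank-constrained coefficient matrix
```

When `rank` equals s, the projection V V^T is the identity and the estimate reduces to the ordinary full-rank multivariate least-squares coefficient.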
We need to consider two cases: X fixed (nonstochastic) and X random.
The Fixed-X Case
Let Y = (Y_1, ···, Y_s)^T be a random s-vector-valued output variate with mean vector μ_Y and covariance matrix Σ_YY, and let X = (X_1, ···, X_r)^T be a fixed (nonstochastic) r-vector-valued input variate. The components
of the output vector Y will typically be continuous responses, and the
components of the input vector X may be indicator or “dummy” variables
that are set up by the researcher to identify known groupings of the data
associated with distinct subpopulations or experimental conditions.
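As a minimal illustration of such indicator inputs, the snippet below encodes a grouping factor as 0/1 "dummy" variables; the group labels and the number of groups are hypothetical.

```python
import numpy as np

# Hypothetical subpopulation label for each of n = 5 cases (3 groups)
groups = np.array([0, 1, 2, 1, 0])

# Row j of X holds the indicator ("dummy") variables identifying the
# group of case j: exactly one entry is 1, the rest are 0.
X = np.eye(3)[groups]
```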
Suppose we observe n replications,
(X_j^T, Y_j^T)^T, j = 1, 2, ..., n,
on the (r + s)-vector (X^T, Y^T)^T. We define an (r × n)-matrix 𝒳 and an (s × n)-matrix 𝒴 by 𝒳 = (X_1, ···, X_n), 𝒴 = (Y_1, ···, Y_n).
Form the mean vectors,
X̄ = n^{-1} ∑_{j=1}^{n} X_j,  Ȳ = n^{-1} ∑_{j=1}^{n} Y_j.
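The data-matrix and mean-vector constructions above can be written directly in NumPy. The dimensions and the randomly generated observations below are hypothetical; the point is only the column-wise stacking convention and the averaging.

```python
import numpy as np

# Hypothetical dimensions: r = 3 inputs, s = 2 outputs, n = 5 replications
rng = np.random.default_rng(1)
n, r, s = 5, 3, 2
X_obs = rng.standard_normal((n, r))   # row j holds X_j^T
Y_obs = rng.standard_normal((n, s))   # row j holds Y_j^T

# The (r x n) matrix (X_1, ..., X_n) and the (s x n) matrix (Y_1, ..., Y_n)
# stack the n replications as columns.
Xmat = X_obs.T
Ymat = Y_obs.T

# Mean vectors: Xbar = n^{-1} sum_j X_j, Ybar = n^{-1} sum_j Y_j
Xbar = Xmat.mean(axis=1)
Ybar = Ymat.mean(axis=1)
```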