非线性PCA

326 篇文章 2 订阅
183 篇文章 6 订阅

Nonlinear PCA

Nonlinear PCA toolbox for MATLAB

by Matthias Scholz
Nonlinear PCA (NLPCA) - Autoencoder neural network
Auto-associative neural network (Autoencoder)

Nonlinear principal component analysis (NLPCA) is commonly seen as a nonlinear generalization of standard principal component analysis (PCA). It generalizes the principal components from straight lines to curves (nonlinear). Thus, the subspace in the original data space which is described by all nonlinear components is also curved.
Nonlinear PCA can be achieved by using a neural network with an autoassociative architecture also known as autoencoder, replicator network, bottleneck or sandglass type network. Such autoassociative neural network is a multi-layer perceptron that performs an identity mapping, meaning that the output of the network is required to be identical to the input. However, in the middle of the network is a layer that works as a bottleneck in which a reduction of the dimension of the data is enforced. This bottleneck-layer provides the desired component values (scores).

linear PCA
PCA
The left plot shows standard PCA applied to a simple two-dimensional data set. The two resulting components are plotted as a grid which illustrates the linear PCA transformation. The plot on the right shows nonlinear PCA (autoencoder neural network) applied to a 3/4 circle with noise. Again, the two components are plotted as a grid, but the components are curved which illustrates the nonlinear transformation of NLPCA.
nonlinear PCA
NLPCA - Nonlinear PCA - Autoencoder neural network
 
PCASpectral dataNLPCA - Nonlinear PCA - Autoencoder

Here, NLPCA is applied to 19-dimensional spectral data representing equivalent widths of 19 absorption lines of 487 stars, available atwww.cida.ve. The figure in the middle shows a visualisation of the data by using the first three components of standard PCA. Data of different colors belong to different spectral groups of stars. The first three components of linear PCA and of NLPCA are represented by grids in the left and right figure, respectively. Each grid represents the two-dimensional subspace given by two components while the third one is set to zero. Thus, the grids represent the new coordinate system of the transformation. In contrast to linear PCA (left) which does not describe the nonlinear characteristics of the data, NLPCA gives a nonlinear (curved) description of the data, shown on the right.


Publications by Matthias Scholz

see all publications: [Matthias Scholz: publications]


Related algorithms

See also:

来源:

Nonlinear PCA: www.nlpca.org


代码下载:
 Nonlinear PCA toolbox for MATLABby Matthias Scholz 

nlpca

Nonlinear principal component analysis (NLPCA)

Nonlinear PCA - Nonlinear principal component analysis (NLPCA)

Syntax

[pc, net] = nlpca(data, k)

pc = nlpca_get_components(net, data)
data_reconstruction = nlpca_get_data(net, pc)

Description

The nonlinear PCA is based on an auto-associative neural network (autoencoder), see also: www.nlpca.org .

pc = nlpca(data,k) extracts k nonlinear components from the data set. pc represents the estimated component values (scores).

net is a data structure explaining the neural network parameters for the nonlinear transformation from data space to component space and reverse.
net can be used in nlpca_get_components andnlpca_get_data to obtain component values (scores) for new data or reconstructed data for any component value.


Example

In this example nonlinear PCA (circular PCA) is applied to artificial data of a noisy circle.

   % generate circular data 
   t=linspace(-pi , +pi , 100);  % angular value t=-pi,...,+pi
   data = [sin(t);cos(t)];       % circle
   data = data + 0.2*randn(size(data));    % add noise

   % nonlinear PCA (circular PCA, inverse network architecture)
   [c,net]=nlpca(data, 1,  'type','inverse',  'circular','yes' );
                                
   % plot components             
   nlpca_plot(net)  
  

See also the demos of the toolbox below.


Download

The NLPCA toolbox is distributed under the GNU General Public License.
NLPCA can be downloaded as single package or individual files:

NLPCA-0.88.zipNLPCA package (all files), version 0.88
 
package contains:
 
    nlpca.mmain program - component extraction
    nlpca_get_data.mto reconstruct data from new component values
    nlpca_get_components.mto estimate component values from new data
    nlpca_plot.mto plot the components
 
    demo_hierarchical_NLPCA_StarData.m   demo of hierarchical nonlinear PCA
    demo_circular_PCA.mdemo of circular units (Circular PCA)
    demo_inverse_NLPCA.mdemo of inverse network architecture
    demo_missing_data.mdemo of missing data estimation

References

Validation:Validation of nonlinear PCA
Matthias Scholz
Neural Processing Letters, 2012
[ pdf (pre-print) | pdf (Neural Process Lett) |poster RECOMB 2012 |Matlab code]
review (book chapter):Nonlinear principal component analysis: neural network models and applications.
Matthias Scholz, Martin Fraunholz, and Joachim Selbig.
In Principal Manifolds for Data Visualization and Dimension Reduction, edited by Alexander N. Gorban, Balázs Kégl, Donald C. Wunsch, and Andrei Zinovyev. Volume 58 of LNCSE, pages 44-67. Springer Berlin Heidelberg, 2007.
[ pdf (all book chapters) | pdf (Springer) | entire book (Springer)]
Circular PCA:Analysing periodic phenomena by circular PCA.
Matthias Scholz.
In S. Hochreiter and R. Wagner, editors, Proceedings of the Conference on Bioinformatics Research and Development BIRD'07, LNCS/LNBI Vol. 4414, pages 38-47. Springer-Verlag Berlin Heidelberg, 2007.
[ pdf (final version at Springer) |pdf (author's pre-version)]
Inverse model, missing data:Non-linear PCA: a missing data approach.
Matthias Scholz, Fatma Kaplan, Charles L. Guy, Joachim Kopka, and Joachim Selbig.
Bioinformatics 21(20):3887-3895. 2005.
[ pdf (final version) |pdf (pre-version in colour) ]
Hierarchical NLPCA:Nonlinear PCA: a new hierarchical approach.
Matthias Scholz and Ricardo Vigário.
In M. Verleysen, editor, Proceedings ESANN. 2002.
[ pdf (pre-print version) |pdf (ESANN)


  • 2
    点赞
  • 4
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值