2018年12月_坚持，再坚持一下

12月 11月 10月 09月 06月 05月 04月 03月

原创安装spark集群

1.下载spark 1.1进入Apache spark 下载页面 https://archive.apache.org/dist/spark/ 选择需要的版本号以2.2.0为例，由于已经安装过hadoop、所以我们下载hadoop-2.6版本的spark 1.2需要安装的环境 JDK 1.8.0 hadoop 2.6.0 scala 2.11.0 spark 2.2.0 注意：从2.0版开始...

2018-12-19 14:05:45 153

HIERARCHICAL CLUSTERING SCHEMES

Techniques for partitioning objects into optimally homogeneous groups on the basis of empirical measures of similarity among those objects have received increasing attention in several different fields. This paper develops a useful correspondence between any hierarchical system of such clusters, and a particular type of distance measure. The correspondence gives rise to two methods of clustering that are computationally rapid and invariant under monotonic transformations of the data. In an explicitly defined sense, one method forms clusters that are optimally "connected," while the other forms clusters that are optimally "compact."

2018-10-29

聚类原始数据集

聚类数据集 %% 利用不同方法对债券样本进行聚类 %说明 %分别采用不同的方法，对数据进行聚类 %可以选择的pdist/clustering距离 % methods = {'euclidean'; 'seuclidean'; 'cityblock'; 'chebychev'; ... % 'mahalanobis'; 'minkowski'; 'cosine'; 'correlation'; ... % 'spearman'; 'hamming'; 'jaccard'}; %Y=pdist(X) 生成各数据点之间距离的行向量 %squareform(Y) 生成方阵（i，j）代表i个点与j各点之间的距离 %聚类方法： %k-means %kidx=kmeans(bonds,numClust,'distance',dist_k); %层次聚类 %hidx=clusterdata(bonds,'maxclust',numClust,'distance',dist_h,'linkage',link); %liankage产生层次聚类树 %获取距离矩阵，第二参数指定距离计算方法

2018-10-26

空空如也

TA创建的收藏夹 TA关注的收藏夹

TA关注的人

原创 安装spark集群

HIERARCHICAL CLUSTERING SCHEMES

聚类原始数据集

空空如也

原创安装spark集群