- 博客(1)
- 资源 (2)
- 收藏
- 关注
原创 安装spark集群
1.下载spark 1.1进入Apache spark 下载页面 https://archive.apache.org/dist/spark/ 选择需要的版本号 以2.2.0为例,由于已经安装过hadoop、所以我们下载hadoop-2.6版本的spark 1.2需要安装的环境 JDK 1.8.0 hadoop 2.6.0 scala 2.11.0 spark 2.2.0 注意:从2.0版开始...
2018-12-19 14:05:45 153
HIERARCHICAL CLUSTERING SCHEMES
Techniques for partitioning objects into optimally homogeneous
groups on the basis of empirical measures of similarity among those objects
have received increasing attention in several different fields. This paper
develops a useful correspondence between any hierarchical system of such
clusters, and a particular type of distance measure. The correspondence
gives rise to two methods of clustering that are computationally rapid and
invariant under monotonic transformations of the data. In an explicitly
defined sense, one method forms clusters that are optimally "connected,"
while the other forms clusters that are optimally "compact."
2018-10-29
聚类原始数据集
聚类数据集
%% 利用不同方法对债券样本进行聚类
%说明
%分别采用不同的方法,对数据进行聚类
%可以选择的pdist/clustering距离
% methods = {'euclidean'; 'seuclidean'; 'cityblock'; 'chebychev'; ...
% 'mahalanobis'; 'minkowski'; 'cosine'; 'correlation'; ...
% 'spearman'; 'hamming'; 'jaccard'};
%Y=pdist(X) 生成各数据点之间距离的行向量
%squareform(Y) 生成方阵(i,j)代表i个点与j各点之间的距离
%聚类方法:
%k-means
%kidx=kmeans(bonds,numClust,'distance',dist_k);
%层次聚类
%hidx=clusterdata(bonds,'maxclust',numClust,'distance',dist_h,'linkage',link);
%liankage产生层次聚类树
%获取距离矩阵,第二参数指定距离计算方法
2018-10-26
空空如也
TA创建的收藏夹 TA关注的收藏夹
TA关注的人