Scikit Source Code Reading(2015.05.31)

Today’s Job

Today’s job is main about the source reading of plot_color_quantization.py and k_means_.py under scikit-learn-0.15.2\sklearn\cluster in scikit-learn-0.15.2.

Gains

  1. pairwise_distances_argmin:
    Compute minimum distances between one point and a set of points.
  2. shuffle:
    Shuffle arrays or sparse matrices in a consistent way
  3. Lloyd’s algorithm and Vorlonoi Diagram
  4. check_random_state(seed):
    Turn seed into a np.random.RandomState instance
  5. inertia:
    Sum of distances of samples to their closest cluster center.
  6. labels assignment is also called the E-step of EM
    computation of the means is also called the M-step of EM
  7. _tolerance(X, tol):
    Return a tolerance which is independent of the dataset

Quesions to be solved

  1. def _k_init(X, n_clusters, x_squared_norms, random_state, n_local_trials=None):
    “”“Init n_clusters seeds according to k-means++
    Selects initial cluster centers for k-mean clustering in a smart way
    to speed up convergence. see: Arthur, D. and Vassilvitskii, S.
    “k-means++: the advantages of careful seeding”. ACM-SIAM symposium
    on Discrete algorithms. 2007

    Version ported from http://www.stanford.edu/~darthur/kMeansppTest.zip,
    which is the implementation used in the aforementioned paper.

Grid Search

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值