Common Distance Measures in Machine Learning

If $x_1, x_2 \in \mathbb{R}^n$, then:
Minkowski Distance

$d_{12} = \sqrt[p]{\sum_{k=1}^{n} |x_{1k} - x_{2k}|^{p}}, \quad p > 0$
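As a quick sketch (the helper function and sample vectors below are illustrative, not from the original post), the general form can be computed with NumPy:

```python
import numpy as np

def minkowski(x1, x2, p):
    # p-th root of the sum of |x1k - x2k|**p over all components
    diff = np.abs(np.asarray(x1, dtype=float) - np.asarray(x2, dtype=float))
    return np.sum(diff ** p) ** (1.0 / p)

print(minkowski([1, 2, 3], [4, 5, 7], p=3))
```

With $p = 1$ this reduces to the Manhattan distance and with $p = 2$ to the Euclidean distance.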

Euclidean Distance
L2 norm

$d_{12} = \sqrt{\sum_{k=1}^{n} (x_{1k} - x_{2k})^{2}}$ or $d_{12} = \sqrt{(x_1 - x_2)^{T}(x_1 - x_2)}$
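A minimal sketch showing that the component form and the vector form give the same value (sample vectors are illustrative):

```python
import numpy as np

x1 = np.array([1.0, 2.0, 3.0])
x2 = np.array([4.0, 5.0, 7.0])

d_sum = np.sqrt(np.sum((x1 - x2) ** 2))   # component form
d_vec = np.sqrt((x1 - x2) @ (x1 - x2))    # vector form (x1-x2)^T (x1-x2)
print(d_sum, d_vec)
```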

Standardized / Weighted Euclidean Distance

$d_{12} = \sqrt{\sum_{k=1}^{n} \left(\dfrac{x_{1k} - x_{2k}}{S_k}\right)^{2}}$

where $S_k$ is the standard deviation of the $k$-th component across the samples.
from numpy import *
vectormat = mat([[1, 2, 3], [4, 5, 6]])
v12 = vectormat[0] - vectormat[1]
sk = std(vectormat, axis=0)       # per-feature standard deviation S_k
normv12 = v12 / sk                # scale each component difference by S_k
print(sqrt(normv12 * normv12.T))

Manhattan Distance
L1 norm

$d_{12} = \sum_{k=1}^{n} |x_{1k} - x_{2k}|$
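A one-line NumPy sketch (sample vectors are illustrative):

```python
import numpy as np

x1 = np.array([1, 2, 3])
x2 = np.array([4, 5, 7])
# L1 norm: sum of absolute component differences
d = np.sum(np.abs(x1 - x2))
print(d)
```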

Chebyshev Distance
L∞ norm

$d_{12} = \max_{i} |x_{1i} - x_{2i}|$
from numpy import *
vector1 = mat([1, 2, 3])
vector2 = mat([4, 5, 7])
print(abs(vector1 - vector2).max())   # largest component difference

Cosine Similarity

$\cos\theta = \dfrac{\sum_{k=1}^{n} x_{1k} x_{2k}}{\sqrt{\sum_{k=1}^{n} x_{1k}^{2}} \sqrt{\sum_{k=1}^{n} x_{2k}^{2}}}$
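A minimal NumPy sketch (vectors are illustrative):

```python
import numpy as np

x1 = np.array([1.0, 2.0, 3.0])
x2 = np.array([4.0, 5.0, 7.0])
# dot product over the product of the two L2 norms
cos_theta = np.dot(x1, x2) / (np.linalg.norm(x1) * np.linalg.norm(x2))
print(cos_theta)
```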

Hamming Distance
In information theory, the Hamming distance between two strings of equal length is the number of positions at which the corresponding symbols differ; equivalently, it is the minimum number of substitutions required to change one string into the other. (from Wikipedia)

from numpy import *
matV = mat([[1, 1, 0, 1, 0, 1, 0, 0, 1], [0, 1, 1, 0, 0, 0, 1, 1, 1]])
smstr = nonzero(matV[0] - matV[1])   # indices where the two strings differ
print(shape(smstr[0])[0])            # count of differing positions

Jaccard Similarity Coefficient
Given two sets, A and B, the Jaccard similarity coefficient is defined as

$J(A,B) = \dfrac{|A \cap B|}{|A \cup B|}$

Jaccard Distance

$J_\delta(A,B) = 1 - J(A,B) = \dfrac{|A \cup B| - |A \cap B|}{|A \cup B|}$
from numpy import *
import scipy.spatial.distance as dist
matV = mat([[1, 1, 0, 1, 0, 1, 0, 0, 1], [0, 1, 1, 0, 0, 0, 1, 1, 1]])
print(dist.pdist(matV, 'jaccard'))   # pairwise Jaccard distance between the rows

Mahalanobis Distance
Given m sample vectors $X_1, \ldots, X_m$ with mean $\mu$ and covariance matrix $S$, the Mahalanobis distance between a sample vector $X$ and $\mu$ is defined as

$D(X) = \sqrt{(X - \mu)^{T} S^{-1} (X - \mu)}$

and that between sample vectors $X_i$ and $X_j$ is

$D(X_i, X_j) = \sqrt{(X_i - X_j)^{T} S^{-1} (X_i - X_j)}$
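A minimal NumPy sketch using a hypothetical sample matrix (not from the original post); `np.cov` with `rowvar=False` treats each row as one sample vector:

```python
import numpy as np

# Hypothetical samples: rows are the sample vectors X1..Xm
X = np.array([[3.0, 4.0], [5.0, 6.0], [2.0, 2.0], [8.0, 4.0]])
mu = X.mean(axis=0)
S = np.cov(X, rowvar=False)      # sample covariance matrix
S_inv = np.linalg.inv(S)

def mahalanobis(x, y):
    d = x - y
    return np.sqrt(d @ S_inv @ d)

print(mahalanobis(X[0], mu))     # distance of X1 from the mean
print(mahalanobis(X[0], X[1]))   # distance between X1 and X2
```

When $S$ is the identity matrix, this reduces to the ordinary Euclidean distance.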