similarity measures

The most common useful indexes have been collected by Holliday et al (Holliday, JD., Hu, C-Y. and Willett, P. (2002) Combinatorial Chemistry and High Throughput Screening 5, 155-166) These are shown in the table, and can be referred to, by name, in applications and toolkits calls which allow user defined similarity functions.

Measure RangeFormula
Cosine0.0,1.0{short description of image}
Dice0.0,1.0{short description of image}
Euclid0.0,1.0{short description of image}
Forbes0.0,∞{short description of image}
Hamman-1.0,1.0{short description of image}
Jaccard0.0,1.0{short description of image}
Kulczynski0.0,1.0{short description of image}
Manhattan1.0,0.0{short description of image}
Matching0.0,1.0{short description of image}
Pearson-1.0,1.0{short description of image}
Rogers-Tanimoto0.0,1.0{short description of image}
Russell-Rao0.0,1.0{short description of image}
Simpson0.0,1.0{short description of image}
Tanimoto0.0,1.0{short description of image}
Yule-1.0,1.0{short description of image}

Notes

  • The Tanimoto and Jaccard indexes are the same.
  • The Forbes index has no upper limit.
  • The Manhattan index is a distance = 1.0 - Matching index
  • The Kulczynski index is the mean of the individual substructure similarities
  • The Simpson index is the best of the individual substructure similarities
  • The Dice index is the ratio of the bits in common to the arithmetic mean of the number of on bits in the two items.
  • The Cosine index is the ration of the bits in common to the geometric mean of the number of on bits in the two items.

from : http://www.daylight.com/dayhtml/doc/theory/theory.finger.html

转载于:https://www.cnblogs.com/carol-wei/p/7664957.html

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值