java中String的hashcode()的实现

最新推荐文章于 2024-07-31 18:02:34 发布

qwer_bob

最新推荐文章于 2024-07-31 18:02:34 发布

阅读量3.7k

点赞数 1

分类专栏： java基础文章标签： hashCode解析 String的hashCode()

原文链接：https://www.cnblogs.com/LoganChen/p/8586310.html

版权

java基础专栏收录该内容

19 篇文章 0 订阅

订阅专栏

首先来看一下String中hashCode方法的源码

 /** Cache the hash code for the string */
    private int hash; // Default to 0


/**
     * Returns a hash code for this string. The hash code for a
     * {@code String} object is computed as
     * <blockquote><pre>
     * s[0]*31^(n-1) + s[1]*31^(n-2) + ... + s[n-1]
     * </pre></blockquote>
     * using {@code int} arithmetic, where {@code s[i]} is the
     * <i>i</i>th character of the string, {@code n} is the length of
     * the string, and {@code ^} indicates exponentiation.
     * (The hash value of the empty string is zero.)
     *
     * @return  a hash code value for this object.
     */
    public int hashCode() {
        int h = hash;
        if (h == 0 && value.length > 0) {
            char val[] = value;

            for (int i = 0; i < value.length; i++) {
                h = 31 * h + val[i];
            }
            hash = h;
        }
        return h;
    }

在String中有一个字段hash来存储该串的哈希值，在第一次调用hashCode方法时，字符串的哈希值被计算并且赋值给hash字段。之后再调用hashCode方法便可以直接取hash字段返回。

String类中的hashCode计算方法还是比较简单的，就是以31为权，每一位字符的ASCII只进行计算，用自然溢出来等效取模。

哈希计算公式可以记为s[0]*31^(n-1)+s[1]*31^(n-2)+...+s[n-1]。

主要原因是因为31是一个奇素数，所以31*i=32*i-i=(i<<5)-i，这种位移与剪发结合的计算相比一般的运算块很多。

字符串哈希可以做很多事情，通常是类似于字符串判等，判回文之类的。

但是仅仅依赖于哈希值来判断其实是不严谨的，除非能够保证不会有哈希值冲突。通常这一点很难做到。

就拿jdk中String类的哈希方法来举例，字符串“gdejicbegh”与字符串"hgebcijedg"具有相同的hashCode()返回值-801038016，并且他们具有reverse的关系。这个例子说明了用jdk中默认的hashCode方法判断字符串相等或者字符串回文都存在反例。