HashMap的源码分析

最新推荐文章于 2024-09-13 19:36:07 发布

飞鱼he兰猪

最新推荐文章于 2024-09-13 19:36:07 发布

阅读量128

点赞数 2

分类专栏： Java 文章标签：哈希算法散列表数据结构 java Powered by 金山文档

本文链接：https://blog.csdn.net/ting_happy/article/details/128771261

版权

Java 专栏收录该内容

9 篇文章 0 订阅

订阅专栏

在看这篇文章之前大家可以先去看看这篇文章 HashMap中的hashcode与equals ！！！

上述文章提到，HashMap底层存储元素的方式是运用开散列的方式，也就是利用链表数组来存储元素，那今天我们拿到源码来看看是不是这样。

打开源码，我们可以看到，在HashMap类中，存在一个静态内部类：

同时我们能看到，源码中定义了成员变量Node数组：

从上述静态内部类的部分源码中，可以看出其含有链表的功能，可以看出HashMap确实存在一个链表数组，那HashMap是否确实是将其用来存储元素了呢？

那我们来看看HashMap中存储元素的代码：

可以看到，其存入元素又调用了putVal方法，其中参数中的hash（key）方法源码：

其详细作用在开头文章详细介绍过。让我们继续看看putVal的源码：

final V putVal(int hash, K key, V value, boolean onlyIfAbsent,
                   boolean evict) {
        Node<K,V>[] tab; Node<K,V> p; int n, i;
        if ((tab = table) == null || (n = tab.length) == 0)
            n = (tab = resize()).length;
        if ((p = tab[i = (n - 1) & hash]) == null)
            tab[i] = newNode(hash, key, value, null);
        else {
            Node<K,V> e; K k;
            if (p.hash == hash &&
                ((k = p.key) == key || (key != null && key.equals(k))))
                e = p;
            else if (p instanceof TreeNode)
                e = ((TreeNode<K,V>)p).putTreeVal(this, tab, hash, key, value);
            else {
                for (int binCount = 0; ; ++binCount) {
                    if ((e = p.next) == null) {
                        p.next = newNode(hash, key, value, null);
                        if (binCount >= TREEIFY_THRESHOLD - 1) // -1 for 1st
                            treeifyBin(tab, hash);
                        break;
                    }
                    if (e.hash == hash &&
                        ((k = e.key) == key || (key != null && key.equals(k))))
                        break;
                    p = e;
                }
            }
            if (e != null) { // existing mapping for key
                V oldValue = e.value;
                if (!onlyIfAbsent || oldValue == null)
                    e.value = value;
                afterNodeAccess(e);
                return oldValue;
            }
        }
        ++modCount;
        if (++size > threshold)
            resize();
        afterNodeInsertion(evict);
        return null;
    }

一进去我们就可以看到，代码在检测成员变量也就是一开始的Nade数组是否为null或者长度是否为0，如果满足任意一个条件，则调用resize()方法，那resize()方法是干嘛的呢？让我们来看resize（）的源码：

可以看到，其定义了oldCap（旧的容量），oldThr，其中oldThr=threshold（容量乘以负载因子），继续往下看，如果旧的容量大于定义的数组的最大值，则成员变量threshold赋值为Integer的最大值，返回原数组，未作任何改变。

而当oldCap<<1(相当于*2）小于定义的最大值，并且oldCap的容量大于等于定义的最小值，newCap=oldCap<<1,很容易我们就能发现resize()函数的功能是对Node数组进行扩容，并且正常情况下是进行二倍扩容，并且如果oldCap为0时，newCap=DEFAULT_INITIAL_CAPACITY ,在知道了resize（）原理之后，我们来继续回过头来看putVal源码：

final V putVal(int hash, K key, V value, boolean onlyIfAbsent,
                   boolean evict) {
        Node<K,V>[] tab; Node<K,V> p; int n, i;
        if ((tab = table) == null || (n = tab.length) == 0)
            n = (tab = resize()).length;
        if ((p = tab[i = (n - 1) & hash]) == null)
            tab[i] = newNode(hash, key, value, null);
        else {
            Node<K,V> e; K k;
            if (p.hash == hash &&
                ((k = p.key) == key || (key != null && key.equals(k))))
                e = p;
            else if (p instanceof TreeNode)
                e = ((TreeNode<K,V>)p).putTreeVal(this, tab, hash, key, value);
            else {
                for (int binCount = 0; ; ++binCount) {
                    if ((e = p.next) == null) {
                        p.next = newNode(hash, key, value, null);
                        if (binCount >= TREEIFY_THRESHOLD - 1) // -1 for 1st
                            treeifyBin(tab, hash);
                        break;
                    }
                    if (e.hash == hash &&
                        ((k = e.key) == key || (key != null && key.equals(k))))
                        break;
                    p = e;
                }
            }
            if (e != null) { // existing mapping for key
                V oldValue = e.value;
                if (!onlyIfAbsent || oldValue == null)
                    e.value = value;
                afterNodeAccess(e);
                return oldValue;
            }
        }
        ++modCount;
        if (++size > threshold)
            resize();
        afterNodeInsertion(evict);
        return null;
    }

即如果原Node数组为空或者数组长度为0时，对其进行扩容，（也就是说，HashMap并不是在创建其实例时就将数组创建好了，而是在第一次put的时候，才为数组开辟好空间！！！）

由于hash值决定了元素在数组中存储的位置，所以接下来就对Node数组中hash%len位置上的元素进行检查，看此位置是否放置了元素，那明明tab[hash%len]就可以解决的事情，为什么源码要写成tab[(n-1)&hash]呢？我猜测这样写是因为位运算的速度远远大于普通的加减乘除的速度，毕竟大佬的思维，谁又能猜的准呢？而n为2的幂时，(n-1)&hash刚好与hash%len的值一样。

如果此位置为空，则在此位置new Node，放入元素，故HashMap底层确实采用了一个链表数组来存储元素，并且这个链表长度到达了一定程度可以转换成红黑树。

而数组初始容量是多少，负载因子是多少，链表长度达到多少的时候能转换成红黑树？源码中都有写，但由于不同版本的jdk可能会出现不同的设定，故这里给出的源码值仅供参考，我这里是jdk1.8版本的设定，大家应该还是以自己电脑上的源码为准：