HashMap源码阅读

最新推荐文章于 2022-12-16 13:36:43 发布

double鱼

最新推荐文章于 2022-12-16 13:36:43 发布

阅读量107

点赞数

分类专栏：集合源码文章标签： java

本文链接：https://blog.csdn.net/weixin_45870082/article/details/109707376

版权

集合源码专栏收录该内容

2 篇文章 0 订阅

订阅专栏

这里看下jdk1.8源码

先看下相关的属性

static final int DEFAULT_INITIAL_CAPACITY = 1 << 4; // aka 16

    /**
     * The maximum capacity, used if a higher value is implicitly specified
     * by either of the constructors with arguments.
     * MUST be a power of two <= 1<<30.
     */
    static final int MAXIMUM_CAPACITY = 1 << 30;

    /**
     * The load factor used when none specified in constructor.
     */
    static final float DEFAULT_LOAD_FACTOR = 0.75f;

    /**
     * The bin count threshold for using a tree rather than list for a
     * bin.  Bins are converted to trees when adding an element to a
     * bin with at least this many nodes. The value must be greater
     * than 2 and should be at least 8 to mesh with assumptions in
     * tree removal about conversion back to plain bins upon
     * shrinkage.
     */
    static final int TREEIFY_THRESHOLD = 8;

    /**
     * The bin count threshold for untreeifying a (split) bin during a
     * resize operation. Should be less than TREEIFY_THRESHOLD, and at
     * most 6 to mesh with shrinkage detection under removal.
     */
    static final int UNTREEIFY_THRESHOLD = 6;

默认初始化容量为16，最大容量为2的30次方，负载因子0.75，链表转化为红黑树的阈值为8，退化链表的阈值为6.这些值是经过数学计算，概率统计得到的。比如

Because TreeNodes are about twice the size of regular nodes, we
     * use them only when bins contain enough nodes to warrant use
     * (see TREEIFY_THRESHOLD). And when they become too small (due to
     * removal or resizing) they are converted back to plain bins.  In
     * usages with well-distributed user hashCodes, tree bins are
     * rarely used.  Ideally, under random hashCodes, the frequency of
     * nodes in bins follows a Poisson distribution
     * (http://en.wikipedia.org/wiki/Poisson_distribution) with a
     * parameter of about 0.5 on average for the default resizing
     * threshold of 0.75, although with a large variance because of
     * resizing granularity. Ignoring variance, the expected
     * occurrences of list size k are (exp(-0.5) * pow(0.5, k) /
     * factorial(k)). The first values are:
     *
     * 0:    0.60653066
     * 1:    0.30326533
     * 2:    0.07581633
     * 3:    0.01263606
     * 4:    0.00157952
     * 5:    0.00015795
     * 6:    0.00001316
     * 7:    0.00000094
     * 8:    0.00000006
     * more: less than 1 in ten million

这是官方给出的解释，用的什么泊松分布，哈希桶中元素超过8的时候概率非常非常小，这个时候吧他转化为红黑树提高查询效率

直接看put操作

public V put(K key, V value) {
        return putVal(hash(key), key, value, false, true);
    }

里面只调了一个方法putVal()，先看一下里面对插入的key值是如何作hash的

static final int hash(Object key) {
        int h;
        return (key == null) ? 0 : (h = key.hashCode()) ^ (h >>> 16);
    }

这里其实很简单，因为hashcode()计算得到的是32位的一个整数，我们的哈希桶往往只能计算低4,5,6…位，因此为了让我们的数据能够在对应的低4,5,6…位分布更散列，能减少哈希碰撞所以这里把高16位和低16位进行了异或运算

看完hash再来看下putVal()方法

final V putVal(int hash, K key, V value, boolean onlyIfAbsent,
                   boolean evict) {
        Node<K,V>[] tab; Node<K,V> p; int n, i;
        if ((tab = table) == null || (n = tab.length) == 0)
            n = (tab = resize()).length;
        if ((p = tab[i = (n - 1) & hash]) == null)
            tab[i] = newNode(hash, key, value, null);
        else {
            Node<K,V> e; K k;
            if (p.hash == hash &&
                ((k = p.key) == key || (key != null && key.equals(k))))
                e = p;
            else if (p instanceof TreeNode)
                e = ((TreeNode<K,V>)p).putTreeVal(this, tab, hash, key, value);
            else {
                for (int binCount = 0; ; ++binCount) {
                    if ((e = p.next) == null) {
                        p.next = newNode(hash, key, value, null);
                        if (binCount >= TREEIFY_THRESHOLD - 1) // -1 for 1st
                            treeifyBin(tab, hash);
                        break;
                    }
                    if (e.hash == hash &&
                        ((k = e.key) == key || (key != null && key.equals(k))))
                        break;
                    p = e;
                }
            }
            if (e != null) { // existing mapping for key
                V oldValue = e.value;
                if (!onlyIfAbsent || oldValue == null)
                    e.value = value;
                afterNodeAccess(e);
                return oldValue;
            }
        }
        ++modCount;
        if (++size > threshold)
            resize();
        afterNodeInsertion(evict);
        return null;
    }

第一个if语句中是在刚插入第一个元素的时候进行了resize()，给容器分配默认初始空间16，第二个if就是在插入数据时发现对应key的hash桶中没有值，为null，所以直接进行插入操作。接下来的else中就是对应产生hash冲突时是怎么做的。

进入else里面，首先要判断一下两个key的hash是不是一致，equals是不是一致，完全一样的话后面会进行value的覆盖操作，这里就不作插入操作，再后面就是判断是否需要转成红黑树，需要就转，不需要如果key完全相等直接跳出循环，否则继续使用头插法继续插入链表节点。

double鱼

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
HashMap源码阅读

这里看下jdk1.8源码先看下相关的属性static final int DEFAULT_INITIAL_CAPACITY = 1 << 4; // aka 16 /** * The maximum capacity, used if a higher value is implicitly specified * by either of the constructors with arguments. * MUST be a power of tw
复制链接

扫一扫