HashMap Source Code Analysis

蜗牛慢行

Article link: https://blog.csdn.net/suiyuanwangshi/article/details/125248305

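Introduction

HashMap extends AbstractMap and stores its data as <K, V> pairs. It is not a thread-safe container. Internally it stores data in an array combined with linked lists and red-black trees.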

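Main parameters

// ------ Static constants ------
// Default capacity: 16
static final int DEFAULT_INITIAL_CAPACITY = 1 << 4;

// Maximum capacity
static final int MAXIMUM_CAPACITY = 1 << 30;

// Default load factor
static final float DEFAULT_LOAD_FACTOR = 0.75f;

// Treeify threshold: a bin is converted from a linked list to a red-black tree
// when an element is added to a bin that already holds at least 8 nodes
static final int TREEIFY_THRESHOLD = 8;

// Untreeify threshold: during a resize, a tree bin that is split down to
// 6 nodes or fewer is converted back to a linked list
static final int UNTREEIFY_THRESHOLD = 6;

// Second precondition for treeifying: the table length must be at least this value;
// otherwise the table is resized instead
static final int MIN_TREEIFY_CAPACITY = 64;

// ------ Core fields ------

// The underlying storage: an array of bins
transient Node<K,V>[] table;
// Count of structural modifications, used for fail-fast iteration
transient int modCount;
// Resize threshold
int threshold;
// Load factor; 0.75 if not specified
final float loadFactor;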
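
The fail-fast role of modCount can be observed from the outside. The following is a minimal sketch that uses only the public java.util API; the class name FailFastDemo and the method modifiesDuringIteration are made up for this illustration. Adding a new key while iterating is a structural modification, so the iterator notices the changed count on its next step.

```java
import java.util.ConcurrentModificationException;
import java.util.HashMap;
import java.util.Map;

// Hypothetical demo class, not part of the JDK.
final class FailFastDemo {
    // Returns true if the iterator detects a structural change made during iteration.
    static boolean modifiesDuringIteration() {
        Map<String, Integer> map = new HashMap<>();
        map.put("a", 1);
        map.put("b", 2);
        try {
            for (String key : map.keySet()) {
                // Inserting a new key is a structural modification and bumps modCount.
                map.put("c", 3);
            }
        } catch (ConcurrentModificationException e) {
            return true;
        }
        return false;
    }

    public static void main(String[] args) {
        System.out.println(modifiesDuringIteration());
    }
}
```

Overwriting the value of an existing key does not count as a structural modification, which is why putVal returns before reaching ++modCount in that case.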

Constructor

/**
 * initialCapacity: the initial capacity
 * loadFactor: the load factor
**/
public HashMap(int initialCapacity, float loadFactor) {
    if (initialCapacity < 0)
        throw new IllegalArgumentException("Illegal initial capacity: " +
                                           initialCapacity);
    if (initialCapacity > MAXIMUM_CAPACITY)
        initialCapacity = MAXIMUM_CAPACITY;
    if (loadFactor <= 0 || Float.isNaN(loadFactor))
        throw new IllegalArgumentException("Illegal load factor: " +
                                           loadFactor);
    this.loadFactor = loadFactor;
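    // As this shows, the initial capacity is not simply whatever was requested.
    // Storing it in threshold is, in my opinion, not a great choice: threshold is
    // meant to be the resize threshold, not a record of the capacity, so the same
    // field means different things in different situations.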
    this.threshold = tableSizeFor(initialCapacity);
}

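The constructor lets you specify the initial capacity and the load factor.
The actual initial capacity is not necessarily the value you pass in; it ends up as a power of two (2^N), rounded up from the requested value.
The constructor does not actually allocate the array. Storage is only allocated when the first <K, V> entry is written, which saves memory for maps that stay empty.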
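
The rounding done by tableSizeFor can be sketched as follows. This is an illustration rather than a copy of the JDK source: the class name TableSizeDemo is made up, and the body follows the numberOfLeadingZeros form found in newer JDKs, whereas JDK 8 reaches the same result with a series of shift-and-or steps.

```java
// Hypothetical demo class; the method mirrors what HashMap.tableSizeFor computes.
final class TableSizeDemo {
    static final int MAXIMUM_CAPACITY = 1 << 30;

    // Smallest power of two that is >= cap, capped at MAXIMUM_CAPACITY.
    static int tableSizeFor(int cap) {
        // Produces a mask with all bits set below the highest set bit of (cap - 1).
        int n = -1 >>> Integer.numberOfLeadingZeros(cap - 1);
        return (n < 0) ? 1 : (n >= MAXIMUM_CAPACITY) ? MAXIMUM_CAPACITY : n + 1;
    }

    public static void main(String[] args) {
        for (int cap : new int[] {0, 1, 10, 16, 17, 1000}) {
            System.out.println(cap + " -> " + tableSizeFor(cap));
        }
    }
}
```

A request for 10 therefore yields a table of 16 slots, and a request for 17 yields 32.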

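Main methods

The put(K key, V value) method

/**
 * hash: the hash of the key
 * key: the key to write
 * value: the value to write
 * onlyIfAbsent: if true, an existing non-null value is not overwritten
 * evict: if false, the table is in creation mode; HashMap itself does not act on it
**/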
final V putVal(int hash, K key, V value, boolean onlyIfAbsent,
               boolean evict) {
    Node<K,V>[] tab; Node<K,V> p; int n, i;
    // On the first put the array has to be created. This confirms that the constructor
    // does not allocate storage even when a capacity is specified
    if ((tab = table) == null || (n = tab.length) == 0)
        n = (tab = resize()).length;
    // If the target slot is empty, insert the node directly
    if ((p = tab[i = (n - 1) & hash]) == null)
        tab[i] = newNode(hash, key, value, null);
    else {
        Node<K,V> e; K k;
        // If the first node in the slot has the same key, remember that node
        if (p.hash == hash &&
            ((k = p.key) == key || (key != null && key.equals(k))))
            e = p;
        // If the slot holds a red-black tree, insert the node into the tree
        else if (p instanceof TreeNode)
            e = ((TreeNode<K,V>)p).putTreeVal(this, tab, hash, key, value);
        else { // linked-list insertion, appended at the tail
            for (int binCount = 0; ; ++binCount) {
                if ((e = p.next) == null) {
                    p.next = newNode(hash, key, value, null);
                    // Check whether the list is now long enough to trigger conversion to a red-black tree
                    if (binCount >= TREEIFY_THRESHOLD - 1) // -1 for 1st
                        treeifyBin(tab, hash);
                    break;
                }
                // If a node in the list has the same key, stop searching
                if (e.hash == hash &&
                    ((k = e.key) == key || (key != null && key.equals(k))))
                    break;
                p = e;
            }
        }
        if (e != null) { // existing mapping for key
            V oldValue = e.value;
            // Decide whether to replace the old value
            if (!onlyIfAbsent || oldValue == null)
                e.value = value;
            // Hook that LinkedHashMap overrides; in HashMap it is an empty template method
            afterNodeAccess(e);
            return oldValue;
        }
    }
    ++modCount;
    // If the size now exceeds the resize threshold, resize.
    if (++size > threshold)
        resize();
    afterNodeInsertion(evict);
    return null;
}

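The put operation never checks whether the key or the value is null, so HashMap supports a null key. This is easiest to see in the hash method, (key == null) ? 0 : (h = key.hashCode()) ^ (h >>> 16);, which maps a null key to hash 0, so the entry always lands in slot 0.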
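
To make the hash expression and the slot calculation concrete, here is a small sketch. The class HashSpreadDemo and the helper indexFor are names invented for this example; the hash body is the expression quoted above and the index formula is the (n - 1) & hash used in putVal.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical demo class, not part of the JDK.
final class HashSpreadDemo {
    // null maps to 0; otherwise the high 16 bits are XORed into the low 16 bits.
    static int hash(Object key) {
        int h;
        return (key == null) ? 0 : (h = key.hashCode()) ^ (h >>> 16);
    }

    // Slot index for a table whose length n is a power of two.
    static int indexFor(int hash, int n) {
        return (n - 1) & hash;
    }

    public static void main(String[] args) {
        Map<String, String> map = new HashMap<>();
        map.put(null, "value stored under the null key");
        System.out.println(map.get(null));

        // A null key always ends up in slot 0.
        System.out.println(indexFor(hash(null), 16));

        // Integer 65536 has only bit 16 set; the XOR step copies it down to bit 0,
        // so the key lands in slot 1 instead of colliding in slot 0.
        System.out.println(indexFor(hash(65536), 16));
    }
}
```

Folding the high bits down matters because the index only looks at the low bits of the hash when the table is small.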

treeifyBin(Node<K,V>[] tab, int hash): converting a linked list to a red-black tree

final void treeifyBin(Node<K,V>[] tab, int hash) {
    int n, index; Node<K,V> e;
    // If the table is shorter than the minimum treeify capacity, resize instead
    if (tab == null || (n = tab.length) < MIN_TREEIFY_CAPACITY)
        resize();
    else if ((e = tab[index = (n - 1) & hash]) != null) {
        // Replace the list nodes with TreeNodes, then build and balance the red-black tree
        TreeNode<K,V> hd = null, tl = null;
        do {
            TreeNode<K,V> p = replacementTreeNode(e, null);
            if (tl == null)
                hd = p;
            else {
                p.prev = tl;
                tl.next = p;
            }
            tl = p;
        } while ((e = e.next) != null);
        if ((tab[index] = hd) != null)
            hd.treeify(tab);
    }
}

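A second precondition for converting to a red-black tree is that the table length is at least 64; otherwise the table is resized instead.
There is no documented reason for choosing exactly 64. The source comment only says the value "Should be at least 4 * TREEIFY_THRESHOLD", that is at least 32, and 64 satisfies that bound. What can be explained is why such a limit exists at all instead of treeifying immediately: when the table is small, treeifying right away means that as more data arrives the table soon has to be resized, and after the resize the tree will quite likely have to be turned back into a linked list. That repeated conversion is wasted work and hurts performance.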
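
The two conditions can be combined into one small decision function. This is only a sketch of the logic spread across putVal and treeifyBin; the class TreeifyDecisionDemo, the enum Action and the method onAppend are hypothetical names, and the real code does not have such a helper.

```java
// Hypothetical demo class that restates the treeify decision in one place.
final class TreeifyDecisionDemo {
    static final int TREEIFY_THRESHOLD = 8;
    static final int MIN_TREEIFY_CAPACITY = 64;

    enum Action { NONE, RESIZE, TREEIFY }

    // nodesBeforeInsert: number of nodes already in the bin before the new node is appended.
    static Action onAppend(int nodesBeforeInsert, int tableLength) {
        if (nodesBeforeInsert < TREEIFY_THRESHOLD) {
            return Action.NONE;      // bin is still short, keep the linked list
        }
        if (tableLength < MIN_TREEIFY_CAPACITY) {
            return Action.RESIZE;    // small table: spread entries out instead
        }
        return Action.TREEIFY;       // long bin in a large enough table
    }

    public static void main(String[] args) {
        System.out.println(onAppend(7, 64));
        System.out.println(onAppend(8, 16));
        System.out.println(onAppend(8, 64));
    }
}
```

In putVal the check binCount >= TREEIFY_THRESHOLD - 1 fires when the loop has already walked past eight existing nodes, which is why the sketch compares the count before the insert against 8.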

Resizing: resize()

final Node<K,V>[] resize() {
    Node<K,V>[] oldTab = table;
    int oldCap = (oldTab == null) ? 0 : oldTab.length;
    int oldThr = threshold;
    int newCap, newThr = 0;
    // The table has already been allocated
    if (oldCap > 0) {
        if (oldCap >= MAXIMUM_CAPACITY) {
            threshold = Integer.MAX_VALUE;
            return oldTab;
        }
        else if ((newCap = oldCap << 1) < MAXIMUM_CAPACITY &&
                 oldCap >= DEFAULT_INITIAL_CAPACITY)
            newThr = oldThr << 1; // double threshold
    }
    // This branch handles the case where an initial capacity was specified in the constructor.
    else if (oldThr > 0) // initial capacity was placed in threshold
        newCap = oldThr;
    else {  // default capacity
        newCap = DEFAULT_INITIAL_CAPACITY;
        newThr = (int)(DEFAULT_LOAD_FACTOR * DEFAULT_INITIAL_CAPACITY);
    }
    if (newThr == 0) {
        float ft = (float)newCap * loadFactor;
        newThr = (newCap < MAXIMUM_CAPACITY && ft < (float)MAXIMUM_CAPACITY ?
                  (int)ft : Integer.MAX_VALUE);
    }
    threshold = newThr;
    @SuppressWarnings({"rawtypes","unchecked"})
    Node<K,V>[] newTab = (Node<K,V>[])new Node[newCap];
    table = newTab;
    if (oldTab != null) {
        for (int j = 0; j < oldCap; ++j) {
            Node<K,V> e;
            if ((e = oldTab[j]) != null) {
                oldTab[j] = null;
                // The slot holds a single node
                if (e.next == null)
                    newTab[e.hash & (newCap - 1)] = e;
                else if (e instanceof TreeNode) // red-black tree
                    ((TreeNode<K,V>)e).split(this, newTab, j, oldCap);
                else { // linked list
                    Node<K,V> loHead = null, loTail = null;
                    Node<K,V> hiHead = null, hiTail = null;
                    Node<K,V> next;
                    do {
                        next = e.next;
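                        // If this bit is 0, the node stays at the same (low) index after resizing. Example:
                        // with a table length of 16, hashes 4 and 20 both land in slot 4.
                        // After resizing to 32, hash 4 stays in slot 4, while hash 20 moves to the high slot 20 (4 + 16).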
                        if ((e.hash & oldCap) == 0) {
                            if (loTail == null)
                                loHead = e;
                            else
                                loTail.next = e;
                            loTail = e;
                        }
                        else { // high half
                            if (hiTail == null)
                                hiHead = e;
                            else
                                hiTail.next = e;
                            hiTail = e;
                        }
                    } while ((e = next) != null);
                    if (loTail != null) {
                        loTail.next = null;
                        newTab[j] = loHead;
                    }
                    if (hiTail != null) {
                        hiTail.next = null;
                        newTab[j + oldCap] = hiHead;
                    }
                }
            }
        }
    }
    return newTab;
}

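Every resize doubles the capacity, so the result is still a power of two. The benefit is that when entries are moved after a resize, their new positions are computed purely with bitwise operations, which is efficient.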
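
The low/high split can be checked with a few lines of arithmetic. In this sketch the class ResizeSplitDemo and its two methods are invented for illustration; newIndex applies the (e.hash & oldCap) == 0 test from resize(), and recomputed calculates the slot from scratch against the doubled table so the two results can be compared.

```java
// Hypothetical demo class, not part of the JDK.
final class ResizeSplitDemo {
    // New slot of an entry when a table of length oldCap (a power of two) doubles.
    static int newIndex(int hash, int oldCap) {
        int oldIndex = hash & (oldCap - 1);
        // Only the bit with value oldCap decides: 0 keeps the slot, 1 moves it up by oldCap.
        return ((hash & oldCap) == 0) ? oldIndex : oldIndex + oldCap;
    }

    // The same slot computed directly against the doubled table.
    static int recomputed(int hash, int oldCap) {
        return hash & ((oldCap << 1) - 1);
    }

    public static void main(String[] args) {
        int oldCap = 16;
        for (int hash : new int[] {4, 20, 36, 52}) {
            System.out.println(hash + ": " + (hash & (oldCap - 1))
                    + " -> " + newIndex(hash, oldCap)
                    + " (recomputed " + recomputed(hash, oldCap) + ")");
        }
    }
}
```

All four hashes share slot 4 in a table of 16; after doubling, 4 and 36 stay in slot 4 while 20 and 52 move to slot 20.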

Lookup by key

/**
 * hash: the hash of the key
 * key: the key to look up
**/
final Node<K,V> getNode(int hash, Object key) {
    Node<K,V>[] tab; Node<K,V> first, e; int n; K k;
    if ((tab = table) != null && (n = tab.length) > 0 &&
        (first = tab[(n - 1) & hash]) != null) {
        if (first.hash == hash && // the first node in the slot already holds the key we want
            ((k = first.key) == key || (key != null && key.equals(k))))
            return first;
        if ((e = first.next) != null) {
            if (first instanceof TreeNode) // search in the red-black tree
                return ((TreeNode<K,V>)first).getTreeNode(hash, key);
            do { // search in the linked list
                if (e.hash == hash &&
                    ((k = e.key) == key || (key != null && key.equals(k))))
                    return e;
            } while ((e = e.next) != null);
        }
    }
    return null;
}
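
The linked-list part of the lookup can be reproduced in isolation. The sketch below is a simplified stand-in, not the JDK code: BucketLookupDemo, its Node class, find and lookup are all hypothetical, and the chain is built by hand from the keys "Aa" and "BB", which have the same hashCode and therefore collide.

```java
// Hypothetical demo class with a hand-built bucket chain.
final class BucketLookupDemo {
    static final class Node {
        final int hash;
        final String key;
        final String value;
        final Node next;

        Node(int hash, String key, String value, Node next) {
            this.hash = hash;
            this.key = key;
            this.value = value;
            this.next = next;
        }
    }

    static int hash(Object key) {
        int h;
        return (key == null) ? 0 : (h = key.hashCode()) ^ (h >>> 16);
    }

    // Walks one chain: compare the cached hash first, then identity, then equals.
    static Node find(Node first, int hash, Object key) {
        for (Node e = first; e != null; e = e.next) {
            if (e.hash == hash && (e.key == key || (key != null && key.equals(e.key)))) {
                return e;
            }
        }
        return null;
    }

    // Looks a key up in a fixed two-node chain; returns null when the key is absent.
    static String lookup(String key) {
        Node chain = new Node(hash("Aa"), "Aa", "first",
                new Node(hash("BB"), "BB", "second", null));
        Node found = find(chain, hash(key), key);
        return (found == null) ? null : found.value;
    }

    public static void main(String[] args) {
        System.out.println(lookup("Aa"));
        System.out.println(lookup("BB"));
        System.out.println(lookup("CC"));
    }
}
```

Comparing the cached hash before calling equals lets the loop skip most non-matching nodes with a cheap integer comparison; equals only runs for nodes whose hash already matches.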