关于HashMap你需要知道的一些细节

最新推荐文章于 2022-06-01 10:06:20 发布

JasonGaoH

最新推荐文章于 2022-06-01 10:06:20 发布

阅读量1.2k

点赞数 3

分类专栏： java基础 Java集合源码分析文章标签： HashMap resize 容量和负载因子树化改造扰动函数

本文链接：https://blog.csdn.net/H_Gao/article/details/90746413

版权

本文详细介绍了HashMap的重要参数——容量和负载因子，以及put方法的实现。当元素个数超过容量*负载因子时，HashMap会进行resize。如果链表长度超过8，HashMap会进行树化改造。此外，还探讨了hash函数的设计，包括扰动函数的作用，以减少哈希碰撞。最后，解释了resize的实现原理，如何在扩容时保持元素位置的高效迁移。

摘要由CSDN通过智能技术生成

本文的公众号文章链接：
关于HashMap你需要知道的一些细节
在官方文档中的描述：

Hash table based implementation of the Map interface. This implementation provides all of the optional map operations, and permits null values and the null key. (The HashMap class is roughly equivalent to Hashtable, except that it is unsynchronized and permits nulls.) This class makes no guarantees as to the order of the map; in particular, it does not guarantee that the order will remain constant over time.

基于Map接口的哈希表的实现。此实现提供所有可选的映射操作，并允许空值和空键。（HashMap类大致等同于HashTable，只是它不同步并且允许空值。）这个类不保证映射的顺序；特别是，它不保证顺序会随着时间的推移而保持不变。

两个重要的参数

在HashMap中有两个很重要的参数，容量(Capacity)和负载因子(Load factor)

Initial capacity The capacity is the number of buckets in the hash table, The initial capacity is simply the capacity at the time the hash table is created.

Load factor The load factor is a measure of how full the hash table is allowed to get before its capacity is automatically increased.

简单的说，Capacity就是buckets的数目，Load factor就是buckets填满程度的最大比例。如果对迭代性能要求很高的话不要把capacity设置过大，也不要把load factor设置过小。当bucket填充的数目（即hashmap中元素的个数）大于capacity*load factor时就需要调整buckets的数目为当前的2倍。

首先，我们来一起看看 HashMap 内部的结构，它可以看作是数组(Node[] table)和链表结合组成的复合结构，数组被分为一个个桶(bucket)，通过哈希值决定了键值对在这个数组的寻址;哈希值相同的键值对，则以链表形式存储，你可以参考下面的示意图。这里需要注意的是，如果链表大小超过阈值(TREEIFY_THRESHOLD, 8)，图中的链表就会被改造为树形结构。

put函数的实现

接着来看 put 方法实现:

put函数大致的思路为：

对key的hashCode()做hash，然后再计算index;
如果没碰撞直接放到bucket里；
如果碰撞了，以链表的形式存在buckets后；
如果碰撞导致链表过长(大于等于TREEIFY_THRESHOLD)，就把链表转换成红黑树；
如果节点已经存在就替换old value(保证key的唯一性)
如果bucket满了(超过load factor*current capacity)，就要resize。

public V put(K key, V value) {
   
    // 对key的hashCode()做hash
    return putVal(hash(key), key, value, false, true);
}
final V putVal(int hash, K key, V value, boolean onlyIfAbsent,
               boolean evict) {
   
    Node<K,V>[] tab; Node<K,V> p; int n, i;
    // tab为空则创建
    if ((tab = table) == null || (n = tab.length) == 0)
        n = (tab = resize()).length;
    // 计算index，并对null做处理
    if ((p = tab[i = (n - 1) & hash]) == null)
        tab[i] = newNode(hash,