HashMap详解(一)

Alex_ChuTT

已于 2022-07-06 18:04:17 修改

阅读量245

点赞数 1

分类专栏： Java基础文章标签：数据结构 hashmap java

于 2020-07-21 19:00:32 首次发布

本文链接：https://blog.csdn.net/u012346890/article/details/107489237

版权

Java基础专栏收录该内容

11 篇文章 0 订阅

订阅专栏

字典

小学时候的新华字典，通过偏旁部首找到了某个字，HashMap原理和这差不多，有的编程语言就命名这种数据结构叫字典。

计算机的物理存储结构(数组、链表)

数组:

采用一段连续的存储单元来存储数据。对于指定下标的查找，时间复杂度为O(1)；通过给定值进行查找，需要遍历数组，逐一比对给定关键字和数组元素，时间复杂度为O(n)，当然，对于有序数组，则可采用二分查找，插值查找等方式，可将查找复杂度降低为O(logn)；对于一般的插入删除操作，涉及到数组元素的移动，其平均复杂度为O(n)

链表

对于链表的新增，删除等操作（在找到指定操作位置后），仅需处理结点间的引用即可，时间复杂度为O(1)，而查找操作需要遍历链表逐一进行比对，复杂度为O(n)

最简版结构图

在这里插入图片描述

左边是数组，右边是链表
每一个entry也就是键值对，里面除了key value之外，还会包含hash值，next指针，可以参考这个:
在这里插入图片描述

//JDK1.7的结构，JDK1.8已经改为Node<K,V> implements Map.Entry<K,V> {},但原理差不多
    static class Entry<K,V> implements Map.Entry<K,V> {
        final K key;
        V value;
        Entry<K,V> next;//存储指向下一个Entry的引用，单链表结构
        int hash;//对key的hashcode值进行hash运算后得到的值，存储在Entry，避免重复计算

        /**
         * Creates new entry.
         */
        Entry(int h, K k, V v, Entry<K,V> n) {
            value = v;
            next = n;
            key = k;
            hash = h;
        }

继承关系

public class HashMap<K,V> extends AbstractMap<K,V>
    implements Map<K,V>, Cloneable, Serializable {

在这里插入图片描述

分析构造函数以及几个重要参数

int size;
实际存储的key-value键值对(也叫Entry)的个数

final float loadFactor;
负载因子，代表了table的填充度有多少，默认是0.75
加载因子存在的原因，还是因为减缓哈希冲突，如果初始桶为16，等到满16个元素才扩容，某些桶里可能就有不止一个元素了。
所以加载因子默认为0.75，也就是说大小为16的HashMap，到了第13个元素，就会扩容成32。

int modCount;
HashMap被改变的次数，由于HashMap非线程安全，在对HashMap进行迭代时，
如果期间其他线程的参与导致HashMap的结构发生变化了（比如put，remove等操作），
需要抛出异常ConcurrentModificationException

在这里插入图片描述

a)无参构造，英文描述还是很清楚的，翻译过来就是构造了一个默认的HashMap,默认容量16(默认的数组大小)，负载因子0.75(到达总量75%的时候扩容)

   /**
     * Constructs an empty <tt>HashMap</tt> with the default initial capacity
     * (16) and the default load factor (0.75).
     */
)
    public HashMap() {
        this.loadFactor = DEFAULT_LOAD_FACTOR; // all other fields defaulted
    }

b)传入容量为initialCapacity的HashMap

    /**
     * Constructs an empty <tt>HashMap</tt> with the specified initial
     * capacity and the default load factor (0.75).
     *
     * @param  initialCapacity the initial capacity.
     * @throws IllegalArgumentException if the initial capacity is negative.
     */
    public HashMap(int initialCapacity) {
        this(initialCapacity, DEFAULT_LOAD_FACTOR);
    }

c)前面很好懂，threshold = capacity *load factor,当HashMap的size到达这个值时，进行扩容

   /**
     * Constructs an empty <tt>HashMap</tt> with the specified initial
     * capacity and load factor.
     *
     * @param  initialCapacity the initial capacity
     * @param  loadFactor      the load factor
     * @throws IllegalArgumentException if the initial capacity is negative
     *         or the load factor is nonpositive
     */
    public HashMap(int initialCapacity, float loadFactor) {
        if (initialCapacity < 0)
            throw new IllegalArgumentException("Illegal initial capacity: " +
                                               initialCapacity);
        if (initialCapacity > MAXIMUM_CAPACITY)
            initialCapacity = MAXIMUM_CAPACITY;
        if (loadFactor <= 0 || Float.isNaN(loadFactor))
            throw new IllegalArgumentException("Illegal load factor: " +
                                               loadFactor);
        this.loadFactor = loadFactor;
        this.threshold = tableSizeFor(initialCapacity);
    }

注意最后一个方法
tableSizeFor(initialCapacity),主要功能是返回一个比给定的initialCapacity大且最接近的2的幂次方整数，比如入参10，则返回2的4次方16，至于为什么是2的幂次方，涉及到hash算法的均匀分布，后面会说到。

    /**
     * Returns a power of two size for the given target capacity.
     */
    static final int tableSizeFor(int cap) {
        int n = cap - 1;
        n |= n >>> 1;
        n |= n >>> 2;
        n |= n >>> 4;
        n |= n >>> 8;
        n |= n >>> 16;
        return (n < 0) ? 1 : (n >= MAXIMUM_CAPACITY) ? MAXIMUM_CAPACITY : n + 1;
    }

d)英文翻译过来也就可以理解了：大概意思是用传入的这个map构建一个新的HashMap，容量和负载因子还是默认的，但是注意putMapEntries()这个方法

    /**
     * Constructs a new <tt>HashMap</tt> with the same mappings as the
     * specified <tt>Map</tt>.  The <tt>HashMap</tt> is created with
     * default load factor (0.75) and an initial capacity sufficient to
     * hold the mappings in the specified <tt>Map</tt>.
     *
     * @param   m the map whose mappings are to be placed in this map
     * @throws  NullPointerException if the specified map is null
     */
    public HashMap(Map<? extends K, ? extends V> m) {
        this.loadFactor = DEFAULT_LOAD_FACTOR;
        putMapEntries(m, false);
    }

    //将入参的map的所有元素放到本HashMap中
    final void putMapEntries(Map<? extends K, ? extends V> m, boolean evict) {
        int s = m.size();
        if (s > 0) {
            if (table == null) { // pre-size
                float ft = ((float)s / loadFactor) + 1.0F;
                int t = ((ft < (float)MAXIMUM_CAPACITY) ?
                         (int)ft : MAXIMUM_CAPACITY);
                if (t > threshold)
                    threshold = tableSizeFor(t);
            }
            else if (s > threshold)
                resize();
            //上面的操作是为了初始化一些参数、并得到正确的容量
            //然后就开始逐个把元素放到本HashMap了
            for (Map.Entry<? extends K, ? extends V> e : m.entrySet()) {
                K key = e.getKey();
                V value = e.getValue();
                //put(K,V)也是调用的这个方法
                putVal(hash(key), key, value, false, evict);
            }
        }
    }

下一篇就说重头戏map是如何put一个键值对的
HashMap详解(二)

Alex_ChuTT

关注

1
点赞
踩
1

收藏

觉得还不错? 一键收藏
打赏
0
评论
HashMap详解(一)

字典小学时候的新华字典，通过偏旁部首找到了某个字，HashMap原理和这差不多，有的编程语言就命名这种数据结构叫字典。前置知识计算机的物理存储结构(数组、链表)数组:采用一段连续的存储单元来存储数据。对于指定下标的查找，时间复杂度为O(1)；通过给定值进行查找，需要遍历数组，逐一比对给定关键字和数组元素，时间复杂度为O(n)，当然，对于有序数组，则可采用二分查找，插值查找等方式，可将查找复杂度降低为O(logn)；对于一般的插入删除操作，涉及到数组元素的移动，其平均复杂度为O(n)链表对于链表
复制链接

扫一扫