An Exploration of How Java's HashMap Is Implemented

架构师思考实践

Article link: https://blog.csdn.net/smile0198/article/details/23878961

JDK version: 1.7

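Concepts

hash

A hash, familiar to anyone who has studied data structures, is the result of passing an input of arbitrary length through a hash algorithm to obtain an output of fixed length. Hash algorithms themselves are not the focus here.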
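To make "arbitrary-length input, fixed-length output" concrete, here is a small sketch using String.hashCode(), which always returns a 32-bit int. The class and method names (HashLengthDemo, hashOf) are made up for illustration and do not appear in the JDK.

```java
// Hypothetical demo class: shows that hashCode() maps inputs of any length
// to a fixed-size (32-bit) int.
final class HashLengthDemo {

    // Returns the 32-bit hash code of the given string.
    static int hashOf(String input) {
        return input.hashCode();
    }

    public static void main(String[] args) {
        // Inputs of very different lengths all produce a single int.
        System.out.println(hashOf("a"));
        System.out.println(hashOf("abc"));
        System.out.println(hashOf("a much longer input string than the other two"));
    }
}
```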

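Java's HashMap implementation

1. Storage structure

First, we know that the elements stored in a map are Entry<K,V> objects.

The HashMap class uses an array of Entry objects for storage, and each element of the array may be the head of a linked list.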

/**
     * The table, resized as necessary. Length MUST Always be a power of two.
     */
    transient Entry[] table;

Initial capacity, maximum capacity, and load factor:

 /**
     * The default initial capacity - MUST be a power of two.
     */
    static final int DEFAULT_INITIAL_CAPACITY = 1 << 4; // aka 16, the default

    /**
     * The maximum capacity, used if a higher value is implicitly specified
     * by either of the constructors with arguments.
     * MUST be a power of two <= 1<<30.
     */
    static final int MAXIMUM_CAPACITY = 1 << 30;   // at most 2 to the power of 30

    /**
     * The load factor used when none specified in constructor.
     */
    static final float DEFAULT_LOAD_FACTOR = 0.75f;  // 0.75 by default

The constructor we usually call is this one:

/**
     * Constructs an empty <tt>HashMap</tt> with the default initial capacity
     * (16) and the default load factor (0.75).
     */
    public HashMap() {
        this.loadFactor = DEFAULT_LOAD_FACTOR;
        threshold = (int)(DEFAULT_INITIAL_CAPACITY * DEFAULT_LOAD_FACTOR);
        table = new Entry[DEFAULT_INITIAL_CAPACITY];
        init();
    }

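The table is sized dynamically: once the number of entries reaches capacity × load factor (3/4 of the capacity by default, i.e. 12 entries for a 16-slot table), the table becomes eligible to be resized to twice its previous capacity.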
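The threshold arithmetic from the constructor can be tried in isolation. The sketch below reuses the expression (int)(capacity * loadFactor) from the listing; the class and method names (ThresholdDemo, thresholdFor) are hypothetical.

```java
// Hypothetical demo class: computes the resize threshold the same way the
// constructor shown in the article does.
final class ThresholdDemo {

    // Entry count at which a table of the given capacity becomes due to grow.
    static int thresholdFor(int capacity, float loadFactor) {
        return (int) (capacity * loadFactor);
    }

    public static void main(String[] args) {
        int capacity = 16; // DEFAULT_INITIAL_CAPACITY
        // Print the threshold for the default capacity and two doublings.
        for (int i = 0; i < 3; i++) {
            System.out.println(capacity + " slots -> threshold " + thresholdFor(capacity, 0.75f));
            capacity *= 2;
        }
    }
}
```

With the defaults, the threshold is 12 for 16 slots and 24 for 32 slots.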

You can, of course, also specify the initial capacity and load factor:

public HashMap(int initialCapacity, float loadFactor)
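The body of this constructor is not shown here. Since the comments above insist that the table length must be a power of two, one plausible way to honor an arbitrary requested capacity is to round it up to the next power of two. The following is only a sketch of that idea; the helper name roundUpToPowerOfTwo is hypothetical, and the exact code in any given JDK build may differ.

```java
// Hypothetical demo class: rounds a requested capacity up to a power of two,
// capped at the MAXIMUM_CAPACITY value quoted in the article.
final class CapacityDemo {

    static final int MAXIMUM_CAPACITY = 1 << 30;

    // Smallest power of two that is >= requested (and at least 1).
    static int roundUpToPowerOfTwo(int requested) {
        if (requested >= MAXIMUM_CAPACITY) {
            return MAXIMUM_CAPACITY;
        }
        int capacity = 1;
        while (capacity < requested) {
            capacity <<= 1; // double until large enough
        }
        return capacity;
    }

    public static void main(String[] args) {
        System.out.println(roundUpToPowerOfTwo(10));
        System.out.println(roundUpToPowerOfTwo(16));
        System.out.println(roundUpToPowerOfTwo(17));
    }
}
```

Under this assumption, a requested capacity of 10 yields a 16-slot table and 17 yields a 32-slot table.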

2. How hashing works

Computing the hash value:

/**
     * Retrieve object hash code and applies a supplemental hash function to the
     * result hash, which defends against poor quality hash functions.  This is
     * critical because HashMap uses power-of-two length hash tables, that
     * otherwise encounter collisions for hashCodes that do not differ
     * in lower bits. Note: Null keys always map to hash 0, thus index 0.
     */
    final int hash(Object k) {
        int h = hashSeed;
        if (0 != h && k instanceof String) {
            return sun.misc.Hashing.stringHash32((String) k);
        }

        h ^= k.hashCode();

        // This function ensures that hashCodes that differ only by
        // constant multiples at each bit position have a bounded
        // number of collisions (approximately 8 at default load factor).
        h ^= (h >>> 20) ^ (h >>> 12);
        return h ^ (h >>> 7) ^ (h >>> 4);
    }
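To see why the supplemental shifts matter, consider two hash codes that differ only in their high bits. The sketch below copies the two shift-and-XOR lines from the listing into a standalone method and assumes hashSeed is 0, so the String special case is left out. The names SpreadDemo and spread are hypothetical.

```java
// Hypothetical demo class: isolates the bit-spreading step of hash(),
// assuming hashSeed == 0.
final class SpreadDemo {

    // Same shift-and-XOR sequence as in the article's hash() listing.
    static int spread(int h) {
        h ^= (h >>> 20) ^ (h >>> 12);
        return h ^ (h >>> 7) ^ (h >>> 4);
    }

    public static void main(String[] args) {
        int a = 0x10000;
        int b = 0x20000;
        int mask = 16 - 1; // 16-slot table

        // Raw hash codes: the low four bits are identical, so both collide.
        System.out.println((a & mask) + " " + (b & mask));
        // After spreading, the high bits influence the low bits.
        System.out.println((spread(a) & mask) + " " + (spread(b) & mask));
    }
}
```

In a 16-slot table both raw values map to slot 0, whereas after spreading they map to slots 1 and 2.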

The array index is then computed by bitwise-ANDing the hash value with the array length minus one:

/**
     * Returns index for hash code h.
     */
    static int indexFor(int h, int length) {
        // assert Integer.bitCount(length) == 1 : "length must be a non-zero power of 2";
        return h & (length-1);
    }
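Because the length is a power of two, length-1 is a mask of low-order one bits, so the AND behaves like a non-negative modulo and always yields a valid index, even for negative hash values. A small sketch that checks this against Math.floorMod (available since Java 8); the class name IndexForDemo is hypothetical.

```java
// Hypothetical demo class: compares h & (length-1) with a floor modulo for
// power-of-two lengths.
final class IndexForDemo {

    // Same expression as the article's indexFor().
    static int indexFor(int h, int length) {
        return h & (length - 1);
    }

    public static void main(String[] args) {
        int length = 16;
        int[] samples = {0, 1, 17, 31, -1, Integer.MIN_VALUE, Integer.MAX_VALUE};
        for (int h : samples) {
            // Both columns agree when length is a power of two.
            System.out.println(h + " -> " + indexFor(h, length)
                    + " (floorMod: " + Math.floorMod(h, length) + ")");
        }
    }
}
```

The equivalence holds only for power-of-two lengths, which is why the table length is constrained that way.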

3. The put method

/**
     * Associates the specified value with the specified key in this map.
     * If the map previously contained a mapping for the key, the old
     * value is replaced.
     *
     * @param key key with which the specified value is to be associated
     * @param value value to be associated with the specified key
     * @return the previous value associated with <tt>key</tt>, or
     *         <tt>null</tt> if there was no mapping for <tt>key</tt>.
     *         (A <tt>null</tt> return can also indicate that the map
     *         previously associated <tt>null</tt> with <tt>key</tt>.)
     */
    public V put(K key, V value) {
        if (key == null)
            return putForNullKey(value);
        int hash = hash(key);  // hash(Object) calls key.hashCode() itself
        int i = indexFor(hash, table.length);
        for (Entry<K,V> e = table[i]; e != null; e = e.next) {
            Object k;
            if (e.hash == hash && ((k = e.key) == key || key.equals(k))) {
                V oldValue = e.value;
                e.value = value;
                e.recordAccess(this);
                return oldValue;
            }
        }

        modCount++;
        addEntry(hash, key, value, i);
        return null;
    }

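As we can see, put first recomputes a hash for the key and uses it to find the array index. It then walks the linked list at that slot: if an entry with an equal key is already present, its value is replaced and the old value is returned. Otherwise a new entry is added at that slot, and if the slot is already occupied, the new entry is placed at the head of the linked list.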
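The put logic can be reproduced in a few lines. The following is a deliberately simplified sketch, not the JDK code: it uses a fixed 16-slot table, skips the supplemental hash, null keys, and resizing, and all names (TinyChainedMap, Node, headKeyOfBucket) are hypothetical.

```java
// Hypothetical, simplified chained hash map illustrating replace-or-insert
// with head insertion. Non-null keys only; the table never grows.
final class TinyChainedMap<K, V> {

    private static final class Node<K, V> {
        final int hash;
        final K key;
        V value;
        Node<K, V> next;

        Node(int hash, K key, V value, Node<K, V> next) {
            this.hash = hash;
            this.key = key;
            this.value = value;
            this.next = next;
        }
    }

    @SuppressWarnings("unchecked")
    private final Node<K, V>[] table = (Node<K, V>[]) new Node[16];

    // Returns the previous value for the key, or null if there was none.
    V put(K key, V value) {
        int hash = key.hashCode();
        int i = hash & (table.length - 1);
        for (Node<K, V> e = table[i]; e != null; e = e.next) {
            if (e.hash == hash && (e.key == key || key.equals(e.key))) {
                V old = e.value;
                e.value = value; // existing key: replace in place
                return old;
            }
        }
        // New key: link the new node in front of the current chain.
        table[i] = new Node<>(hash, key, value, table[i]);
        return null;
    }

    // Exposed only so the head-insertion order can be observed.
    K headKeyOfBucket(int i) {
        return table[i] == null ? null : table[i].key;
    }
}
```

For example, Integer keys 1 and 17 both map to slot 1 of a 16-slot table; after inserting 1 and then 17, key 17 sits at the head of that chain.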

4. The get method

 public V get(Object key) {
        if (key == null)
            return getForNullKey();
        int hash = hash(key);  // hash(Object) calls key.hashCode() itself
        for (Entry<K,V> e = table[indexFor(hash, table.length)];
             e != null;
             e = e.next) {
            Object k;
            if (e.hash == hash && ((k = e.key) == key || key.equals(k)))
                return e.value;
        }
        return null;
    }

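get first computes the hash of the key and locates the corresponding slot in the array. It then uses a for loop to walk the linked list at that slot, comparing the stored hash first and then the key itself, by reference or with equals, until it finds a match.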
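The equals check is what keeps colliding keys apart. The strings "Aa" and "BB" happen to have the same hashCode (2112), so they end up in the same bucket, yet java.util.HashMap still returns the right value for each. A small sketch, with the class and method names (CollisionLookupDemo, build) chosen only for illustration:

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical demo class: two distinct keys with identical hash codes are
// still looked up correctly because get() also compares keys with equals().
final class CollisionLookupDemo {

    static Map<String, Integer> build() {
        Map<String, Integer> map = new HashMap<>();
        map.put("Aa", 1);
        map.put("BB", 2); // same hashCode as "Aa", different key
        return map;
    }

    public static void main(String[] args) {
        Map<String, Integer> map = build();
        System.out.println("Aa".hashCode() == "BB".hashCode());
        System.out.println(map.get("Aa"));
        System.out.println(map.get("BB"));
    }
}
```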

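How much data can a HashMap hold?

In the put method, addEntry is called when the key is not already present. If the map has reached its resize threshold and the target slot is already occupied, addEntry grows the table first: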

/**
     * Adds a new entry with the specified key, value and hash code to
     * the specified bucket.  It is the responsibility of this
     * method to resize the table if appropriate.
     *
     * Subclass overrides this to alter the behavior of put method.
     */
    void addEntry(int hash, K key, V value, int bucketIndex) {
        if ((size >= threshold) && (null != table[bucketIndex])) {
            resize(2 * table.length);
            hash = (null != key) ? hash(key) : 0;
            bucketIndex = indexFor(hash, table.length);
        }

        createEntry(hash, key, value, bucketIndex);
    }

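size is the number of key-value pairs in the map, and threshold is the size at which a resize becomes due. If the number of pairs has reached the threshold and the target slot in the array is already occupied, the array is doubled in size, and the hash and bucket index are then recomputed against the new length.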
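Two details of this step can be checked in isolation: the two-part resize condition, and the effect of doubling on the bucket index. Because indexFor masks with length-1, doubling the length exposes exactly one more bit of the hash, so an entry either keeps its index or moves up by the old length. The sketch below is illustrative only; ResizeDemo and shouldResize are hypothetical names.

```java
// Hypothetical demo class: the resize condition from addEntry() and the
// index relationship between a table and its doubled successor.
final class ResizeDemo {

    // Mirrors the condition in addEntry(): both parts must hold.
    static boolean shouldResize(int size, int threshold, boolean bucketOccupied) {
        return size >= threshold && bucketOccupied;
    }

    // Same expression as the article's indexFor().
    static int indexFor(int h, int length) {
        return h & (length - 1);
    }

    public static void main(String[] args) {
        int oldLength = 16;
        int newLength = 2 * oldLength;
        int[] hashes = {5, 21, 37, 53};
        for (int h : hashes) {
            // The new index is either the old index or the old index + 16.
            System.out.println(h + ": " + indexFor(h, oldLength)
                    + " -> " + indexFor(h, newLength));
        }
    }
}
```

Note that a map at its threshold does not grow if the new entry's slot happens to be empty, so size can temporarily exceed threshold.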