Re-understanding HashMap from Its Underlying Implementation

Latest recommended article published on 2024-02-20 17:48:11

涂川江

Article link: https://blog.csdn.net/TCJGGSDDU/article/details/75452613

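HashMap is one of the most commonly used collections. It stores data as key-value (K-V) pairs and has the following properties:
1. Both the key and the value may be null.
2. Values may repeat freely; putting a key that already exists overwrites its old value.
3. The stored key-value pairs have no guaranteed order.
4. It is not thread-safe.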
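
The first two properties can be observed through the public API alone. The following is a minimal sketch; the class name HashMapBasics and the helper sample() are made up for illustration and are not part of the JDK.

```java
import java.util.HashMap;
import java.util.Map;

class HashMapBasics {
    /** Builds a small map that exercises a null key, a null value, and overwriting. */
    static Map<String, String> sample() {
        Map<String, String> map = new HashMap<>();
        map.put(null, "value for the null key"); // one null key is allowed
        map.put("a", null);                      // null values are allowed
        map.put("b", "first");
        map.put("b", "second");                  // same key: the old value is replaced
        return map;
    }

    public static void main(String[] args) {
        Map<String, String> map = sample();
        System.out.println("size = " + map.size());
        System.out.println("get(null) = " + map.get(null));
        System.out.println("get(\"b\") = " + map.get("b"));
        System.out.println("containsKey(\"a\") = " + map.containsKey("a"));
    }
}
```

Because get() returns null both for a missing key and for a key mapped to null, containsKey() is the reliable way to tell the two cases apart.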
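Looking at the HashMap source code (the listings in this article appear to come from the JDK 7 implementation), you will find this static nested class: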

 static class Entry<K,V> implements Map.Entry<K,V> {
        final K key;
        V value;
        Entry<K,V> next;
        int hash;

        ...
    }

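This shows that each Entry holds a reference only to the next Entry and none to the previous one, so the entries that fall into the same bucket form a singly linked list. HashMap as a whole is an array of such lists (the table field that appears in the code below).
The constructors show that the default initial capacity is 16 and the default load factor is 0.75: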

 static final int DEFAULT_INITIAL_CAPACITY = 1 << 4;
 static final float DEFAULT_LOAD_FACTOR = 0.75f;
 public HashMap() {
        this(DEFAULT_INITIAL_CAPACITY, DEFAULT_LOAD_FACTOR);
  }
 public HashMap(int initialCapacity) {
        this(initialCapacity, DEFAULT_LOAD_FACTOR);
  }
public HashMap(int initialCapacity, float loadFactor) {
        if (initialCapacity < 0)
            throw new IllegalArgumentException("Illegal initial capacity: " +initialCapacity);
        if (initialCapacity > MAXIMUM_CAPACITY)
            initialCapacity = MAXIMUM_CAPACITY;
        if (loadFactor <= 0 || Float.isNaN(loadFactor))
            throw new IllegalArgumentException("Illegal load factor: " + loadFactor);
        this.loadFactor = loadFactor;
        threshold = initialCapacity;
        init();
  }
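
Note that the constructor only records the requested capacity in threshold; the table itself is allocated later by inflateTable(), whose body is not shown in this article. The sketch below assumes that inflateTable() rounds the capacity up to a power of two and then sets the threshold to capacity * loadFactor. The class name CapacitySketch and the method thresholdFor() are hypothetical, and the body of roundUpToPowerOf2() is a reconstruction, not quoted source.

```java
class CapacitySketch {
    static final int MAXIMUM_CAPACITY = 1 << 30;

    /** Smallest power of two >= number (1 for number <= 1), capped at MAXIMUM_CAPACITY. */
    static int roundUpToPowerOf2(int number) {
        if (number >= MAXIMUM_CAPACITY) {
            return MAXIMUM_CAPACITY;
        }
        return (number > 1) ? Integer.highestOneBit((number - 1) << 1) : 1;
    }

    /** Entry count at which a table of the given capacity becomes eligible for resizing. */
    static int thresholdFor(int capacity, float loadFactor) {
        return (int) Math.min(capacity * loadFactor, MAXIMUM_CAPACITY + 1);
    }

    public static void main(String[] args) {
        for (int requested : new int[] {1, 10, 16, 17}) {
            int capacity = roundUpToPowerOf2(requested);
            System.out.println(requested + " -> capacity " + capacity
                    + ", threshold " + thresholdFor(capacity, 0.75f));
        }
    }
}
```

Under these assumptions a default HashMap has 16 buckets and becomes eligible for resizing once it holds 12 entries.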

1. The put method

 public V put(K key, V value) {
        if (table == EMPTY_TABLE) {
            inflateTable(threshold);
        }
        if (key == null)
            return putForNullKey(value);
        int hash = hash(key);
        int i = indexFor(hash, table.length);
        for (Entry<K,V> e = table[i]; e != null; e = e.next) {
            Object k;
            if (e.hash == hash && ((k = e.key) == key || key.equals(k))) {
                V oldValue = e.value;
                e.value = value;
                e.recordAccess(this);
                return oldValue;
            }
        }

        modCount++;
        addEntry(hash, key, value, i);
        return null;
    }

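Line 5 of the listing (the if (key == null) check) handles a null key. The for loop looks for an existing entry with the same key: if one is found, its value is overwritten and the old value is returned; otherwise a new entry is added. The null-key case is delegated to putForNullKey():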

  private V putForNullKey(V value) {
        for (Entry<K,V> e = table[0]; e != null; e = e.next) {
            if (e.key == null) {
                V oldValue = e.value;
                e.value = value;
                e.recordAccess(this);
                return oldValue;
            }
        }
        modCount++;
        addEntry(0, null, value, 0);
        return null;
    }

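It follows that the entry for a null key is always stored in the first bucket, table[0].
From line 7 of put() onward, the key's hash code is processed: hash(key) computes the hash, and indexFor(hash, table.length) turns it into a bucket index.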
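
The body of indexFor() is not shown in this article. A minimal sketch, assuming it masks the hash with length - 1 (which only works because the table length is always a power of two), could look like this; the class name IndexForSketch is made up for illustration.

```java
class IndexForSketch {
    /** Maps a hash to a bucket index; only valid when length is a power of two. */
    static int indexFor(int hash, int length) {
        return hash & (length - 1);
    }

    public static void main(String[] args) {
        int length = 16;
        for (int hash : new int[] {0, 17, 31, -1, Integer.MIN_VALUE}) {
            // For a power-of-two length the mask gives the same result as a
            // non-negative modulo, but with a single bitwise operation.
            System.out.println(hash + " -> bucket " + indexFor(hash, length)
                    + " (floorMod: " + Math.floorMod(hash, length) + ")");
        }
    }
}
```

The mask also handles negative hashes without any special case, which a plain % operator would not.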
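modCount++ supports the fail-fast mechanism: the counter is incremented every time the structure of the HashMap is modified, which lets iterators detect concurrent modification. The most important part is the addEntry() method: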

  void addEntry(int hash, K key, V value, int bucketIndex) {
        if ((size >= threshold) && (null != table[bucketIndex])) {
            // double the capacity
            resize(2 * table.length);
            hash = (null != key) ? hash(key) : 0;
            // recompute the bucket index for the hash in the resized table
            bucketIndex = indexFor(hash, table.length);
        }

        createEntry(hash, key, value, bucketIndex);
    }
   void createEntry(int hash, K key, V value, int  bucketIndex){
        Entry<K,V> e = table[bucketIndex];
        table[bucketIndex] = new Entry<>(hash, key, value, e);
        size++;
    }

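As the code shows, adding a new Entry first checks whether the table needs to grow (size has reached the threshold and the target bucket is already occupied) and then creates the Entry at the head of the bucket's chain.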
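
To make the interplay of the resize check and head insertion concrete, here is a heavily simplified sketch. It is not the JDK code: the class ChainedTableSketch and its Node type are hypothetical, keys are plain ints that serve as their own hash, and the load factor is fixed at 0.75.

```java
class ChainedTableSketch {
    static final class Node {
        final int key;
        int value;
        Node next;

        Node(int key, int value, Node next) {
            this.key = key;
            this.value = value;
            this.next = next;
        }
    }

    Node[] table = new Node[4];
    int size;
    int threshold = 3; // 4 buckets * 0.75

    static int indexFor(int hash, int length) {
        return hash & (length - 1);
    }

    void put(int key, int value) {
        int i = indexFor(key, table.length);
        for (Node e = table[i]; e != null; e = e.next) {
            if (e.key == key) { // existing key: overwrite in place
                e.value = value;
                return;
            }
        }
        // Same condition as addEntry(): grow only when the threshold is
        // reached AND the target bucket is already occupied.
        if (size >= threshold && table[i] != null) {
            resize(2 * table.length);
            i = indexFor(key, table.length);
        }
        table[i] = new Node(key, value, table[i]); // head insertion
        size++;
    }

    void resize(int newCapacity) {
        Node[] newTable = new Node[newCapacity];
        for (Node head : table) {
            for (Node e = head; e != null; ) {
                Node next = e.next;
                int i = indexFor(e.key, newCapacity);
                e.next = newTable[i]; // relink at the head of the new bucket
                newTable[i] = e;
                e = next;
            }
        }
        table = newTable;
        threshold = (int) (newCapacity * 0.75f);
    }

    public static void main(String[] args) {
        ChainedTableSketch t = new ChainedTableSketch();
        for (int key : new int[] {0, 4, 8, 12}) {
            t.put(key, key * 10);
            System.out.println("put " + key + ": capacity " + t.table.length + ", size " + t.size);
        }
    }
}
```

Keys 0, 4 and 8 all collide in bucket 0 of the four-slot table, so the fourth insertion meets both conditions and triggers the doubling.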

2. The remove method

  public V remove(Object key) {
        Entry<K,V> e = removeEntryForKey(key);
        return (e == null ? null : e.value);
    }
    final Entry<K,V> removeEntryForKey(Object key) {
        if (size == 0) {
            return null;
        }
        int hash = (key == null) ? 0 : hash(key);
        int i = indexFor(hash, table.length);
        Entry<K,V> prev = table[i];
        Entry<K,V> e = prev;

        while (e != null) {
            Entry<K,V> next = e.next;
            Object k;
            if (e.hash == hash &&
                ((k = e.key) == key || (key != null && key.equals(k)))) {
                modCount++;
                size--;
                if (prev == e)
                    table[i] = next;
                else
                    prev.next = next;
                e.recordRemoval(this);
                return e;
            }
            prev = e;
            e = next;
        }

        return e;
    }

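The source shows that removal first uses the key's hash to find which bucket of table holds the entry, then walks that bucket's chain and unlinks the matching Entry.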
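
The unlinking step is ordinary singly-linked-list removal. The sketch below isolates it from the hashing logic; ChainRemovalSketch and its Node class are hypothetical names, and the method returns the new head of the chain instead of writing to table[i] as the real code does.

```java
class ChainRemovalSketch {
    static final class Node {
        final String key;
        Node next;

        Node(String key, Node next) {
            this.key = key;
            this.next = next;
        }
    }

    /** Removes the first node whose key equals the argument and returns the new head. */
    static Node remove(Node head, String key) {
        Node prev = null;
        for (Node e = head; e != null; prev = e, e = e.next) {
            if (e.key.equals(key)) {
                if (prev == null) {
                    return e.next;  // removing the head: the bucket now starts at its successor
                }
                prev.next = e.next; // bypass the removed node
                return head;
            }
        }
        return head; // key not found: chain unchanged
    }

    public static void main(String[] args) {
        Node head = new Node("a", new Node("b", new Node("c", null)));
        head = remove(head, "b");
        for (Node e = head; e != null; e = e.next) {
            System.out.println(e.key);
        }
    }
}
```

Since an Entry has no reference to its predecessor, the loop has to carry prev along itself, just as removeEntryForKey() does.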
There is one more key point. createEntry() shows that the Entry objects of a HashMap are stored in table, yet the field is declared like this:

transient Entry<K,V>[] table = (Entry<K,V>[]) EMPTY_TABLE;

table is marked transient, so default serialization skips it. Why?
put() and addEntry() show that HashMap places entries according to hashCode, and Object's hashCode method is native:

  public native int hashCode();

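Different virtual machines may compute different hash codes for the same object, so a bucket index that is correct on one JVM may be wrong on another. Serializing the table as-is would conflict with Java's cross-platform promise,
so HashMap implements its own serialization: it writes out only the keys and values, and re-inserts them (recomputing every hash) when reading them back: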

   private void writeObject(java.io.ObjectOutputStream s)
        throws IOException
    {
        // Write out the threshold, loadfactor, and any hidden stuff
        s.defaultWriteObject();

        // Write out number of buckets
        if (table==EMPTY_TABLE) {
            s.writeInt(roundUpToPowerOf2(threshold));
        } else {
           s.writeInt(table.length);
        }

        // Write out size (number of Mappings)
        s.writeInt(size);

        // Write out keys and values (alternating)
        if (size > 0) {
            for(Map.Entry<K,V> e : entrySet0()) {
                s.writeObject(e.getKey());
                s.writeObject(e.getValue());
            }
        }
    }



    /**
     * Reconstitute the {@code HashMap} instance from a stream (i.e.,
     * deserialize it).
     */
    private void readObject(java.io.ObjectInputStream s)
         throws IOException, ClassNotFoundException
    {
        // Read in the threshold (ignored), loadfactor, and any hidden stuff
        s.defaultReadObject();
        if (loadFactor <= 0 || Float.isNaN(loadFactor)) {
            throw new InvalidObjectException("Illegal load factor: " +
                                               loadFactor);
        }

        // set other fields that need values
        table = (Entry<K,V>[]) EMPTY_TABLE;

        // Read in number of buckets
        s.readInt(); // ignored.

        // Read number of mappings
        int mappings = s.readInt();
        if (mappings < 0)
            throw new InvalidObjectException("Illegal mappings count: " +
                                               mappings);

        // capacity chosen by number of mappings and desired load (if >= 0.25)
        int capacity = (int) Math.min(
                    mappings * Math.min(1 / loadFactor, 4.0f),
                    // we have limits...
                    HashMap.MAXIMUM_CAPACITY);

        // allocate the bucket array;
        if (mappings > 0) {
            inflateTable(capacity);
        } else {
            threshold = capacity;
        }

        init();  // Give subclass a chance to do its thing.

        // Read the keys and values, and put the mappings in the HashMap
        for (int i = 0; i < mappings; i++) {
            K key = (K) s.readObject();
            V value = (V) s.readObject();
            putForCreate(key, value);
        }
    }
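
From the caller's side this custom serialization is invisible. The following round-trip sketch uses only standard java.io classes; the class name HashMapRoundTrip and the helper roundTrip() are made up for illustration, and checked exceptions are wrapped so the helper is easy to call.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.util.HashMap;

class HashMapRoundTrip {
    /** Serializes the map to a byte array and reads it back as a new HashMap. */
    @SuppressWarnings("unchecked")
    static HashMap<String, Integer> roundTrip(HashMap<String, Integer> original) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
                out.writeObject(original); // invokes HashMap's private writeObject
            }
            try (ObjectInputStream in =
                     new ObjectInputStream(new ByteArrayInputStream(bytes.toByteArray()))) {
                return (HashMap<String, Integer>) in.readObject(); // invokes readObject
            }
        } catch (IOException | ClassNotFoundException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        HashMap<String, Integer> original = new HashMap<>();
        original.put("one", 1);
        original.put("two", 2);
        original.put(null, 0);
        HashMap<String, Integer> copy = roundTrip(original);
        System.out.println("equal contents: " + copy.equals(original));
        System.out.println("same instance: " + (copy == original));
    }
}
```

The copy holds the same mappings as the original, but its bucket array is rebuilt from scratch on the reading side.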

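Finally, a summary of the differences between HashMap and Hashtable:
1. Hashtable is thread-safe: all of its public methods are synchronized. HashMap is not thread-safe.

2. Hashtable does not allow null values (nor null keys); a null causes a NullPointerException. HashMap has no such restriction.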
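
The null-handling difference is easy to check. In this small sketch the class NullHandlingDemo and the helper accepts() are hypothetical names; only HashMap, Hashtable, and Map come from the JDK.

```java
import java.util.HashMap;
import java.util.Hashtable;
import java.util.Map;

class NullHandlingDemo {
    /** Returns true if the map stores the pair, false if put() throws NullPointerException. */
    static boolean accepts(Map<String, String> map, String key, String value) {
        try {
            map.put(key, value);
            return true;
        } catch (NullPointerException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println("HashMap, null value: " + accepts(new HashMap<>(), "k", null));
        System.out.println("HashMap, null key: " + accepts(new HashMap<>(), null, "v"));
        System.out.println("Hashtable, null value: " + accepts(new Hashtable<>(), "k", null));
        System.out.println("Hashtable, null key: " + accepts(new Hashtable<>(), null, "v"));
    }
}
```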
