Java集合在JDK1.7和JDK1.8中的不同——JDK1.7和JDK1.8中集合的对比研究——java集合深入理解

最新推荐文章于 2023-10-07 15:11:50 发布

后端阿一

最新推荐文章于 2023-10-07 15:11:50 发布

阅读量668

点赞数

分类专栏： Java 文章标签： java arraylist 集合源码 jdk1.8

本文链接：https://blog.csdn.net/qq_44704799/article/details/119798836

版权

Java 专栏收录该内容

6 篇文章 0 订阅

订阅专栏

本文详细对比了JDK1.7与JDK1.8中ArrayList的构造器、扩容机制、增删改查方法的差异。JDK1.8在构造器上使用了全局静态空数组，减少不必要的内存分配；在扩容机制上保持了1.7的1.5倍扩容策略，同时在某些情况下避免了不必要的数组复制。文章还探讨了add、remove等方法的实现细节，以及修剪、交集等操作的优化。通过对源码的深入分析，揭示了JDK1.8在性能和资源利用上的改进。

摘要由CSDN通过智能技术生成

与JDK1.7相比，JDK1.8对集合做了很多优化，这些优化里有很多优秀的算法、思想等等值得学习，所以在这里一一列出，便以后回顾，也希望对读者有些帮助

我们可以从构造器、扩容机制、增删改查、迭代器、并发修改异常等各个方面来分析

Collection

List

ArrayList
LinkedList
Vecto

Set

HashSet
LinkedHashSet
TreeSet

Map

HashMap
LinkedHashMap
TreeMap
HashTable

Properties

1、ArrayList

构造器
 扩容机制
 增删改查方法

1.1、构造器

1.1.1、无参构造器

JDK1.7中

/**
* Constructs an empty list with an initial capacity of ten.
 */
public ArrayList() {
    this(10);
}

JDK1.8中

private static final Object[] DEFAULTCAPACITY_EMPTY_ELEMENTDATA = {};
/**
 * Constructs an empty list with an initial capacity of ten.
 */
public ArrayList() {
    this.elementData = DEFAULTCAPACITY_EMPTY_ELEMENTDATA;
}

我们可以看出，在JDK1.7中无参构造器初始化时就实例化了一个长度为10的数组；
而在JDK1.8中，无参构造器值引用了一个全局静态的一个空数组；
这种改变有什么好处呢？

首先我们在构造器中就初始化一个数组就好比“饿汉式”的一种设计模式，所以我们可以类比“饿汉式”来分析，“饿汉式”和“懒汉式”的最大区别是：
1. “饿汉式”是用空间换时间，“懒汉式”是用时间换空间
2. “饿汉式”是采用预先加载法，调用是反应速度快；“懒汉式”是资源利用率高，但是第一次调用时效率低
3. 如何选择？如果单例模式在系统中会经常用到那么“饿汉式”是一个不错的选择，反之用“懒汉式”
4. 所以我们在实例化一个ArrayList的时候，如果经常需要对ArrayList进行增删改就需要提前申请好足够的容量，反之，就是要空参构造器（饱汉式）

1.1.2、初始化容量构造器

JDK1.7

/**
 * Constructs an empty list with the specified initial capacity.
 *
 * @param  initialCapacity  the initial capacity of the list
 * @throws IllegalArgumentException if the specified initial capacity
 *         is negative
 */
public ArrayList(int initialCapacity) {
    super();
    if (initialCapacity < 0)
        throw new IllegalArgumentException("Illegal Capacity: "+
                                           initialCapacity);
    this.elementData = new Object[initialCapacity];
}

JDK1.8

private static final Object[] EMPTY_ELEMENTDATA = {};
/**
 * Constructs an empty list with the specified initial capacity.
 *
 * @param  initialCapacity  the initial capacity of the list
 * @throws IllegalArgumentException if the specified initial capacity
 *         is negative
 */
public ArrayList(int initialCapacity) {
    if (initialCapacity > 0) {
        this.elementData = new Object[initialCapacity];
    } else if (initialCapacity == 0) {
        this.elementData = EMPTY_ELEMENTDATA;
    } else {
        throw new IllegalArgumentException("Illegal Capacity: "+
                                           initialCapacity);
    }
}

JDK1.8只是多出了一些条件判断，当初始化容量为0的时候，依然只引用了一个全局静态空数组，静态保证全局唯一，提高资源利用率

1.1.3、通过Collection创建ArrayList

JDK1.7

/**
 * Constructs a list containing the elements of the specified
 * collection, in the order they are returned by the collection's
 * iterator.
 *
 * @param c the collection whose elements are to be placed into this list
 * @throws NullPointerException if the specified collection is null
 */
public ArrayList(Collection<? extends E> c) {
    elementData = c.toArray();
    size = elementData.length;
    // c.toArray might (incorrectly) not return Object[] (see 6260652)
    if (elementData.getClass() != Object[].class)
        elementData = Arrays.copyOf(elementData, size, Object[].class);
}

JDK1.8

/**
 * Constructs a list containing the elements of the specified
 * collection, in the order they are returned by the collection's
 * iterator.
 *
 * @param c the collection whose elements are to be placed into this list
 * @throws NullPointerException if the specified collection is null
 */
public ArrayList(Collection<? extends E> c) {
    elementData = c.toArray();
    if ((size = elementData.length) != 0) {
        // c.toArray might (incorrectly) not return Object[] (see 6260652)
        if (elementData.getClass() != Object[].class)
            elementData = Arrays.copyOf(elementData, size, Object[].class);
    } else {
        // replace with empty array.
        this.elementData = EMPTY_ELEMENTDATA;
    }
}

public Object[] toArray() {
    return Arrays.copyOf(elementData, size);
}

JDK1.8相较于JDK1.7还是只多了一些判断，但是！！！
在这个源码中有一行注释值得深究，即c.toArray might (incorrectly) not return Object[] (see 6260652)
那么，为什么c.toArray可能返回的不是一个Object类型的数组呢，我们看ArrayList的源码中toArray方法返回的明明就是Object类型的数组呀？
上面的原因其实就是因为对象的多态性和ArrayList的泛型两者混合导致的一种bug

大家可以参考这一片博客来加以理解：集合源码中toArray方法的bug

1.2、扩容机制

首先，我们只有向ArrayList中添加数据的时候，才有可能引发扩容，所以我们从add方法入手分析：

1.2.1、JDK1.7扩容机制

/**
 * Appends the specified element to the end of this list.
 *
 * @param e element to be appended to this list
 * @return <tt>true</tt> (as specified by {@link Collection#add})
 */
public boolean add(E e) {
    ensureCapacityInternal(size + 1);  // Increments modCount!!
    elementData[size++] = e;
    return true;
}

ensureCapacityInternal：这个方法就是扩容机制对外暴露的接口，它隐藏了扩容的实现机制，我们无需知道扩容原理，就可直接实现扩容；

private void ensureCapacityInternal(int minCapacity) {
    modCount++;
    // overflow-conscious code
    if (minCapacity - elementData.length > 0)
        grow(minCapacity);
}

modCount这个属性，是处理并发修改的重要属性，我们在分析迭代器的时候在仔细深究
在源码中，只要是minCapacity，我们都可以理解为是最小需求容量

/**
 * Increases the capacity to ensure that it can hold at least the
 * number of elements specified by the minimum capacity argument.
 *
 * @param minCapacity the desired minimum capacity
 */
private void grow(int minCapacity) {
    // overflow-conscious code
    int oldCapacity = elementData.length;
    // 扩容为原来的1.5倍
    int newCapacity = oldCapacity + (oldCapacity >> 1);
    // 如果我们扩容（1.5倍）之后的容量还小于最小需求容量，那么就直接把minCapacity作为容量
    if (newCapacity - minCapacity < 0)
        newCapacity = minCapacity;
    if (newCapacity - MAX_ARRAY_SIZE > 0)
        newCapacity = hugeCapacity(minCapacity);
    // minCapacity is usually close to size, so this is a win:
    //底层都是数组拷贝
    elementData = Arrays.copyOf(elementData, newCapacity);
}

无非就是一些条件过滤

1.2.2、JDK1.8扩容机制

public boolean add(E e) {
    // 我们可以看到JDK1.8中扩容对外暴露的接口与JDK1.7相同
    ensureCapacityInternal(size + 1);  // Increments modCount!!
    elementData[size++] = e;
    return true;
}

private void ensureCapacityInternal(int minCapacity) {
    ensureExplicitCapacity(calculateCapacity(elementData, minCapacity));
}

以下方法是计算需求容量（minCapacity）的，无非就是与默认容量10相比，minCapacity < 10 ？ 10 ：minCapacity

private static int calculateCapacity(Object[] elementData, int minCapacity) {
    if (elementData == DEFAULTCAPACITY_EMPTY_ELEMENTDATA) {
        return Math.max(DEFAULT_CAPACITY, minCapacity);
    }
    return minCapacity;
}

private void ensureExplicitCapacity(int minCapacity) {
    modCount++;

    // overflow-conscious code
    if (minCapacity - elementData.length > 0)
        grow(minCapacity);
}

至于JDK1.8的grow方法和JDK1.7完全一样

扩容机制带来的思想：

当我们需要实现一个功能的时候，现需要能不能将其分装起来，实现为一个通用的功能，对外提供公共的接口，这样我们在之后的功能扩展的时候，就无需在实现这个功能了

1.3、增删改的一些方法

1.3.1、add方法（两个版本无修改）

我们可以看到将扩容方法封装，对外暴露接口的好处：

Increments modCount!!，我们只要调用ensureCapacityInternal方法，就会修改/确定容量，并修改modCount，而且无需知道底层实现了
因为add方法可能会需要扩容，所以我们就封装了一个扩容机制，而之后的remove方法，我们也封装了一个fastRemove方法
因为add，remove移动，添加，删除数组，可能会导致空指针、等并发修改异常的产生，所以都需要修改modCount属性，因此在ensureCapacityInternal和fastRemove中都封装了modCount++；这个在迭代器中仔细深究，现在只要留一个心眼

public boolean add(E e) {
    ensureCapacityInternal(size + 1);  // Increments modCount!!
    elementData[size++] = e;
    return true;
}

public void add(int index, E element) {
    //因为在类中多次出现这个代码，所以我们就封装了一个方法
    rangeCheckForAdd(index);

    ensureCapacityInternal(size + 1);  // Increments modCount!!
    System.arraycopy(elementData, index, elementData, index + 1,
                     size - index);
    elementData[index] = element;
    size++;
}

以上两个添加单个元素的方法，无非就是确定容量、移动元素（数组拷贝），size++；判断下标

public boolean addAll(Collection<? extends E> c) {
    // 这里我们注意到，它和我们之前构造器中说过toArray可能产生异常，为什么这个没有呢？
    // 哈哈，那是因为我们这个方法的形参中使用上限通配符(?)规定了泛型的上限，所以方法内部不可能再产生类型转换错误等异常了
    Object[] a = c.toArray();
    int numNew = a.length;
    ensureCapacityInternal(size + numNew);  // Increments modCount
    System.arraycopy(a, 0, elementData, size, numNew);
    size += numNew;
    return numNew != 0;
}

public boolean addAll(int index, Collection<? extends E> c) {
    rangeCheckForAdd(index);

    Object[] a = c.toArray();
    int numNew = a.length;
    ensureCapacityInternal(size + numNew);  // Increments modCount

    int numMoved = size - index;
    if (numMoved > 0)
        System.arraycopy(elementData, index, elementData, index + numNew,
                         numMoved);

    System.arraycopy(a, 0, elementData, index, numNew);
    size += numNew;
    return numNew != 0;
}

在某个下标处添加、删除元素，无非就是死套路：

判断下标合理性
是否需要扩容（涉及修改modCount）
移动元素（实际就是数组Copy）
修改size

1.3.2、remove方法

public boolean remove(Object o)：删除第一次出现的元素

public boolean remove(Object o) {
    //当我们传入的参数是引用类型的时候，都要考虑是否需要空值处理
    if (o == null) {
        for (int index = 0; index < size; index++)
            if (elementData[index] == null) {
                fastRemove(index);
                return true;
            }
    } else {
        for (int index = 0; index < size; index++)
            if (o.equals(elementData[index])) {
                fastRemove(index);
                return true;
            }
    }
    return false;
}

private void fastRemove(int index) {
    modCount++;
    int numMoved = size - index - 1;
    if (numMoved > 0)
        System.arraycopy(elementData, index+1, elementData, index,
                         numMoved);
    elementData[--size] = null; // clear to let GC do its work
}

public E remove(int index)：删除指定位置的元素

public E remove(int index) {
    rangeCheck(index);

    modCount++;
    E oldValue = elementData(index);

    int numMoved = size - index - 1;
    if (numMoved > 0)
        System.arraycopy(elementData, index+1, elementData, index,
                         numMoved);
    elementData[--size] = null; // clear to let GC do its work

    return oldValue;
}

public boolean removeAll(Collection<?> c)

public boolean removeAll(Collection<?> c) {
    //空值判断，抛空指针异常
    Objects.requireNonNull(c);
    //batch...: 批量操作，在jdbcTemplate中也有这种方法
    return batchRemove(c, false);
}

// 这个batchRemove方法值得深究一下，removeAll和retainAll两个方法都调用了这个方法
// 我们先说removeAll调用这个方法的执行流程
// 1.首先removeAll中传入complement = false，这个是什么意思呢？在代码中会更好的体会
private boolean batchRemove(Collection<?> c, boolean complement) {
    final Object[] elementData = this.elementData;
    int r = 0, w = 0;
    boolean modified = false;
    try {
        // 思路: 
        // 	1.complement为false: 遍历elementData中的元素，判断在c集合中有没有
        // 	如果没有，就将其移至最前面，最后把后面的置null，这样就实现了删除c集合中的元素，
        // 	即将elementData中独有的元素移至前面保留下来
        // 	2.complement为true: 遍历elementData中的元素，判断在c集合中有没有
        // 	如果有，就将其移至前面，最后把后面的置null，这样就实现了交集的操作，
        // 	即将elementData中的在c中也有的保留下来（交集）
        for (; r < size; r++)
            if (c.contains(elementData[r]) == complement)
                elementData[w++] = elementData[r];
    } finally {
        // Preserve behavioral compatibility with AbstractCollection,
        // even if c.contains() throws.
        //上面的翻译：保证与 AbstractCollection 的兼容，防止c.contains抛异常
        if (r != size) {
            System.arraycopy(elementData, r,
                             elementData, w,
                             size - r);
            w += size - r;
        }
        if (w != size) {
            // clear to let GC do its work
            for (int i = w; i < size; i++)
                elementData[i] = null;
            modCount += size - w;
            size = w;
            modified = true;
        }
    }
    return modified;
}

protected void removeRange(int fromIndex, int toIndex)：

删除[fromIndex，toIndex) 元素；
protected修饰，所以我们在自定义ArrayList的时候可以用到，但是这个方法没有检查 index 的合理性

protected void removeRange(int fromIndex, int toIndex) {
    modCount++;
    int numMoved = size - toIndex;
    System.arraycopy(elementData, toIndex, elementData, fromIndex,
                     numMoved);

    // clear to let GC do its work
    int newSize = size - (toIndex-fromIndex);
    for (int i = newSize; i < size; i++) {
        elementData[i] = null;
    }
    size = newSize;
}

无非还是四步死套路
clear to let GC do its work：当我们能够手动释放资源的时候就要手动释放资源，方便GC快速回收，提高资源利用率（什么Stream，connection链接什么的都要手动释放，否则…嘿嘿）

1.3.3、get方法

public E get(int index) {
    rangeCheck(index);

    return elementData(index);
}

1.3.4、set方法

public E set(int index, E element) {
    rangeCheck(index);

    E oldValue = elementData(index);
    elementData[index] = element;
    return oldValue;
}

1.4、修剪方法

作用：将elementData数组的大小缩小为size；

/**
 * Trims the capacity of this <tt>ArrayList</tt> instance to be the
 * list's current size.  An application can use this operation to minimize
 * the storage of an <tt>ArrayList</tt> instance.
 */
public void trimToSize() {
    modCount++;
    if (size < elementData.length) {
        elementData = (size == 0)
          ? EMPTY_ELEMENTDATA
          : Arrays.copyOf(elementData, size);
    }
}

1.5、交集方法

public boolean retainAll(Collection<?> c) {
    Objects.requireNonNull(c);
    return batchRemove(c, true);
}

// 这个batchRemove方法值得深究一下，removeAll和retainAll两个方法都调用了这个方法
// 我们先说removeAll调用这个方法的执行流程
// 1.首先removeAll中传入complement = false，这个是什么意思呢？在代码中会更好的体会
private boolean batchRemove(Collection<?> c, boolean complement) {
    final Object[] elementData = this.elementData;
    int r = 0, w = 0;
    boolean modified = false;
    try {
        // 思路: 
        // 	1.complement为false: 遍历elementData中的元素，判断在c集合中有没有
        // 	如果没有，就将其移至最前面，最后把后面的置null，这样就实现了删除c集合中的元素，
        // 	即将elementData中独有的元素移至前面保留下来
        // 	2.complement为true: 遍历elementData中的元素，判断在c集合中有没有
        // 	如果有，就将其移至前面，最后把后面的置null，这样就实现了交集的操作，
        // 	即将elementData中的在c中也有的保留下来（交集）
        for (; r < size; r++)
            if (c.contains(elementData[r]) == complement)
                elementData[w++] = elementData[r];
    } finally {
        // Preserve behavioral compatibility with AbstractCollection,
        // even if c.contains() throws.
        //上面的翻译：保证与 AbstractCollection 的兼容，防止c.contains抛异常
        if (r != size) {
            System.arraycopy(elementData, r,
                             elementData, w,
                             size - r);
            w += size - r;
        }
        if (w != size) {
            // clear to let GC do its work
            for (int i = w; i < size; i++)
                elementData[i] = null;
            modCount += size - w;
            size = w;
            modified = true;
        }
    }
    return modified;
}