记HashSet中的元素与size方法不一致的疑惑

最新推荐文章于 2022-06-08 10:39:13 发布

山鬼谣me

最新推荐文章于 2022-06-08 10:39:13 发布

阅读量2.1k

点赞数 2

分类专栏： Java 文章标签： Java 单例 HashSet

本文链接：https://blog.csdn.net/u013066244/article/details/88264750

版权

Java 专栏收录该内容

177 篇文章

订阅专栏

环境

java 8

前言

今天在测试线程不安全的单例代码时，发现HashSet中的元素和其size()调出来的结果不一致；
当然，宏观来说，不一致的原因，就是因为HashSet底层使用的是HashMap，而HashMap本身就是线程不安全的，所以就会导致这种原因；

但是呢，我忍不住想知道，到底哪一块的代码造成的，所以阅读了下源码；算是明白了；

先看效果图：
在这里插入图片描述

如上图，明明就是一个元素，size()确显示3

代码

	public static void main(String[] args) throws InterruptedException {
		Set<SingleTest2> set = new HashSet<>();
		for (int i = 0; i < 100; i++) {
			new Thread(() -> {
				SingleTest2 single = SingleTest2.getSingle();
				set.add(single);
				System.out.println(single);
				}).start();
		}
		Thread.sleep(1000);
		System.out.println(set.size() + "元素为：" + set);
	}

上面的代码就是，开100个线程，模拟高并发，去获取单例对象；

本来我是测试单例代码写的对不对，结果又跑来研究HashSet了。

HashSet的size方法

    /**
     * Returns the number of elements in this set (its cardinality).
     *
     * @return the number of elements in this set (its cardinality)
     */
    public int size() {
        return map.size();
    }

    /**
     * Returns the number of key-value mappings in this map.
     *
     * @return the number of key-value mappings in this map
     */
    public int size() {
        return size;
    }

从上面的代码中，可以看出，就是获取HashMap类中的一个size字段。

那么，当添加元素时，这个size是怎么计算的呢？接着往下看

HashSet 的 add方法

    public boolean add(E e) {
        return map.put(e, PRESENT)==null;
    }

可以看出，其调用的是HashMap的put方法；

而put方法最终会调用如下方法：

/**
 * Implements Map.put and related methods
 *
 * @param hash hash for key
 * @param key the key
 * @param value the value to put
 * @param onlyIfAbsent if true, don't change existing value
 * @param evict if false, the table is in creation mode.
 * @return previous value, or null if none
 */
  final V putVal(int hash, K key, V value, boolean onlyIfAbsent,
                 boolean evict) {
      Node<K,V>[] tab; Node<K,V> p; int n, i;
      if ((tab = table) == null || (n = tab.length) == 0)
          n = (tab = resize()).length;
      //重点看if这块代码；
      //else 会直接return掉，不会改变size大小
      if ((p = tab[i = (n - 1) & hash]) == null)
      	  //表面table[i]上没有元素，那么就new一个节点出来
          tab[i] = newNode(hash, key, value, null);
      else {
          Node<K,V> e; K k;
          if (p.hash == hash &&
              ((k = p.key) == key || (key != null && key.equals(k))))
              e = p;
          else if (p instanceof TreeNode)
              e = ((TreeNode<K,V>)p).putTreeVal(this, tab, hash, key, value);
          else {
              for (int binCount = 0; ; ++binCount) {
                  if ((e = p.next) == null) {
                      p.next = newNode(hash, key, value, null);
                      if (binCount >= TREEIFY_THRESHOLD - 1) // -1 for 1st
                          treeifyBin(tab, hash);
                      break;
                  }
                  if (e.hash == hash &&
                      ((k = e.key) == key || (key != null && key.equals(k))))
                      break;
                  p = e;
              }
          }
          if (e != null) { // existing mapping for key
              V oldValue = e.value;
              if (!onlyIfAbsent || oldValue == null)
                  e.value = value;
              afterNodeAccess(e);
              return oldValue;
          }
      }
      //这个是用于检测当前遍历的map是否被别的线程修改了
      ++modCount;
      //重点这里，size加一了
      if (++size > threshold)
          resize();
      afterNodeInsertion(evict);
      return null;
  }

我们重点看如下代码：

      //重点看if这块代码；
      //else 会直接return掉，不会改变size大小
      if ((p = tab[i = (n - 1) & hash]) == null)
      	  //表面table[i]上没有元素，那么就new一个节点出来
          tab[i] = newNode(hash, key, value, null);

当高并发时，多个线程进入上面这段代码时，可能都会去创建一个新节点（这样也会导致数据丢失），然后不走else的部分；
这样就会导致，执行下面的代码：

      //这个是用于检测当前遍历的map是否被别的线程修改了
      ++modCount;
      //重点这里，size加一了
      if (++size > threshold)
          resize();

上面的size自增了；在多线程的情况下，这个size就不对了。
这就能解释我的疑惑了；

为什么CopyOnWriteArraySet安全集合没问题呢？

因为他们底层不再像HashMap那样使用size进行自增，而是对数组Object[]计算长度，也就是length方法。

总结

这是Java基础知识，记录下，加深印象