CHM知多少？？

最新推荐文章于 2024-06-25 22:34:32 发布

化蝶飞舞

最新推荐文章于 2024-06-25 22:34:32 发布

阅读量2.3k

点赞数 1

分类专栏： Java 文章标签： java hashmap

本文链接：https://blog.csdn.net/yangyiping525/article/details/108682178

版权

Java 专栏收录该内容

7 篇文章 0 订阅

订阅专栏

什么是CHM ?

  1、CHM 全称：ConcurrentHashMap 是J.U.C工具包中所提供的,显然是个map集合，而且是个线程安全的HashMap,既然是线程安全的那么显然下并发场景下用的还是比较频繁的。
  2、CHM作用和HashMap差不太多，也可以说是一个线程安全的HashMap
  3、使用和普通的Map是一致的，包含put,get 方法等

    ConcurrentHashMap chmMap = new ConcurrentHashMap();
     chmMap.put("key","value");
     chmMap.get("key");

CHM的发展

 这里为什么说下它的发展呢，是因为在jdk1.7和jdk1.8这2个版本下发生了一些比较大的变化（即对它进行的优化升级）
 jdk1.7版本的主要内容：jdk1.7在线程安全方面主要利用到了分段锁（segment）,简单的说一个CHM是由N（初始化16）个segment组成的，这样的话就通过在每个segment端上加锁来控制线程安全的。

如下图:
在这里插入图片描述

jdk1.8 针对jdk1.7做了些改进 主要有如下：
    1、取消了segement分段锁这部分
    2、由原来的数字+单向链表 改为 数组+单向链表（或红黑树）
原因: 1、去掉segment的分段锁，使得锁的范围更小了，减少了阻塞的概率，提升锁的效率。
      2、数组+链表，缺点：单向链表的时间复杂度为0(n)，如果同一个节点hash碰撞过多的话，那么这个节点的单向链表的长度就很可能成为整个map查询的瓶颈了。因此优化了当单向链表的长度增加到了8（默认），那么就会由原来的链表转为红黑树，红黑树的查询效率比链表要高很多0(logn)，利用了二分查找等机制，加速了查找速度。
      如下图：

在这里插入图片描述

CHM的原理浅析

public V put(K key, V value) {
return putVal(key, value, false);
}
对于这个方法，我这边做有个我自己的理解，希望大致说下这个逻辑对后续的源码理解有帮助

大致逻辑如下 ：每一点说明一个逻辑判断
1、hash  - > hash = spread(key.hashCode()); // 获取下标
2、一个循环，没有结束条件 for (Node<K,V>[] tab = table;;)
        2.1、判断该数组是否进行过初始化(这个一块一定记住始终考虑并发的情况出现)      进行初始化tab[].   
        2.2、根据has获取对应数组的值，如果null ,cas(原子操作)，基础赋值（占据这个位置）
        2.3、考虑到该结点 是否处于迁移，如果该结点正在迁移，那么说明有线程在迁移，也可以加入，配合迁移（协助迁移）
        2.4、如果该结点不为空，那么存在以下几种情况
            2.4.1、 该节点下是单链表，一次循环找到对应结点位置，
            2.4.2、若该结点是 树，那么根据树的结构将数据存起来
        2.5、判断该结点下的个数,。TREEIFY_THRESHOLD = 8 
            如大于 8  if (binCount >= TREEIFY_THRESHOLD)   转为树  
3、计算元素个数

初始化

// 初始化数组
private final Node<K,V>[] initTable() {
     Node<K,V>[] tab; int sc;
     // 循环存在多线程竞争的情况下
     while ((tab = table) == null || tab.length == 0) {
         if ((sc = sizeCtl) < 0) // 以及被其他线程抢到了，开始初始化了
             Thread.yield(); // lost initialization race; just spin
         else if (U.compareAndSwapInt(this, SIZECTL, sc, -1)) {
             try {  //  cas原子操作，标记已经在初始化了，与第一个if相对应
                 if ((tab = table) == null || tab.length == 0) {
                 //  默认初始值16 。
                   //  private static final int DEFAULT_CAPACITY = 16;
                     int n = (sc > 0) ? sc : DEFAULT_CAPACITY;
                     @SuppressWarnings("unchecked")
                     Node<K,V>[] nt = (Node<K,V>[])new Node<?,?>[n];
                     table = tab = nt;
              sc = n - (n >>> 2); //扩容因子，下次需要扩容的大小（0.75）
                 }
             } finally {
                 sizeCtl = sc;
             }
             break;
         }
     }
     return tab;
 }
 /** Implementation for put and putIfAbsent */
 final V putVal(K key, V value, boolean onlyIfAbsent) {
    //键不能为空，否则报错
     if (key == null || value == null) throw new NullPointerException();
     //hash
     int hash = spread(key.hashCode()); 
     int binCount = 0;
     for (Node<K,V>[] tab = table;;) {
         Node<K,V> f; int n, i, fh;
         if (tab == null || (n = tab.length) == 0)
             // 初始化 数组
             tab = initTable(); 
         else if ((f = tabAt(tab, i = (n - 1) & hash)) == null) {
             if (casTabAt(tab, i, null,
                          new Node<K,V>(hash, key, value, null)))
                 break;                   // no lock when adding to empty bin
         }
         else if ((fh = f.hash) == MOVED)
             tab = helpTransfer(tab, f);
         else {
             V oldVal = null;
             synchronized (f) {
                 if (tabAt(tab, i) == f) {
                     if (fh >= 0) {
                         binCount = 1;
                         for (Node<K,V> e = f;; ++binCount) {
                             K ek;
                             if (e.hash == hash &&
                                 ((ek = e.key) == key ||
                                  (ek != null && key.equals(ek)))) {
                                 oldVal = e.val;
                                 if (!onlyIfAbsent)
                                     e.val = value;
                                 break;
                             }
                             Node<K,V> pred = e;
                             if ((e = e.next) == null) {
                                 pred.next = new Node<K,V>(hash, key,
                                                           value, null);
                                 break;
                             }
                         }
                     }
                     else if (f instanceof TreeBin) {  // 判断是否为树
                         Node<K,V> p;
                         binCount = 2;
                         if ((p = ((TreeBin<K,V>)f).putTreeVal(hash, key,
                                                        value)) != null) {
                             oldVal = p.val;
                             if (!onlyIfAbsent)
                                 p.val = value;
                         }
                     }
                 }
             }
             // static final int TREEIFY_THRESHOLD = 8;
             if (binCount != 0) {
             // 如果节点数，大于初始值默认8），那么转成红黑树
                 if (binCount >= TREEIFY_THRESHOLD)
                     treeifyBin(tab, i);
                 if (oldVal != null)
                     return oldVal;
                 break;
             }
         }
     }
     addCount(1L, binCount);
     return null;
 }
 
============
 Class<?> ak = Node[].class;
  ABASE = U.arrayBaseOffset(ak);
         int scale = U.arrayIndexScale(ak);
         if ((scale & (scale - 1)) != 0)
             throw new Error("data type scale not a power of two");
         ASHIFT = 31 - Integer.numberOfLeadingZeros(scale);
   =============     
@SuppressWarnings("unchecked")
 static final <K,V> Node<K,V> tabAt(Node<K,V>[] tab, int i) {
 // 此方法 是通过获取offset的偏移量 如上代码，实际等价于tab[i]，那么为什么不这么取值？
     return (Node<K,V>)U.getObjectVolatile(tab, ((long)i << ASHIFT) + ABASE);
 }

为什么通过tabAt()单独的去获取值呢？？不知直接用tab[i]更家直接的获取值呢？

原因: 看代码上getObjectVolatile 虽然有volatile关键字，但是我们都知道，因为对 volatile 写操作 happen-before 于 volatile 读操作，
因此其他线程对 table 的修改均对 get 读取可见的，由此可见直接tab[i] 不一定能获取的最新的值。
虽然 table 数组本身是增加了 volatile 属性，但是“volatile 的数组只针对数组的引用具有volatile 的语义，而不是它的元素”。所以就直接用内存上的偏移量来获取值。

个数计算
api 方法 - > chmMap.size();
这个计算计算个数由于涉及到多线程，那么单纯的计算个数是可能不准确的，因此这边利用的分而治之的算法方式，来获取最后的个数

       /**
     * Adds to count, and if table is too small and not already
     * resizing, initiates transfer. If already resizing, helps
     * perform transfer if work is available.  Rechecks occupancy
     * after a transfer to see if another resize is already needed
     * because resizings are lagging additions.
     *
     * @param x the count to add
     * @param check if <0, don't check resize, if <= 1 only check if uncontended  //  check 指的是扩容标记
     */
    private final void addCount(long x, int check) {
        CounterCell[] as; long b, s;
        if ((as = counterCells) != null ||
            !U.compareAndSwapLong(this, BASECOUNT, b = baseCount, s = b + x)) {
            //baseCount  是基础个数，可以认为是在没有竞争的情况下，通过baseCount的累加记录个数的
            CounterCell a; long v; int m;
            boolean uncontended = true; // 表示默认没有冲突，即无竞争
            if (as == null || (m = as.length - 1) < 0 ||
                (a = as[ThreadLocalRandom.getProbe() & m]) == null ||
                !(uncontended =
                  U.compareAndSwapLong(a, CELLVALUE, v = a.value, v + x))) {
                  //  计算元素个数 baseCount+x（1.竞争，2.无竞争）
                fullAddCount(x, uncontended);
                return;
            }
            if (check <= 1)
                return;
            s = sumCount();
        }
        if (check >= 0) {
            Node<K,V>[] tab, nt; int n, sc;
            while (s >= (long)(sc = sizeCtl) && (tab = table) != null &&
                   (n = tab.length) < MAXIMUM_CAPACITY) {
                int rs = resizeStamp(n);
                if (sc < 0) {
                    if ((sc >>> RESIZE_STAMP_SHIFT) != rs || sc == rs + 1 ||
                        sc == rs + MAX_RESIZERS || (nt = nextTable) == null ||
                        transferIndex <= 0)
                        break;
                    if (U.compareAndSwapInt(this, SIZECTL, sc, sc + 1))
                        transfer(tab, nt);
                }
                else if (U.compareAndSwapInt(this, SIZECTL, sc,
                                             (rs << RESIZE_STAMP_SHIFT) + 2))
                    transfer(tab, null);
                s = sumCount();
            }
        }
    }
// 计算个数，期间需要考虑扩容等，加锁等问题
private final void fullAddCount(long x, boolean wasUncontended) {
        int h;
        // 随机获取一个下标，如没有对应数组下标内为空，那么不存在竞争
        if ((h = ThreadLocalRandom.getProbe()) == 0) {
            ThreadLocalRandom.localInit();      // force initialization
            h = ThreadLocalRandom.getProbe();
            wasUncontended = true;
        }
        boolean collide = false;                // True if last slot nonempty
   for (;;) {
      CounterCell[] as; CounterCell a; int n; long v;
      if ((as = counterCells) != null && (n = as.length) > 0) {
         if ((a = as[(n - 1) & h]) == null) { // 若为空
           if (cellsBusy == 0) {      // Try to attach new Cell 为新的值，当前不再初始化或扩容状态下，创建新的cell值，设置 x（个数）
         CounterCell r = new CounterCell(x); // Optimistic create
              if (cellsBusy == 0 &&
               U.compareAndSwapInt(this, CELLSBUSY, 0, 1)) {  // cas 加锁
                 boolean created = false;
                  try {     // Recheck under lock  // 加锁防止并发修改对应的值
                                CounterCell[] rs; int m, j;
                                if ((rs = counterCells) != null &&
                                    (m = rs.length) > 0 &&
                                    rs[j = (m - 1) & h] == null) {
                                    rs[j] = r;  // 将元素的个数 放入对应于的数组内rs[j]
                                    created = true;
                                }
                            } finally {
                                cellsBusy = 0;
                            }
                            if (created)
                                break;
                            continue;           // Slot is now non-empty
                        }
                    }
                    collide = false;
                }
      else if (!wasUncontended)    // CAS already known to fail 
          wasUncontended = true;      // Continue after rehash ,未冲突
    else if (U.compareAndSwapLong(a, CELLVALUE, v = a.value, v + x))
        break;
       // 如果已经有其他线程建立了新的 counterCells 或者 CounterCells 大于 CPU 核心数（线程的并发数不会超过 cpu 核心数）
     else if (counterCells != as || n >= NCPU)
            collide = false;            // At max size or stale
     else if (!collide)
           collide = true;
     else if (cellsBusy == 0 &&
          U.compareAndSwapInt(this, CELLSBUSY, 0, 1)) {
      try { 
          if (counterCells == as) {// Expand table unless stale
                  // // 需要扩容了 n*2 （左移1位）
                  CounterCell[] rs = new CounterCell[n << 1];
                            for (int i = 0; i < n; ++i)
                                rs[i] = as[i];
                      counterCells = rs;
                        }
                    } finally {
                        cellsBusy = 0;
                    }
                    collide = false;
                    continue;                   // Retry with expanded table
                }
                h = ThreadLocalRandom.advanceProbe(h);
            }
            // 没有初始化，开始初始化 默认为2的大小
            else if (cellsBusy == 0 && counterCells == as &&
                     U.compareAndSwapInt(this, CELLSBUSY, 0, 1)) {
                boolean init = false;
                try {                           // Initialize table
                    if (counterCells == as) {
                        CounterCell[] rs = new CounterCell[2];
                        rs[h & 1] = new CounterCell(x);
                        counterCells = rs;
                        init = true;
                    }
                } finally {
                    cellsBusy = 0;
                }
                if (init)
                    break;
            }
            // cas操作成功，baseCount累加
            else if (U.compareAndSwapLong(this, BASECOUNT, v = baseCount, v + x))
                break;                          // Fall back on using base
        }
    }

  // 求计算个数的table初始化 默认大小为2，后面也可以扩容
      else if (cellsBusy == 0 && counterCells == as &&
                     U.compareAndSwapInt(this, CELLSBUSY, 0, 1)) {
                boolean init = false;
                try {                           // Initialize table
                    if (counterCells == as) {
                        CounterCell[] rs = new CounterCell[2];
                        rs[h & 1] = new CounterCell(x);
                        counterCells = rs;
                        init = true;
                    }
                } finally {
                    cellsBusy = 0;
                }

在这里插入图片描述
通过上面的这个图以及结合代码知道2点：
1、当没有并发的时候，会有个baseCount 来记录当前元素的个数的。
2、当出现并发时，那么就需要考虑到怎么样更加高效的计算出个数，那么这里就引入了通过对当前线程的Random随机落到一个数组的一个节点上，因此组中通过计算整个数组上的value值来得出最终的count数，如果数组不够的情况，也会有设计到扩容的哦。如下代码：

//计算总的元素个数
final long sumCount() {
        CounterCell[] as = counterCells; CounterCell a;
        long sum = baseCount;
        if (as != null) {
            for (int i = 0; i < as.length; ++i) {
                if ((a = as[i]) != null)
                    sum += a.value;
            }
        }
        return sum;
    }

// 创建cell数组，存在竞争以及扩容
 else if (cellsBusy == 0 && U.compareAndSwapInt(this, CELLSBUSY, 0, 1)) {
  try {
     if (counterCells == as) {// Expand table unless stale
         CounterCell[] rs = new CounterCell[n << 1];
          for (int i = 0; i < n; ++i)
               rs[i] = as[i];
                 counterCells = rs;
               }
          } finally {
                 cellsBusy = 0;
     }
         collide = false;
     continue;           // Retry with expanded table
     }
   h = ThreadLocalRandom.advanceProbe(h);

扩容
扩容阶段，个人认为主要有几点需要注意
扩容入口就是在计算元素个数的时候，有个参数需要注意：sizeCtl（扩容因子）
注意：2点
1、迁移协助扩容，以及扩容因子和扩容戳
2、迁移时的高低位之分（这一点参考 HashMap那一块，有详解：扩容高低位 - hashMap）

1、如果有线程正在扩容，那么当前线程就会协助扩容
    else if ((fh = f.hash) == MOVED) 
          tab = helpTransfer(tab, f);
2、如果没有在扩容，那么就是当达到扩容阈值的时候，直接出发扩容

// check 就是 addCount(1L, binCount); 中的binCount 的值
      if (check >= 0) {
          Node<K,V>[] tab, nt; int n, sc;
//s 标识集合大小，如果集合大小大于或等于扩容阈值（默认值的 0.75）
//并且 table 不为空并且 table 的长度小于最大容量
          while (s >= (long)(sc = sizeCtl) && (tab = table) != null &&
                 (n = tab.length) < MAXIMUM_CAPACITY) {
              int rs = resizeStamp(n); // 扩容戳 
              if (sc < 0) {
              //  这一块理解起来有点难，但是我觉总体来就是说：因为考虑到并发情况下，所以需要加锁，那么加锁又会影响其迁移的性能，因此就用过位运算来判断线程之间是互斥的，从而计算出每个线程迁移的节点是哪些.个人认为扩容时间戳就是这么个东西.将table[]数组单做一个共享的同步队列。每个线程负责自己区域，并且将迁移移走的节点标记（fwd-：ForwardingNode）
                  if ((sc >>> RESIZE_STAMP_SHIFT) != rs || sc == rs + 1 ||
                      sc == rs + MAX_RESIZERS || (nt = nextTable) == null ||
                      transferIndex <= 0)
                      break;
                  if (U.compareAndSwapInt(this, SIZECTL, sc, sc + 1))
                      transfer(tab, nt);
              }
              else if (U.compareAndSwapInt(this, SIZECTL, sc,
                                           (rs << RESIZE_STAMP_SHIFT) + 2))
                  transfer(tab, null);
              s = sumCount();
          }
      }

好了，关于CHM 暂时告一段落，写的不好，还望各位看官，多多指点，谢谢！

化蝶飞舞

关注

1
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
CHM知多少？？

CHM的个人感悟什么是CHM ?CHM的发展CHM的原理浅析欢迎使用Markdown编辑器新的改变功能快捷键合理的创建标题，有助于目录的生成如何改变文本的样式插入链接与图片如何插入一段漂亮的代码片生成一个适合你的列表创建一个表格设定内容居中、居左、居右SmartyPants创建一个自定义列表如何创建一个注脚注释也是必不可少的KaTeX数学公式新的甘特图功能，丰富你的文章UML 图表FLowchart流程图导出与导入导出导入什么是CHM ? 1、CHM 全称：ConcurrentHashMap 是J.U
复制链接

扫一扫