HBase源码分析之regionserver读取流程分析

最新推荐文章于 2022-12-13 19:05:13 发布

VIP文章 xiangel

最新推荐文章于 2022-12-13 19:05:13 发布

阅读量1.7k

点赞数

分类专栏： HBase源码分析文章标签： hbase 源码

本文链接：https://blog.csdn.net/xiangel/article/details/53702823

版权

数据的读取包括Get和Scan2种，通过get的代码可以看出实际也是通过转换为一个Scan来处理的。

//HRegion.java
public List<Cell> get(Get get, boolean withCoprocessor) throws IOException {

    List<Cell> results = new ArrayList<Cell>();
    ...
    Scan scan = new Scan(get);

    RegionScanner scanner = null;
    try {
      scanner = getScanner(scan);
      scanner.next(results);
    } finally {
      if (scanner != null)
        scanner.close();
    }
    ...
    return results;
  }

接下来，来看getScaner方法的处理。getScanner方法开始做了family检查后，接下来就是调用instantiateRegionScanner

//HRegion.java
protected RegionScanner instantiateRegionScanner(Scan scan,
      List<KeyValueScanner> additionalScanners) throws IOException {
    if (scan.isReversed()) {
      if (scan.getFilter() != null) {
        scan.getFilter().setReversed(true);
      }
      return new ReversedRegionScannerImpl(scan, additionalScanners, this);
    }
    return new RegionScannerImpl(scan, additionalScanners, this);
  }

这里根据scan的类型的不同返回不同的RegionScanner对象。
顺序scan为RegionScannerImpl，反向scan为ReversedRegionScannerImpl
先来看RegionScannerImpl的实现

RegionScannerImpl(Scan scan, List<KeyValueScanner> additionalScanners, HRegion region)
        throws IOException {

      this.region = region;
      this.maxResultSize = scan.getMaxResultSize();
      if (scan.hasFilter()) {
        this.filter = new FilterWrapper(scan.getFilter());
      } else {
        this.filter = null;
      }

      /**
       * By default, calls to next/nextRaw must enforce the batch limit. Thus, construct a default
       * scanner context that can be used to enforce the batch limit in the event that a
       * ScannerContext is not specified during an invocation of next/nextRaw
       */
      defaultScannerContext = ScannerContext.newBuilder().setBatchLimit(scan.getBatch()).build();

      if (Bytes.equals(scan.getStopRow(), HConstants.EMPTY_END_ROW) && !scan.isGetScan()) {
        this.stopRow = null;
      } else {
        this.stopRow = scan.getStopRow();
      }
      // If we are doing a get, we want to be [startRow,endRow] normally
      // it is [startRow,endRow) and if startRow=endRow we get nothing.
      this.isScan = scan.isGetScan() ? -1 : 0;

      // synchronize on scannerReadPoints so that nobody calculates
      // getSmallestReadPoint, before scannerReadPoints is updated.
      IsolationLevel isolationLevel = scan.getIsolationLevel();
      synchronized(scannerReadPoints) {
        this.readPt = getReadpoint(isolationLevel);
        scannerReadPoints.put(this, this.readPt);
      }

      // Here we separate all scanners into two lists - scanner that provide data required
      // by the filter to operate (scanners list) and all others (joinedScanners list).
      List<KeyValueScanner> scanners = new ArrayList<KeyValueScanner>();
      List<KeyValueScanner> joinedScanners = new ArrayList<KeyValueScanner>();
      if (additionalScanners != null) {
        scanners.addAll(additionalScanners);
      }

      for (Map.Entry<byte[], NavigableSet<byte[]>> entry :
          scan.getFamilyMap().entrySet()) {
        Store store = stores.get(entry.getKey());
        KeyValueScanner scanner = store.getScanner(scan, entry.getValue(), this.readPt);
        if (this.filter == null || !scan.doLoadColumnFamiliesOnDemand()
          || this.filter.isFamilyEssential(entry.getKey())) {
          scanners.add(scanner);
        } else {
          joinedScanners.add(scanner);
        }
      }
      initializeKVHeap(scanners, joinedScanners, region);
    }

这里additionalScanners为null，
scan.doLoadColumnFamiliesOnDemand() scan默认是false。
filter.isFamilyEssential(entry.getKey()) filter都为true
所以这里每个family的scanner都是在scanners里面
再来看HStore的getScanner方法

public KeyValueScanner getScanner(Scan scan,
      final NavigableSet<byte []> targetCols, long readPt) throws IOException {
    lock.readLock().lock();

最低0.47元/天解锁文章

xiangel

关注

0
点赞
踩
2

收藏

觉得还不错? 一键收藏
0
评论
HBase源码分析之regionserver读取流程分析

数据的读取包括Get和Scan2种，通过get的代码可以看出实际也是通过转换为一个Scan来处理的。//HRegion.javapublic List<Cell> get(Get get, boolean withCoprocessor) throws IOException { List<Cell> results = new ArrayList<Cell>(); ...
复制链接

扫一扫