May 2021_Tianchao龙虾

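Original: Pointnet++ Code Reading Notes

Pointnet++ Code Reading Notes. Code: https://github.com/yanx27/Pointnet_Pointnet2_pytorch For notes on the PointNet++ paper, see this blog post. I. Network architecture. As the figure in the paper shows, the network has three main parts: PointNetSetAbstraction: this part of the source code uses the farthest point sampling and query ball algorithms. PointNetFeaturePropagation: this part includes sample and grouping...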

2021-05-19 14:47:20 868 5
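
The Pointnet++ entry above names farthest point sampling and query ball as the two algorithms behind PointNetSetAbstraction. The following is a minimal pure-Python sketch of both ideas, not the repository's implementation: the function names, the fixed `start_index`, and the simple truncation to `max_neighbors` are assumptions made for illustration.

```python
import math


def farthest_point_sampling(points, n_samples, start_index=0):
    """Greedily pick n_samples indices so each new point is far from those already picked.

    start_index is an illustrative assumption; real implementations often
    start from a random point.
    """
    if not 0 < n_samples <= len(points):
        raise ValueError("n_samples must be between 1 and len(points)")
    selected = [start_index]
    # min_dist[i] is the distance from point i to its nearest selected point.
    min_dist = [math.dist(p, points[start_index]) for p in points]
    while len(selected) < n_samples:
        # The next sample is the point farthest from the current selection.
        next_index = max(range(len(points)), key=min_dist.__getitem__)
        selected.append(next_index)
        for i, p in enumerate(points):
            min_dist[i] = min(min_dist[i], math.dist(p, points[next_index]))
    return selected


def ball_query(points, center, radius, max_neighbors):
    """Return up to max_neighbors indices of points within radius of center."""
    inside = [i for i, p in enumerate(points) if math.dist(p, center) <= radius]
    return inside[:max_neighbors]


cloud = [(0, 0, 0), (1, 0, 0), (10, 0, 0), (0, 5, 0)]
centroids = farthest_point_sampling(cloud, 3)
neighbors = ball_query(cloud, cloud[0], radius=1.5, max_neighbors=8)
```

The sampled indices act as region centroids, and the ball query then gathers the local neighborhood around each centroid.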

Original: Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression Paper Notes

Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression Paper Notes. Paper: https://arxiv.org/abs/1911.08287 I. Problem Statement. The bounding box regression branch is important in object detection. Currently, the l_n-norm loss is not well suited to obtaining the optimal IoU metric, and the IoU loss only works when the bounding boxes overlap and does not provide any...

2021-05-19 10:48:27 427
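
The Distance-IoU entry above states that the IoU loss only works when the boxes overlap. A small sketch can make that concrete, assuming axis-aligned boxes in `(x1, y1, x2, y2)` form and the common definition loss = 1 - IoU; the helper names are illustrative and not taken from the paper's code.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    # Width and height clamp to zero when the boxes do not overlap.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def iou_loss(pred, target):
    """IoU loss, assumed here to be 1 - IoU."""
    return 1.0 - iou(pred, target)


target = (0, 0, 2, 2)
loss_overlapping = iou_loss((1, 0, 3, 2), target)
loss_near_miss = iou_loss((3, 0, 5, 2), target)
loss_far_miss = iou_loss((30, 0, 32, 2), target)
```

Both non-overlapping predictions get the same loss however far they sit from the target, so the loss gives no signal about which direction to move the box.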

Original: PointNet Code Reading Notes

PointNet Code Walkthrough Notes. Code: https://github.com/fxia22/pointnet.pytorch For the paper notes, see this post. 1. T-Net. First, a look at the T-Net network structure: (1) Input transform. The first transformation network is a mini-PointNet that takes raw point cloud as input and regresses to a 3 × 3 matrix. It's composed...

2021-05-14 08:23:00 344

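Original: PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space Paper Notes

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space. Paper: https://arxiv.org/abs/1706.02413 I. Problem Statement. PointNet does not capture local structure information. The paper addresses PointNet's failure to extract features at different resolutions. The ability to extract local features at different levels can improve the network's generalization. II. Direction. Building on PointNet, it introduces a...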

2021-05-13 10:08:17 259

Original: Pointnet: Deep Learning on Point Sets for 3D Classification and Segmentation Paper Notes

Pointnet: Deep Learning on Point Sets for 3D Classification and Segmentation. Paper: https://arxiv.org/abs/1612.00593 I. Problem Statement. Most researchers convert point cloud data into regular 3D voxel grids or collections of images. This conversion makes the data unnecessarily voluminous and introduces quantization effects that obscure the natural invariances of point cloud data. II. D...

2021-05-11 09:39:45 249

Original: Mish: A Self Regularized Non-Monotonic Activation Function Paper Notes

Mish: A Self Regularized Non-Monotonic Activation Function. Paper: https://arxiv.org/abs/1908.08681 (BMVC 2020) I. Problem Statement. An improvement over Swish. II. Direction. Proposes a self-regularized, non-monotonic, self-gating activation function. III. Method. The function itself is: f(x) = x tanh(softplus(x)) = x tanh...

2021-05-10 10:53:17 662
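
The Mish entry above gives the function as f(x) = x tanh(softplus(x)). The sketch below evaluates it for scalars using only the standard library, assuming the usual definition softplus(x) = ln(1 + e^x); the numerically stable rewrite of softplus is an implementation choice for this example, not something taken from the post.

```python
import math


def softplus(x):
    """softplus(x) = ln(1 + e^x), written to avoid overflow for large |x|."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def mish(x):
    """Mish activation: x * tanh(softplus(x))."""
    return x * math.tanh(softplus(x))


samples = {x: mish(x) for x in (-5.0, -1.0, 0.0, 1.0, 20.0)}
```

Comparing the values at -5 and -1 shows the non-monotonic dip for negative inputs, while large positive inputs pass through almost unchanged.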

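Original: YOLO 9000: Better, Faster, Stronger Paper Notes

YOLO 9000: Better, Faster, Stronger. Paper: https://arxiv.org/abs/1612.08242 I. Problem Statement. The authors improve YOLOv1 and propose a method for jointly training object classification and detection. With this method, YOLO9000 can be trained on the COCO and ImageNet datasets at the same time, and the trained model can detect up to 9000 object categories in real time. II. Direction. Better, Faster: YOLOv2. Stronger: YOLO9000. The authors argue that YOLO...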

2021-05-08 08:40:57 172

Original: YOLOv3: An Incremental Improvement Paper Notes

YOLOv3: An Incremental Improvement. Paper: https://arxiv.org/abs/1804.02767 I. Problem Statement. Just a bunch of small changes that make it better. II. Direction. The authors' improvements target: Bounding Box Prediction; Class Prediction; Predictions Across Scales; Feature Extractor. III. M...

2021-05-08 08:37:16 453

Original: YOLOv4: Optimal Speed and Accuracy of Object Detection Paper Notes

YOLOv4: Optimal Speed and Accuracy of Object Detection. Paper: https://arxiv.org/abs/2004.10934 I. Problem Statement. The authors use a number of tricks to improve detection performance, including: Weighted-Residual-Connections (WRC), Cross-Stage-Partial-connections (CSP), Cross mini-Batch Normalization (CmBN), self-ad...

2021-05-08 08:32:56 275

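Original: SSD: Single Shot MultiBox Detector Paper Notes

SSD: Single Shot MultiBox Detector. Paper: https://arxiv.org/abs/1512.02325 I. Problem Statement. The authors argue that current detection systems are all variants of the following approach: hypothesize bounding boxes, resample pixels or features for each box, and apply a high-quality classifier. These methods are too computationally intensive for embedded systems and, even on high-end hardware, too slow for real-time or near-real-time applications. This paper presents the first deep-network-based object detector that does not resample pixels or features for bounding box hypotheses, but that, like the above...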

2021-05-08 08:25:07 206

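Original: Soft-NMS -- Improving Object Detection With One Line of Code Paper Notes

Soft-NMS -- Improving Object Detection With One Line of Code. Paper: https://arxiv.org/abs/1704.04503 I. Problem Statement. Traditional NMS first sorts the predicted bounding boxes, then selects the highest-scoring box and compares the overlap of the other boxes with it against a preset threshold, removing the bounding boxes that overlap too heavily. However, this has the problem shown in the figure below: traditional NMS sets the confidence of neighboring detection boxes to 0. So if an object really does appear within the overlap threshold,...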

2021-05-08 08:20:41 252
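
The Soft-NMS entry above describes the traditional NMS procedure that the paper sets out to improve. A minimal sketch of that traditional (hard) procedure follows, assuming boxes in `(x1, y1, x2, y2)` form; the function names and the sample threshold are illustrative, and this is not the paper's Soft-NMS variant.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def hard_nms(boxes, scores, iou_threshold):
    """Return indices of kept boxes, highest score first."""
    # Sort candidate indices by descending confidence.
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    while order:
        best = order.pop(0)
        keep.append(best)
        # Drop every remaining box that overlaps the kept box too much.
        order = [i for i in order if iou(boxes[best], boxes[i]) <= iou_threshold]
    return keep


boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (50, 50, 60, 60)]
scores = [0.9, 0.8, 0.7]
kept = hard_nms(boxes, scores, iou_threshold=0.5)
```

The second box is discarded outright because it overlaps the top-scoring box, which is the behavior the entry describes as setting a neighboring box's confidence to 0.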

Original: Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation Paper Notes

Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation. Paper: https://arxiv.org/abs/2012.07177 Code: https://github.com/conradry/copy-paste-aug I. Problem Statement. Proposes a data augmentation method for instance segmentation. II. Direction. Augment the data by simply copying and pasting objects. III. Method. The core idea is:...

2021-05-08 08:18:32 335 1

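Original: Scaled-YOLOv4: Scaling Cross Stage Partial Network Paper Notes

Scaled-YOLOv4: Scaling Cross Stage Partial Network. Paper: https://arxiv.org/abs/2011.08036 I. Problem Statement. The authors of CSPNet apply their CSPNet approach to improve YOLOv4 in terms of network depth, width, structure, and input image resolution. II. Direction. The authors found that in RegNet the optimal CNN depth is around 60, and that setting the bottleneck ratio to 1 and the cross-stage width growth rate to 2.5 yields the best performance...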

2021-05-08 08:16:31 502

Original: Res2Net: A New Multi-scale Backbone Architecture Paper Notes

Res2Net: A New Multi-scale Backbone Architecture. Paper: https://arxiv.org/abs/1904.01169 (IEEE TPAMI 2021) I. Problem Statement. Multi-scale ability is important for a network. Most current backbones improve this ability by strengthening the layer-wise multi-scale representation of CNNs. The authors propose a better and more efficient way to improve it. II. Direction. Stronger multi...

2021-05-08 08:10:32 355

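Original: Pseudo-LiDAR++: Accurate Depth for 3D Object Detection in Autonomous Driving Paper Notes

Pseudo-LiDAR++: Accurate Depth for 3D Object Detection in Autonomous Driving. Paper: https://arxiv.org/abs/1906.06310 I. Problem Statement. Improve pseudo-LiDAR's performance at detecting far-away objects. The authors observe a problem: stereo depth estimation methods are fairly reliable, but their depth estimate for an entire object tends to be either too near or too far. In the figure above, the red points are offset from the green bounding box by 2 meters, so the depth estimate is biased...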

2021-05-08 08:08:03 897

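Original: Path Aggregation Network for Instance Segmentation Paper Notes

Path Aggregation Network for Instance Segmentation. Paper: https://arxiv.org/abs/1803.01534 I. Problem Statement. Winner of the COCO 2017 instance segmentation challenge. The authors argue that information propagation in Mask-RCNN can be taken further; in particular, low-level features help identify large instances. However, low-level features have to travel a long path, which makes accurate localization information harder to access. Another problem is that each proposal is predicted from one specific feature level, and the information discarded from the other levels...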

2021-05-07 08:05:04 284

Original: Generalized Focal Loss Paper Notes

Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection. Paper: https://arxiv.org/abs/2006.04388 I. Problem Statement. Popular one-stage detectors currently tend to have three representations at the end of the head: classification, localization, quality estima...

2021-05-06 08:05:08 624

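Original: Feature Pyramid Network for Object Detection Paper Notes

Feature Pyramid Network for Object Detection. Paper: https://arxiv.org/abs/1612.03144 I. Problem Statement. Feature pyramids are a basic component of recognition systems, used to detect objects at different scales. Because traditional feature pyramids are too expensive in computation and memory, the authors propose building a feature pyramid at a small extra cost. It is a top-down architecture with lateral connections for building high-level semantic feature maps at all scales. (a) Using an image pyramid, features are obtained from different...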

2021-05-01 11:05:19 208
