lovefengqiang-CSDN博客

Learning Rich Features from RGB-D Images for Object Detection and Segmentation

In this paper we study the problem of object detection for RGB-D images using semantically rich image and depth features.We pro- pose a new geocentric embedding for depth images that encodes height above ground and angle with gravity for each pixel in addition to the hor- izontal disparity. We demonstrate that this geocentric embedding works better than using raw depth images for learning feature representations with convolutional neural networks. Our nal object detection system achieves an average precision of 37.3%, which is a 56% relative improve- ment over existing methods. We then focus on the task of instance seg- mentation where we label pixels belonging to object instances found by our detector. For this task, we propose a decision forest approach that classies pixels in the detection window as foreground or background us- ing a family of unary and binary tests that query shape and geocentric pose features. Finally, we use the output from our object detectors in an existing superpixel classication framework for semantic scene segmenta- tion and achieve a 24% relative improvement over current state-of-the-art for the object categories that we study.We believe advances such as those represented in this paper will facilitate the use of perception in elds like robotics.

2018-04-22

Fast Object Detection in 3D Point Clouds Using Convolutional Neural Networks

Abstract— This paper proposes a computationally efficient approach to detecting objects natively in 3D point clouds using convolutional neural networks (CNNs). In particular, this is achieved by leveraging a feature-centric voting scheme to implement novel convolutional layers which explicitly exploit the sparsity encountered in the input. To this end, we examine the trade-off between accuracy and speed for different architectures and additionally propose to use an L1 penalty on the filter activations to further encourage sparsity in the intermediate representations. To the best of our knowledge, this is the first work to propose sparse convolutional layers and L1 regularisation for efficient large-scale processing of 3D data. We demonstrate the efficacy of our approach on the KITTI object detection benchmark and show that Vote3Deep models with as few as three layers outperform the previous state of the art in both laser and laser-vision based approaches by margins of up to 40% while remaining highly competitive in terms of processing time.

2018-04-08

Convolutional-Recursive Deep Learning for 3D Object Classification

Recent advances in 3D sensing technologies make it possible to easily record color and depth images which together can improve object recognition. Most current methods rely on very well-designed features for this new 3D modality. We introduce a model based on a combination of convolutional and recursive neural networks (CNN and RNN) for learning features and classifying RGB-D images.The CNN layer learns low-level translationally invariant features which are then given as inputs to multiple, fixed-tree RNNs in order to compose higher order features. RNNs can be seen as combining convolution and pooling into one efficient,hierarchical operation. Our main result is that even RNNs with random weights compose powerful features. Our model obtains state of the art performance on a standard RGB-D object dataset while being more accurate and faster during training and testing than comparable architectures such as two-layer CNNs.

2018-04-08

dwarf4_fileformat

DWARF is a debugging file format used by many compilers and debuggers to support source level debugging. It addresses the requirements of a number of procedural languages, such as C, C++, and Fortran, and is designed to be extensible to other languages. DWARF is architecture independent and applicable to any processor or operating system. It is widely used on Unix, Linux and other operating systems, as well as in stand-alone environments.

2012-12-12

lovefengqiang的博客

空空如也

Learning Rich Features from RGB-D Images for Object Detection and Segmentation

Fast Object Detection in 3D Point Clouds Using Convolutional Neural Networks

Convolutional-Recursive Deep Learning for 3D Object Classification

dwarf4_fileformat

MIPS的汇编语言程序设计教程

FPGA设计中的编程技巧

嵌入式系统编程_使用C和Gnu开发工具

数字电路及系统设计

Xilinx FPGA 最新培训讲义

ARM嵌入式系统结构与编程课件

数字设计与VHDL设计

C内存管理算法和实现

MP4编码解码源代码

VHDL编写的自动售货机

步进电机Linux驱动

fpga实现CAN总线源码

空空如也