2019年05月_luputo

原创 Pytorch自定义参数

Pytorch自定义参数如果想要灵活地使用模型，可能需要自定义参数，比如class Net(nn.Module): def __init__(self): super(Net,self).__init__() self.A = torch.randn((2,3),requires_grad=True) self.B = nn.Linear...

2019-05-22 23:01:22 11206 4

原创 CIDEr: Consensus-based Image Description Evaluation

CIDEr: Consensus-based Image Description Evaluation原文地址时间：2015Intro自动度量 description of an image 的困难提出了consensus-based 的度量协议：CIDErApproach给定一副图片和一系列人给出的描述，我们的目标是估计candidate和大多数ground truth的相似性...

2019-05-31 20:34:16 1478 2

原创 ROUGE: A Package for Automatic Evaluation of Summaries

ROUGE: A Package for Automatic Evaluation of Summaries原文地址时间：2004

2019-05-31 20:34:08 2151

原创 ImageNet Classification with Deep Convolutional Neural Networks

ImageNet Classification with Deep Convolutional Neural Networks原文地址时间：2012IntroImageNet是一个包含15 million的高分辨率图片数据集，其中包括了22000个类别，ILSVRC(ImageNet Large-Scale Visual Recognition Challenge)使用了ImageNet的...

2019-05-31 20:33:57 421

原创第四十三周学习笔记

第四十三周学习笔记论文阅读ImageNet Classification with Deep Convolutional Neural Networks，提出了AlexNet，在ImageNet上首度超过传统方法ROUGE: A Package for Automatic Evaluation of Summaries，一个基于recall的nlg度量CIDEr: Consensus-b...

2019-05-31 20:32:24 205

原创第四十二周学习笔记

第四十二周学习笔记论文阅读概述RCNN两个insight：proposal+CNN，ImageNet预训练模型+fine-tuning三个模块：proposal,CNN,SVM四个步骤：提出proposal，CNN提取特征，SVM分类，bounding box回归精炼boxFast RCNN解决RCNN慢、multi-stage、存储消耗多的问题对整张图片求特征图后，通过...

2019-05-24 19:32:32 150

原创 Semi-Supervised Classification with Graph Convolutional Networks

Semi-Supervised Classification with Graph Convolutional Networks原问地址时间：2017Intro要解决的问题：图上的结点分类，其中只有小部分结点有label。这是一个基于图的半监督学习，可以通过在损失函数中额外加一项graph-based regularization来解决：其中L0\mathcal{L}_0L0表示l...

2019-05-24 17:51:20 4180

原创 Single Image Haze Removal Using Dark Channel Prior

Single Image Haze Removal Using Dark Channel Prior时间：2019Intro本文使用了一个简单高效的去雾算法，他的原理基于一个对无雾图片的统计结果，在大多数非天空局部区域中总有一个像素点的至少一个通道的值很低，由此我们可以进行去雾操作背景常用的雾的形成公式为其中I是观测到的图像，J是原图，t是透射率，A是大气光成分，去雾就是要从I恢复J...

2019-05-24 17:48:44 2322

原创 Faster RCNN

Faster RCNN原文地址时间：2016年IntroSelective Search与高效的网络相比，还是大一个量级的时间复杂度，每张图片需要2s，相比于EdgeBoxes的0.2秒，使得Region proposal成为瓶颈。本文使用深度卷积网络来计算proposal，让Region Proposal Networks(RPNs)与目标检测网络共享卷积层，作者发现，卷积特征不仅能被...

2019-05-24 17:44:32 242

原创 Fast R-CNN

Fast R-CNN论文地址时间：2015年IntroR-CNN的缺点训练是一个多步骤的过程：fine-tune+SVM+bounding box regression训练在时间和空间上消耗很大test-time很慢R-CNN很慢的原因之一是卷积网络分别计算所有proposals，而没有共享计算（因为这些proposal大多有相交的部分）。SPPnets为了解决这个问题，先计...

2019-05-24 17:41:00 140

原创 RCNN论文阅读笔记

RCNN论文阅读笔记原文：Rich feature hierarchies for accurate object detection and semantic segmentation时间：2014年文章解决了什么问题目标检测发展停滞，通过复杂的ensemble模型才能勉强提高一些精度用了什么方法两个 insights在bottom-up region proposals中应用卷...

2019-05-24 17:32:44 188

原创 C++ 获取文件夹下所有文件名

C++ 获取文件夹下所有文件名使用c++17对文件系统的官方支持#include <filesystem>#include <iostream>namespace fs = std::filesystem;int main(){ string path = "./imgs"; for (const auto &entry : fs::direc...

2019-05-22 10:29:53 2228 1

原创 windows内网远程连接jupyter-notebook

windows内网远程连接jupyter-notebook服务器设置安装jupyter notebookpip install jupyter notebook设置密码ipython>>>from notebook.auth import passwd>>>passwd()输入并确认密码，赋值并保存输出的sha1:....生成jupyte...

2019-05-19 17:12:21 1375

原创第四十一周学习笔记

第四十一周学习笔记论文阅读概述Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering结合Bottom-Up和Top-Down的信息，Bottom-up:Faster RCNN，Top-Down:Hierarchical LSTM两个LSTM，第一个用来做attention，第...

2019-05-17 17:02:44 379

原创 On the Automatic Generation of Medical Imaging Reports

On the Automatic Generation of Medical Imaging Reports原文地址时间：2018年IntroGeneration of medical image reports的困难一份完整的报告包括各种形式的信息，包括finding、tag、impression等，如图所示报告中的非正常区域很难找到报告很长，包括很多句子为了解决以上问题，我们...

2019-05-17 14:36:54 1373 2

原创 Knowledge-Embedded Routing Network for Scene Graph Generation

Knowledge-Embedded Routing Network for Scene Graph Generation原文地址时间：2019年

2019-05-17 14:25:38 1498 2

原创 Graph-Based Global Reasoning Networks

Graph-Based Global Reasoning Networks原文地址时间：2018IntroCNN擅长提取局部关系，但是在处理全局上的区域间关系时显得低效，且需要堆叠很多层才可能完成，如图所示，人类很容易能提取其中的关系，而对CNN来说却很难，因为CNN在最后几层才能有足够大的感受野，为了解决这个问题，本文提出了一个单元来直接进行global relation reasoni...

2019-05-17 14:00:07 4986 1

原创 Multi-Label Image Recognition with Graph Convolutional Networks

Multi-Label Image Recognition with Graph Convolutional Networks原文地址时间：2019年Introduction

2019-05-17 13:46:01 3336 2

原创 Exploring Visual Relationship for Image Captioning

Exploring Visual Relationship for Image Captioning论文地址时间：2018年Intro尽管当前存在很多CNN+RNN模型，还有一个未被充分研究的问题，即如何利用视觉关系来帮助image captioning。本文提出了一个GCN-LSTM模型，如图所示Contribution本文的主要贡献是提出了对视觉关系的使用Model本文通过...

2019-05-17 13:30:20 2566 5

原创 Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering

Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering原文地址时间：2018contribution提出了结合bottom-up和top-down的视觉注意力机制Model给定一张图片III，取其k个salient region的特征Bottom-Up Attention...

2019-05-17 12:04:19 1052

原创第四十周学习笔记

第四十周学习笔记论文阅读Show and Tell: A Neural Image Caption Generator提出了image caption 的 cnn encoder + rnn decoder的基本形式Deep Visual-Semantic Alignments for Generating Image Descriptions目标变为生成caption及其对应...

2019-05-10 21:18:04 235 4

原创 Auto-Encoding Scene Graphs for Image Captioning 论文阅读笔记

Auto-Encoding Scene Graphs for Image Captioning原文地址时间：2018年Introductionend-to-end encoder-decoder模型存在一个问题：当将一张包括未见过的场景输入到网络中时，返回的结果仅仅就是一些显著的object，比如“there is a dog on the floor”，这样的结果与object dete...

2019-05-10 19:13:25 3823 6

原创 Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention原文地址时间：2015年概括提出了两个attention based model that automatically learns to describe the content of images，一个 soft attention 和...

2019-05-10 17:01:52 1206

原创 Image Captioning with Semantic Attention

Image Captioning with Semantic Attention原文地址inspiration当时的image caption的模式有两种top-down，将图片直接转化为词，优点是端到端，缺点是难以提取细节bottom-up，先用几个词描述图片的各个方面，然后将词组合到一起，优点是可以提取细节，缺点是没有端到端的formulation为了结合两者的优点，提出了一个...

2019-05-10 16:46:26 993

原创 Deep Visual-Semantic Alignments for Generating Image Descriptions

Deep Visual-Semantic Alignments for Generating Image DescriptionsTargetgenerates natural language descriptions of images and their regionsChallenge强大的模型来处理图片信息，生成自然语言模型需要摆脱硬编码模板和一些固定模型，仅仅从数据中学习...

2019-05-10 15:59:20 2140 1

原创 Show and Tell: A Neural Image Caption Generator

Show and Tell: A Neural Image Caption GeneratorTargetAutomatically describe the content of an imageDifficultyA description must capture not only the objects contained in an image, but it also must...

2019-05-10 14:54:18 376

原创 BLEU笔记

BLEU翻译总结自论文BLEU是一种自动度量机器翻译（Machine translation）性能的指标基本认识对于两个候选的翻译句Candidate1: It is a guide to action whichensures that the military always obeysthe commands of the party.Candidate2: It is to...

2019-05-05 14:37:14 322

原创第三十九周学习笔记

第三十九周学习笔记RCNN论文阅读笔记原文：Rich feature hierarchies for accurate object detection and semantic segmentation时间：2014年文章解决了什么问题目标检测发展停滞，通过复杂的ensemble模型才能勉强提高一些精度用了什么方法两个 insights在bottom-up region pro...

2019-05-03 09:27:27 158

Peter的脱发日记